Read Sequence Alignments with Bio.AlignIO
How do I read sequence alignments with the Bio.AlignIO module?
✍: FYIcenter.com
The Bio.AlignIO module lets you read and write sequence alignments
as MultipleSeqAlignment objects.
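This tutorial only demonstrates reading, so here is a minimal sketch of the write side. It assumes Biopython is installed, keeps two illustrative rows in memory instead of using a file on disk, and picks the "clustal" output format purely as an example.

```python
from io import StringIO
from Bio import AlignIO

# Two aligned rows held in memory (illustrative data, not a file on disk).
fasta_text = """>COATB_BPIKE/30-81
AEPNAATNYATEAMDSLKTQAIDLISQTWPVVTTVVVAGLVIRLFKKFSSKA
>COATB_BPM13/24-72
AEGDDP---AKAAFNSLQASATEYIGYAWAMVVVIVGATIGIKLFKKFTSKA
"""

# AlignIO.read() accepts a file name or an open handle.
alignment = AlignIO.read(StringIO(fasta_text), "fasta")

# AlignIO.write() returns the number of alignments written.
out_handle = StringIO()
count = AlignIO.write(alignment, out_handle, "clustal")
clustal_text = out_handle.getvalue()
print(clustal_text)

# Read the Clustal text back to confirm the round trip keeps the shape.
round_trip = AlignIO.read(StringIO(clustal_text), "clustal")
print(len(round_trip), round_trip.get_alignment_length())
```

To write to a real file, pass a file name such as "PF05371_seed.aln" (a hypothetical name) in place of the StringIO handle.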
Save the following sequence alignment, in FASTA format, as a file named PF05371_seed.faa.
>COATB_BPIKE/30-81
AEPNAATNYATEAMDSLKTQAIDLISQTWPVVTTVVVAGLVIRLFKKFSSKA
>Q9T0Q8_BPIKE/1-52
AEPNAATNYATEAMDSLKTQAIDLISQTWPVVTTVVVAGLVIKLFKKFVSRA
>COATB_BPI22/32-83
DGTSTATSYATEAMNSLKTQATDLIDQTWPVVTSVAVAGLAIRLFKKFSSKA
>COATB_BPM13/24-72
AEGDDP---AKAAFNSLQASATEYIGYAWAMVVVIVGATIGIKLFKKFTSKA
>COATB_BPZJ2/1-49
AEGDDP---AKAAFDSLQASATEYIGYAWAMVVVIVGATIGIKLFKKFASKA
>Q9T0Q9_BPFD/1-49
AEGDDP---AKAAFDSLQASATEYIGYAWAMVVVIVGATIGIKLFKKFTSKA
>COATB_BPIF1/22-73
FAADDATSQAKAAFDSLTAQATEMSGYAWALVVLVVGATVGIKLFKKFVSRA
Read the above sequence alignment file with the Bio.AlignIO.read() function, which expects the file to contain exactly one alignment.
fyicenter$ python
>>> from Bio import AlignIO
>>> alignment = AlignIO.read("PF05371_seed.faa", "fasta")
>>>
>>> print(alignment)
Alignment with 7 rows and 52 columns
AEPNAATNYATEAMDSLKTQAIDLISQTWPVVTTVVVAGLVIRL...SKA COATB_BPIKE/30-81
AEPNAATNYATEAMDSLKTQAIDLISQTWPVVTTVVVAGLVIKL...SRA Q9T0Q8_BPIKE/1-52
DGTSTATSYATEAMNSLKTQATDLIDQTWPVVTSVAVAGLAIRL...SKA COATB_BPI22/32-83
AEGDDP---AKAAFNSLQASATEYIGYAWAMVVVIVGATIGIKL...SKA COATB_BPM13/24-72
AEGDDP---AKAAFDSLQASATEYIGYAWAMVVVIVGATIGIKL...SKA COATB_BPZJ2/1-49
AEGDDP---AKAAFDSLQASATEYIGYAWAMVVVIVGATIGIKL...SKA Q9T0Q9_BPFD/1-49
FAADDATSQAKAAFDSLTAQATEMSGYAWALVVLVVGATVGIKL...SRA COATB_BPIF1/22-73
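The printed summary truncates each row, so it can help to inspect the MultipleSeqAlignment object directly. The following is a minimal sketch, assuming Biopython is installed. It loads two rows of the file above from an in-memory string so that it runs without PF05371_seed.faa on disk.

```python
from io import StringIO
from Bio import AlignIO

# Two rows copied from PF05371_seed.faa, kept in memory for this sketch.
fasta_text = """>COATB_BPIKE/30-81
AEPNAATNYATEAMDSLKTQAIDLISQTWPVVTTVVVAGLVIRLFKKFSSKA
>COATB_BPM13/24-72
AEGDDP---AKAAFNSLQASATEYIGYAWAMVVVIVGATIGIKLFKKFTSKA
"""

alignment = AlignIO.read(StringIO(fasta_text), "fasta")

# Number of rows (sequences) and columns (alignment length).
n_rows = len(alignment)
n_cols = alignment.get_alignment_length()
print(n_rows, "rows,", n_cols, "columns")

# Each row is a SeqRecord with an id and a gapped sequence.
for record in alignment:
    print(record.id, len(record.seq))

# A single column comes back as a plain string, one character per row.
first_column = alignment[:, 0]

# A column range comes back as a smaller MultipleSeqAlignment.
head = alignment[:, :6]
print(head)

# Gap characters are kept as part of the row's sequence.
gap_region = str(alignment[1].seq[6:9])
print(gap_region)
```

With the full file, replace the StringIO handle with the file name "PF05371_seed.faa", as in the session above.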
2023-09-05