Collections:
Read Sequence Alignments with Bio.AlignIO
How to Read Sequence Alignments with Bio.AlignIO package?
✍: FYIcenter.com
Bio.AlignIO module allows you to read and write Sequence Alignments
as MultipleSeqAlignment objects.
Enter the following sequence alignment file, PF05371_seed.faa, in FASTA format.
>COATB_BPIKE/30-81 AEPNAATNYATEAMDSLKTQAIDLISQTWPVVTTVVVAGLVIRLFKKFSSKA >Q9T0Q8_BPIKE/1-52 AEPNAATNYATEAMDSLKTQAIDLISQTWPVVTTVVVAGLVIKLFKKFVSRA >COATB_BPI22/32-83 DGTSTATSYATEAMNSLKTQATDLIDQTWPVVTSVAVAGLAIRLFKKFSSKA >COATB_BPM13/24-72 AEGDDP---AKAAFNSLQASATEYIGYAWAMVVVIVGATIGIKLFKKFTSKA >COATB_BPZJ2/1-49 AEGDDP---AKAAFDSLQASATEYIGYAWAMVVVIVGATIGIKLFKKFASKA >Q9T0Q9_BPFD/1-49 AEGDDP---AKAAFDSLQASATEYIGYAWAMVVVIVGATIGIKLFKKFTSKA >COATB_BPIF1/22-73 FAADDATSQAKAAFDSLTAQATEMSGYAWALVVLVVGATVGIKLFKKFVSRA
Read the above sequence alignment file with the Bio.AlignIO.read() function.
fyicenter$ python
>>> from Bio import AlignIO
>>> alignment = AlignIO.read("PF05371_seed.faa", "fasta")
>>>
>>> print(alignment)
Alignment with 7 rows and 52 columns
AEPNAATNYATEAMDSLKTQAIDLISQTWPVVTTVVVAGLVIRL...SKA COATB_BPIKE/30-81
AEPNAATNYATEAMDSLKTQAIDLISQTWPVVTTVVVAGLVIKL...SRA Q9T0Q8_BPIKE/1-52
DGTSTATSYATEAMNSLKTQATDLIDQTWPVVTSVAVAGLAIRL...SKA COATB_BPI22/32-83
AEGDDP---AKAAFNSLQASATEYIGYAWAMVVVIVGATIGIKL...SKA COATB_BPM13/24-72
AEGDDP---AKAAFDSLQASATEYIGYAWAMVVVIVGATIGIKL...SKA COATB_BPZJ2/1-49
AEGDDP---AKAAFDSLQASATEYIGYAWAMVVVIVGATIGIKL...SKA Q9T0Q9_BPFD/1-49
FAADDATSQAKAAFDSLTAQATEMSGYAWALVVLVVGATVGIKL...SRA COATB_BPIF1/22-73
⇒ Calculate Substitutions in Alignments
⇐ Scan Prosite Databas with Bio.ExPASy.ScanProsite.scan()
2023-09-05, 998🔥, 0💬
Popular Posts:
Molecule Summary: ID: FYI-1001429 SMILES: N#CC1C=NC(=CC=1)C2=C(N=C C=N2)[C@H](C)NC(=O)C3C=C (C=C(OC(F)...
What are SMILES string representations for reactions? SMILES (Simplified Molecular Input Line Entry ...
Molecule Summary: ID: FYI-1002033 Names: InChIKey: TZIHFWKZFHZASV-UHFFFAOYS A-OSMILES: C/[O+]=C\\O R...
Molecule Summary: ID: FYI-1002129 Names: InChIKey: IZFHEQBZOYJLPK-UHFFFAOYS A-NSMILES: O=C(O)CCCCC(S...
Molecule Summary: ID: FYI-1005703 Names: InChIKey: SSIGXLVDPINUCZ-XPKAQORNS A-NSMILES: CC#CCNC(=O)C2...