Collections:
Read Sequence Alignments with Bio.AlignIO
How to Read Sequence Alignments with Bio.AlignIO package?
✍: FYIcenter.com
Bio.AlignIO module allows you to read and write Sequence Alignments
as MultipleSeqAlignment objects.
Enter the following sequence alignment file, PF05371_seed.faa, in FASTA format.
>COATB_BPIKE/30-81 AEPNAATNYATEAMDSLKTQAIDLISQTWPVVTTVVVAGLVIRLFKKFSSKA >Q9T0Q8_BPIKE/1-52 AEPNAATNYATEAMDSLKTQAIDLISQTWPVVTTVVVAGLVIKLFKKFVSRA >COATB_BPI22/32-83 DGTSTATSYATEAMNSLKTQATDLIDQTWPVVTSVAVAGLAIRLFKKFSSKA >COATB_BPM13/24-72 AEGDDP---AKAAFNSLQASATEYIGYAWAMVVVIVGATIGIKLFKKFTSKA >COATB_BPZJ2/1-49 AEGDDP---AKAAFDSLQASATEYIGYAWAMVVVIVGATIGIKLFKKFASKA >Q9T0Q9_BPFD/1-49 AEGDDP---AKAAFDSLQASATEYIGYAWAMVVVIVGATIGIKLFKKFTSKA >COATB_BPIF1/22-73 FAADDATSQAKAAFDSLTAQATEMSGYAWALVVLVVGATVGIKLFKKFVSRA
Read the above sequence alignment file with the Bio.AlignIO.read() function.
fyicenter$ python
>>> from Bio import AlignIO
>>> alignment = AlignIO.read("PF05371_seed.faa", "fasta")
>>>
>>> print(alignment)
Alignment with 7 rows and 52 columns
AEPNAATNYATEAMDSLKTQAIDLISQTWPVVTTVVVAGLVIRL...SKA COATB_BPIKE/30-81
AEPNAATNYATEAMDSLKTQAIDLISQTWPVVTTVVVAGLVIKL...SRA Q9T0Q8_BPIKE/1-52
DGTSTATSYATEAMNSLKTQATDLIDQTWPVVTSVAVAGLAIRL...SKA COATB_BPI22/32-83
AEGDDP---AKAAFNSLQASATEYIGYAWAMVVVIVGATIGIKL...SKA COATB_BPM13/24-72
AEGDDP---AKAAFDSLQASATEYIGYAWAMVVVIVGATIGIKL...SKA COATB_BPZJ2/1-49
AEGDDP---AKAAFDSLQASATEYIGYAWAMVVVIVGATIGIKL...SKA Q9T0Q9_BPFD/1-49
FAADDATSQAKAAFDSLTAQATEMSGYAWALVVLVVGATVGIKL...SRA COATB_BPIF1/22-73
⇒ Calculate Substitutions in Alignments
⇐ Scan Prosite Databas with Bio.ExPASy.ScanProsite.scan()
2023-09-05, 850🔥, 0💬
Popular Posts:
How to download and install PyMol Open Source edition on macOS? If you want to try the open source e...
Molecule Summary: ID: FYI-1003951 Names: InChIKey: FAUPJUJQOJXWNE-UHFFFAOYS A-NSMILES: CCc4nc(C(N)=O...
How to generate class path for dependences with Maven? If you run your application JAR file generate...
Molecule Summary: ID: FYI-1003809 Names: InChIKey: BQCVLYJBPWSIIR-UHFFFAOYS A-NSMILES: C1OCOCOCOCOCO...
Molecule Summary: ID: FYI-1004077 Names: InChIKey: KGHUQBINMJRPLK-UHFFFAOYS A-NSMILES: O=c4c(=O)c3c(...