Skip to content
Steve Bond edited this page Oct 9, 2017 · 5 revisions

--screw_formats, -sf

Description

Convert between BioPython supported formats. Not all conversions are well supported by BioPython, and there hasn't been much effort yet in SeqBuddy to handle the conversions that don't work. Standard formats should be fine though, like fasta, genbank, nexus, and phylip.

The 'raw' format is specific to SeqBuddy, and strip all information from the sequences (including ids and metadata). All that is returned are the sequences separated by line breaks.

Argument

Format ( str )

The desired output format. The following formats are currently officially supported, and if you encounter any errors reading or writing them, please contact the developers.

  • FASTA
  • GenBank
  • EMBL
  • SeqXML
  • PHYLIP
  • NEXUS
  • Stockholm
  • CLUSTAL
  • FASTQ
  • SwissProt (read only)

Modifier

--in_place/-i

Works the same as all other in place calls, except that the file will be given a new extension to match the new format.

Examples

Input file: Drosophila.nex

#NEXUS
begin data;
	dimensions ntax=4 nchar=315;
	format datatype=protein missing=? gap=-;
matrix
'Dme-Panxδ3' --------GFI----KIDNMVFRCHYRITAI-LFTCCIIVTANNLIGDPISCI--IPMHVINTFCWITYTYTV-----AGPGLE-K-HSYYQWVPFVLFFQGLMFYVPHWVWKM--DGKIRMITGV---DDRDRILKYFVNNT--HNGYSFYFFCELLNFINVIVNIFMVDKFLGGAFMSYGTDVLKFSNMDQD-RFDPMIEIFPRLTKCTFHKFGPSGSVQKHDTLCVLALNILNEKIYIFLWFWFIILATISGVAVLYSVVITRTIRKE----------------GDFLILHFLSQNLSTRSYSDMLQ-----
'Dme-Panxδ2' MDVFGSVKGLLKID-QVDNNVFRMHYKATVIILIAFSLLVTSRQYIGDPIDCIVEIPLGVMDTYCWIYSTFTVPEGRDVQPGSE-KYHKYYQWVCFVLFFQAILFYVPRYLWKSWEGGRLKMLVDLSVNDKDRKIVDYF-GNLNRHNFYAFFFVCEALNFVNVIGQIYFVDFFLDGEFSTYGSDVLKFTELEPDERIDPMARVFPKVTKCTFHKYGPSGSVQTHDGLCVLPLNIVNEKIYVFLWFWFIILSIMS-ISLIYRIAVAPKLRHLLLRARSRAESEVEVAIGDWFLLYQLGKNIDPLIYKEVISDLEMG
'Dme-Panxδ4' MAAVKPLSKYLQFKVHIYDAIFTLHSKVTVALLLACTFLLSSKQYFGDPIQCFGDKD---MDAFCWIYGAYLQCA---VSKVVE-NYITYYQWVVLVLLLESFVFYMPAFLWKIWEGGRLKHLCDFK--RTHRVLVNYF----ETHFRYFVYVFCEILNLSISILNFLLLDVFFGGFWGRY-----------RNALYNQWIAVFPKCAKCEY-KGGPSGSSNIYDYLCLLPLNILNEKIFAFLWIWFILAMLIS-LKFLYRLAVLYPMRLQLLRPKKHLQVALNCSFGDWFVLMRVGNNISPELFRKLLEEL---
'Dme-Panxδ1' YKLLGSLKSYLKWQIQTDNAVFRLHNSFTTVLLLTCSLIITATQYVGQPISCIVGVP-HVVNTFCWIHSTFTMPDRREVHPGVDFKYYTYYQWVCFVLFFQAMACYTPKFLWNKFEGGLMRMIVGLNITRKRDALLDYLIKHVKRHKLYA-YWACEFLCCINIIVQMYLMNRFFDGEFLSYGTNIMKLSDVPQEQRVDPMVYVFPRVTKCTFHKYGPSGSLQKHDSLCILPLNIVNEKTYVFIWFWFWILLVL--LGLVFRCIIFPKFRPRLLNASNRIPMECRLDIGDWWLIYMLGRNLDPVIYKDVMSEFQVP
;
end;

Usage example 1

$: sb Drosophila.nex -sf fasta

Output

>Dme-Panxδ3
--------GFI----KIDNMVFRCHYRITAI-LFTCCIIVTANNLIGDPISCI--IPMHV
INTFCWITYTYTV-----AGPGLE-K-HSYYQWVPFVLFFQGLMFYVPHWVWKM--DGKI
RMITGV---DDRDRILKYFVNNT--HNGYSFYFFCELLNFINVIVNIFMVDKFLGGAFMS
YGTDVLKFSNMDQD-RFDPMIEIFPRLTKCTFHKFGPSGSVQKHDTLCVLALNILNEKIY
IFLWFWFIILATISGVAVLYSVVITRTIRKE----------------GDFLILHFLSQNL
STRSYSDMLQ-----
>Dme-Panxδ2
MDVFGSVKGLLKID-QVDNNVFRMHYKATVIILIAFSLLVTSRQYIGDPIDCIVEIPLGV
MDTYCWIYSTFTVPEGRDVQPGSE-KYHKYYQWVCFVLFFQAILFYVPRYLWKSWEGGRL
KMLVDLSVNDKDRKIVDYF-GNLNRHNFYAFFFVCEALNFVNVIGQIYFVDFFLDGEFST
YGSDVLKFTELEPDERIDPMARVFPKVTKCTFHKYGPSGSVQTHDGLCVLPLNIVNEKIY
VFLWFWFIILSIMS-ISLIYRIAVAPKLRHLLLRARSRAESEVEVAIGDWFLLYQLGKNI
DPLIYKEVISDLEMG
>Dme-Panxδ4
MAAVKPLSKYLQFKVHIYDAIFTLHSKVTVALLLACTFLLSSKQYFGDPIQCFGDKD---
MDAFCWIYGAYLQCA---VSKVVE-NYITYYQWVVLVLLLESFVFYMPAFLWKIWEGGRL
KHLCDFK--RTHRVLVNYF----ETHFRYFVYVFCEILNLSISILNFLLLDVFFGGFWGR
Y-----------RNALYNQWIAVFPKCAKCEY-KGGPSGSSNIYDYLCLLPLNILNEKIF
AFLWIWFILAMLIS-LKFLYRLAVLYPMRLQLLRPKKHLQVALNCSFGDWFVLMRVGNNI
SPELFRKLLEEL---
>Dme-Panxδ1
YKLLGSLKSYLKWQIQTDNAVFRLHNSFTTVLLLTCSLIITATQYVGQPISCIVGVP-HV
VNTFCWIHSTFTMPDRREVHPGVDFKYYTYYQWVCFVLFFQAMACYTPKFLWNKFEGGLM
RMIVGLNITRKRDALLDYLIKHVKRHKLYA-YWACEFLCCINIIVQMYLMNRFFDGEFLS
YGTNIMKLSDVPQEQRVDPMVYVFPRVTKCTFHKYGPSGSLQKHDSLCILPLNIVNEKTY
VFIWFWFWILLVL--LGLVFRCIIFPKFRPRLLNASNRIPMECRLDIGDWWLIYMLGRNL
DPVIYKDVMSEFQVP

Usage example 2

$: sb Drosophila.nex -sf raw

Output

--------GFI----KIDNMVFRCHYRITAI-LFTCCIIVTANNLIGDPISCI--IPMHVINTFCWITYTYTV-----AGPGLE-K-HSYYQWVPFVLFFQGLMFYVPHWVWKM--DGKIRMITGV---DDRDRILKYFVNNT--HNGYSFYFFCELLNFINVIVNIFMVDKFLGGAFMSYGTDVLKFSNMDQD-RFDPMIEIFPRLTKCTFHKFGPSGSVQKHDTLCVLALNILNEKIYIFLWFWFIILATISGVAVLYSVVITRTIRKE----------------GDFLILHFLSQNLSTRSYSDMLQ-----

MDVFGSVKGLLKID-QVDNNVFRMHYKATVIILIAFSLLVTSRQYIGDPIDCIVEIPLGVMDTYCWIYSTFTVPEGRDVQPGSE-KYHKYYQWVCFVLFFQAILFYVPRYLWKSWEGGRLKMLVDLSVNDKDRKIVDYF-GNLNRHNFYAFFFVCEALNFVNVIGQIYFVDFFLDGEFSTYGSDVLKFTELEPDERIDPMARVFPKVTKCTFHKYGPSGSVQTHDGLCVLPLNIVNEKIYVFLWFWFIILSIMS-ISLIYRIAVAPKLRHLLLRARSRAESEVEVAIGDWFLLYQLGKNIDPLIYKEVISDLEMG

MAAVKPLSKYLQFKVHIYDAIFTLHSKVTVALLLACTFLLSSKQYFGDPIQCFGDKD---MDAFCWIYGAYLQCA---VSKVVE-NYITYYQWVVLVLLLESFVFYMPAFLWKIWEGGRLKHLCDFK--RTHRVLVNYF----ETHFRYFVYVFCEILNLSISILNFLLLDVFFGGFWGRY-----------RNALYNQWIAVFPKCAKCEY-KGGPSGSSNIYDYLCLLPLNILNEKIFAFLWIWFILAMLIS-LKFLYRLAVLYPMRLQLLRPKKHLQVALNCSFGDWFVLMRVGNNISPELFRKLLEEL---

YKLLGSLKSYLKWQIQTDNAVFRLHNSFTTVLLLTCSLIITATQYVGQPISCIVGVP-HVVNTFCWIHSTFTMPDRREVHPGVDFKYYTYYQWVCFVLFFQAMACYTPKFLWNKFEGGLMRMIVGLNITRKRDALLDYLIKHVKRHKLYA-YWACEFLCCINIIVQMYLMNRFFDGEFLSYGTNIMKLSDVPQEQRVDPMVYVFPRVTKCTFHKYGPSGSLQKHDSLCILPLNIVNEKTYVFIWFWFWILLVL--LGLVFRCIIFPKFRPRLLNASNRIPMECRLDIGDWWLIYMLGRNLDPVIYKDVMSEFQVP

Usage example 3

$: sb Drosophila.nex -sf gb -i

Output

File over-written at:
/home/Documents/BuddySuite/workshop/test_files/Drosophila.gb

Main Toolkit Pages





Further Reading

Clone this wiki locally