--- file_transformation: - Prefixed amino acid fasta files with glyso.PI483463.gnm1.ann1. and removed .p suffix: - perl -pi -e 's/>(\S+).p />glyso.PI483463.gnm1.ann1.$1 /' *.faa - Prefixed gene IDs in nucleotide fasta: - perl -pi -e 's/>(\S+) />glyso.PI483463.gnm1.ann1.$1 /' *fna - Added chromosome prefix to GFFs: - perl -pi -e 's/^([^#]\S+)/glyso.PI483463.gnm1.$1/' *gff3 - Added gene prefixes to GFFs (also replacing version suffixes): - perl -pi -e 's/\.v1\.1//g; s/=GlysoPI483463./=glyso.PI483463.gnm1.ann1.GlysoPI483463./g' *gff3 changes: - 2018-10-04 initial Legume Federation Data Store file preparation - 2018-10-10 Applied prefixes to fasta and GFF files - see file_transformations above. - 2020-09-22 added gene family assignments - 2021-05-25 updated gene family assignments to use score rather than e-value - 2022-03-05 adf: corrected truncated gene id in gfa file - 2024-04-11 sc: update synopsis to include JGI numbering - 2024-11-26 sc: Remove gratuitous prefix from Name attribute: s/Name=glyso.PI483463.gnm1.ann1./Name=/ - 2024-12-10 sc: For consistency with other ds_souschef collections, remove Phytozome descriptions, starting from pacid=, from sequence files - 2025-04-16 adf: add glyso.PI483463.gnm1.ann1.3Q3Q.legume.fam3.VLMQ.gfa.tsv.gz