--- file_transformation: - Prefixed fasta files with glyma.Wm82.gnm2. and removed .p suffix: - perl -pi -e 's/>(\S+).p* />glyma.Lee.gnm1.ann1.$1 /' *.faa - Prefixed gene IDs in nucleotide fasta: - perl -pi -e 's/>(\S+) />glyma.Lee.gnm1.ann1.$1 /' *fna - Added chromosome prefix to GFFs: - perl -pi -e 's/^([^#]\S+)/glyma.Lee.gnm1.ann1.$1/' *gff3 - Added gene prefixes to GFFs (also replacing version suffixes): - perl -pi -e 's/\.v1\.1//g; s/=GlymaLee./=glyma.Lee.gnm1.ann1.GlymaLee./g' *gff3 - perl -pi -e 's/^glyma.Lee.gnm1.ann1./glyma.Lee.gnm1./' *gff3 changes: - 2018-10-04 initial Legume Federation Data Store file preparation - 2018-10-10 Applied prefixes to fasta and GFF files - see file_transformations above. - 2019-06-28 Removed ".ann1" from reference (chromosome) IDs in GFF files - see file_transformations above. - 2020-04-13 Sorted GFF, i.e. gff3sort.pl --precise glyma.Lee.gnm1.ann1.6NZV.gene_models_main.gff3 - 2020-09-22 added gene family assignments - 2021-05-25 updated gene family assignments to use score rather than e-value - 2023-02-26 adf: resorted gff using datastore-specifications/scripts/special_or_deprecated/sort_gff.pl to fix the issue reported for this specific genome in https://github.com/legumeinfo/datastore-issues/issues/146 - 2024-04-11 sbc: update synopsis to include JGI numbering