---
file_transformation:
  - Applied DSCensor normalizer to file at https://medicago.toulouse.inra.fr/MtrunA17r5.0-ANR/downloads/1.6/MtrunA17r5.0-ANR-EGN-r1.6.gff3.zip creating medtr.A17.gnm5.ann1_6.L2RX.gene_models_main.gff3.gz. No changes were made to any feature coordinates or types. GFF3 is resorted and IDs are prefixed with the standardized prefixing.

changes:
  - 2019-09-26 Added fasta files for cds, mrna, and protein, derived from gff using gffread
  - 2019-09-26 Added cds_primaryTranscript, corresponding with protein_primaryTranscript
  - 2020-10-02 Added gene family assignment files
  - 2022-01-25 adf: completed AHRD, without removing original attributes with which ours may be slightly inconsistent; re-separated repeats out of gene_models_main, using the source field as a guide (EuGene, ope_rescue and smallA all denoted gene-finding approaches mentioned in the paper; everything else was some manner of repeat element identification)
  - 2022-03-05 adf: drop "gene:" from ids in gfa files, to make them match with the gff which seems to have gotten changed; the latter change may need to be revisited since now gene and mRNA IDs are identical
  - 2022-04-06 adf: revisited the problem mentioned above, fixed by adding .1 to mRNA (and ncRNA); also made fasta files consistent with IDs in gff
  - 2022-06-02 adf: dropped :mRNA from protein ids in gfa and iprscan files to match what had been done elsewhere; fixes https://github.com/legumeinfo/datastore-issues/issues/105
  - 2022-07-14 adf: added phytozome_10_2 prefixes to medtr.A17.gnm5.ann1_6.L2RX.phytozome_10_2.HFNR.gfa.tsv.gz; fixes https://github.com/legumeinfo/datastore-issues/issues/112#issuecomment-1184721731
  - 2022-11-07 adf: added .1 suffixes to protein column in both gfa files; fixes https://github.com/legumeinfo/datastore-issues/issues/132
  - 2022-11-09 sc: added cds.bed file, derived from gene_models_main.gff3