-
Notifications
You must be signed in to change notification settings - Fork 50
/
ReleaseNotes
142 lines (126 loc) · 6.66 KB
/
ReleaseNotes
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
4.1.7-p1
- This is a critical fix to the 4.1.7 release. The -species option
was not being interpreted correctly by RepeatMasker/ProcessRepeats.
The most obvious change for mammals/primates was that the output
format (*.tbl file) looked different than it had before. Under the
hood this also changed the way some species were handled in the
library searches.
4.1.7
- Handle the case were the default search engine is not configured
and require the user to provide it using -e/-engine.
- Use the global alignment bandwidth simulation for the refinement
step.
- Allow for minimal database install and warn when used with -species.
- Fixed a bug on configure with setting the default search engine when
only one search engine is configured.
- RepeatProteinMask wasn't obtaining the correct search engine if
configure didn't set it up correctly.
- FamDB wasn't exporting RepeatMasker.lib with class-name suffixes.
This had an impact on the RepeatModeler/RepeatClassifier package
but not on RepeatMasker itself. Upgraded to FamDB 1.0.5
- Fixed an issue with parsing RepBase. Some records were not being
parsed correctly for records that do not include the keyword "repbase"
following the identifier.
- New semantantics for NCBIBlastSearchEngine bandwidth interpretation.
- Fixed a divide-by-zero error in ProcessRepeat's merging of
DNA tranpsoson overlapping fragments.
- FamDB fixed inconsistency between curated/uncurated counts vs
exports.
4.1.6
- Upgraded to FamDB 1.0.2 to support Dfam 3.8 and the new partitioned
database format.
- Added Libraries/RMRB_spec_to_tax.json to project. This maps the
RepBase taxanames to current NCBI tax_ids and needs to be refreshed
with each new Dfam release.
- Added softmasking support to NCBIBlastSearchEngine.pm.
- Added new '--uncurated' flag to handle single export Dfam format.
If this flag is used the CONS/HMM cached directories will be suffixed
with "_wunc".
- Fixed sunk error messages from famdb.py. Now they will be displayed
and cause RepeatMasker to quit.
- Additional library setup steps and error checking for configure utility.
- CAF documentation in SearchResult.
- calcDivergenceFromAlign clarified use of "-a" in documentation.
4.1.5
- Updated codebase for Dfam 3.7 compatibility (famdb format 4.3)
- Penelope classification change caused *.tbl file accounting
to place them in the Unknown category. Also fixed landscape
generation tool.
- Added a new utility to merge *.out *.align files generated by
running RepeatMasker serially.
- Repbase Metadata is seriously out-of-date, updated species names
so that they match the current NCBI Taxonomy names.
- Fixed an issue with the HMM parser. It wasn't recognizing
negative values for Tau with models that do not have GA thresholds.
4.1.4
- Added support for RMBlast 2.13.0
- Release of the TE genome browser visualization (UCSC) and trackhub
generation tool.
- New CpGSites and unadjusted Kimura stats in the *.align file.
- Fixed a bug that caused the read-only state of the input fasta file
to propogate to the intermediate files and cause the program to exit.
- Removed DateRepeats as it's based on old library formats - this functionality
will return with the refactored version of RM in the works.
4.1.3-p2
- A recent change in 4.1.3 to correct blank fragment ID fields can in rare
cases causing the error message: 'Can't call method "setLeftLinkedHit"'.
- The RepeatAnnotationData.pm file containing necessary information for
recognizing equivalent fragments of DNA transposons was missing data.
- The MULE-MuDR class was added to the *.tbl file for "-lib" searches.
4.1.3
- A new utility for generating trackHubs for our new UCSC TE visualization
- Fix a bug where killing RM while starting up can leave the cached libraries
in an inconsistent state.
- Fixed a bug where in rare cases the joined fragment ID field is blank
- Merged in changes to Dupmasker supporting multi-threaded use
- Fixed legacy RepBase taxonomic labels
- Added support for GFF v3 output and fixed the utility/rmOutToGFF3.pl
4.1.2-p1
- Releases 4.1.1-4.1.2 contained a bug with the processing of Alu sequences
in primates. The step where an initial annotation is refined into a
particular Alu subfamily was not performed and the annotations remained
labeled with the initial capture sequence ( AluJb, AluSx, or AluY ).
This patch release fixes this one issue.
4.1.2
- Fixed 21 protein family classifications in RepeatProteinLib.
- Fixed a problem with the generation of the RepeatMasker.lib file for use by
RepeatModeler. In release 4.1.1 it did not add the classification info to
this auxilary file.
- Fixed a "log(0)" error that can cause the program to fault in rare circumstances.
- buildSummary now supports FamDB and has improved documentation.
- Bugfixes and improvements to FamDB.
4.1.1
- Dfam (starting with version 3.2) is now distributed in the FamDB file format
based on HDF5, which has improved support for large datasets compared to the
EMBL and HMM formats that were previously used. RepeatMasker therefore includes
a copy of famdb.py, and depends on the python package h5py.
o The 'configure' script and other parts of RepeatMasker have been updated to
accomodate these changes.
o The utilities 'queryTaxonomyDatabase.pl' and 'queryRepeatDatabase.pl' are no
longer included, since that data is now included in FamDB. The 'famdb.py'
tool can be used to make many of the same queries as the removed utilies,
and even more.
4.1.0
- Refactored the RepeatMasker configuration system.
The changes make it easier to automate installation and/or
bundle RepeatMasker.
4.0.9-p2
- Another improvement in TRF error handling
4.0.9-p1 : Tue Apr 16 08:51:22 PDT 2019
- Input files containing multiple FASTA sequences
caused RepeatMasker to error out with a message
like:
"WARNING: TRF returned an error (Return code = ### )
TRF parameters: 2.7.7.80.10.50.10
A search phase could not complete on this batch.
The batch file will be re-run and if possible the
program will resume.
WARNING: Retrying batch ( 1 ) [ 255,, 195]..."
This bug was introduced in an attempt to improve TRF
error catching. Unfortunatly the return codes are
not documented for TRF and the assumption that 256
is the only successful return code is wrong. The
"success" code appears to change depending on the
number of sequences in the file. The workaround is
to fail only if there is a message in the error output
file.