Created by: stephenroller
Patch Description
We've been training on many different corpora, using symbolic links to assemble each mixture. Because the caching system builds the line indexes next to the symlink rather than next to the file it points to, we end up rebuilding the indexes every time we create a new mix of corpora.
This patch builds the indexes next to the original data, not the symlink. This helps re-use indexes across distinct runs.
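As a rough illustration of the idea (not the code in this patch), the sketch below resolves a symlink to its target before deciding where the index lives. The function name `index_path_for` and the `.idx` suffix are hypothetical, chosen only for this example; the only real API used is `os.path.realpath` from the standard library.

```python
import os
import tempfile


def index_path_for(data_path: str, suffix: str = ".idx") -> str:
    """Return a location for the line index next to the real data file.

    `suffix` is a made-up extension for illustration; the actual cache
    naming scheme may differ.
    """
    # realpath follows symlinks, so every link to the same corpus file
    # maps to the same index location.
    real_path = os.path.realpath(data_path)
    return real_path + suffix


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        corpus = os.path.join(tmp, "corpus.txt")
        with open(corpus, "w") as f:
            f.write("line one\nline two\n")

        # Two different "mixes" that both link to the same corpus file.
        link_a = os.path.join(tmp, "mix_a.txt")
        link_b = os.path.join(tmp, "mix_b.txt")
        os.symlink(corpus, link_a)
        os.symlink(corpus, link_b)

        # Both links resolve to one shared index path.
        print(index_path_for(link_a) == index_path_for(link_b))
```

With this kind of resolution, a new mixture that links to an already indexed corpus can find the existing index instead of building a fresh one.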
Testing steps
I have been using this patch for over a week.