Wednesday, October 14, 2009
J's and U's Updated / Speed Increases
My main motivation for making this update is because certain passages stored in The Latin Library reflect the older conventions of using J's for consonantal I's or U's for both consonantal and vocalic V's. Numen's parsing engine was having trouble recognizing forms like jecit (iecit) and uuius (vivus). So now as a result -- after a bit of work -- the engine is updated and now recognizes more possibilities than ever. Incidentally, internally J's are stored as I's and U's are stored as V's.
Another project I completed at the same time is an order-of-magnitude speed improvement for parsing. I was trying to figure out ways to make the engine faster and I discovered a shortcut that boosts speed tremendously. When parsing a word, the engine used to spend between 250ms and 500ms parsing each word! That was always disappointing to me, but I had gotten around the problem by caching the results. Now, however, word parsing takes about 25ms!
Why bother improving the speed? Because soon I will be implementing word lists and frequency lists! A word list, of course, is just a "mini-lexicon" that defines only the words in your chosen passage, and a frequency list is a list of words in order of how often they appear in a passage. The word list will be helpful to quickly work on vocabulary for a passage, and a frequency list will help Latin students study more effectively by giving them the most frequent words first. I'm very excited about this feature, but I don't anticipate it will be done before January 10th (giving me the winter holiday to work on it).
That's all for now!
Labels: accuracy, database, development, features, frequency lists, google cache, orthography, parsing engine, performance, slowness, vergil, word lists
Monday, March 16, 2009
Saving the Latinitas Recens Lexicon (Update)
I'm sad to see that it has gone missing from the web. Unfortunately, the Google Cache has also expired. Since this very important resource is in danger of extinction, I took the liberty of mirroring the Latin-English portion of the site (including sigla). I will keep the page posted until the original maintainer (Florus) can re-upload his version.
Here is the link: Latinitas Recens (Speculum)
Seeing as how one group-member suggested a one-page version, that's what this mirror is. Keep in mind it's about 430K.
Two notes: I am missing pages 8 and 9 (no longer in cache, not in archive). If you have them, please email them to me so I can include them. Also, I will do the English-Latin as time permits.
I hope you all find it useful. Feedback is appreciated.
Update: Thanks go to Rodericius who has generously provided page 9.
Labels: google cache, latin, latinitas, lexicon, mirror, modern, neolatin, recens, web archive
