Saturday, December 26, 2009

Perfect Syncopation

Look at this:
18204 total word(s)
17369 word(s) found
20 word(s) not found
815 word(s) ignored
0.11% of words not found
4.48% of words ignored
3264 unique word(s)
But what does it mean???

Well, I've just run the word analysis tool on Livy's Ab Urbe Condita, Book 2. The important thing to note is that out of roughly eighteen thousand words, only 20 could not be parsed and matched to a dictionary entry. That's pretty much amazing.
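For the curious, here is a quick sketch of how the two percentages follow from the raw counts. This is only an illustration of the arithmetic, not the tool's actual code; the function name `summarize` and the dictionary keys are made up for this example.

```python
def summarize(total, found, not_found, ignored):
    """Return the two percentages from the report, rounded to two decimals."""
    # Sanity check: the three buckets should account for every word.
    if found + not_found + ignored != total:
        raise ValueError("counts do not add up to the total")
    return {
        "pct_not_found": round(100 * not_found / total, 2),
        "pct_ignored": round(100 * ignored / total, 2),
    }


# Counts taken from the report above.
stats = summarize(total=18204, found=17369, not_found=20, ignored=815)
print(stats)
```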

How did this happen? Well, two things had to happen. First, I now ignore capitalized words that aren't located in the dictionary; essentially, I'm ignoring proper names and place names. Second, I programmed Numen to parse syncopated perfect verb forms: laudasse (laudavisse), norat (noverat), et cetera.
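To give a rough idea of the two steps, here is a minimal sketch. It is not Numen's actual code: the names `expand_syncopated`, `resolve`, `classify` and the tiny `KNOWN_FORMS` set are all hypothetical, and the rule is a simplification that assumes a syncopated perfect dropped "vi" before s or "ve" before r. Lowercasing before the lookup is also my assumption here.

```python
# Hypothetical stand-in for the real dictionary of parsable forms.
KNOWN_FORMS = {"laudavisse", "noverat"}


def expand_syncopated(form):
    """Guess the full forms a syncopated perfect might stand for.

    After a stem vowel, try restoring "vi" before s (laudasse -> laudavisse)
    and "ve" before r (norat -> noverat). Returns every candidate; the
    caller decides which ones are real words.
    """
    candidates = []
    for i in range(1, len(form)):
        prev, nxt = form[i - 1], form[i]
        if prev in "aeio":
            if nxt == "s":
                candidates.append(form[:i] + "vi" + form[i:])
            elif nxt == "r":
                candidates.append(form[:i] + "ve" + form[i:])
    return candidates


def resolve(form, known):
    """Return the known form that `form` matches, directly or once expanded."""
    if form in known:
        return form
    for candidate in expand_syncopated(form):
        if candidate in known:
            return candidate
    return None


def classify(word, known):
    """Sort a word into the report's buckets: found, ignored, or not found."""
    if resolve(word.lower(), known) is not None:
        return "found"
    # Capitalized and not in the dictionary: treat it as a proper name.
    if word[:1].isupper():
        return "ignored"
    return "not found"


print(resolve("laudasse", KNOWN_FORMS))
print(classify("Porsenna", KNOWN_FORMS))
```

Generating candidates and then checking them against the dictionary keeps the rule loose on purpose: a bad guess simply fails the lookup instead of producing a false match.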

I still have a bit of testing to do to make sure I didn't break anything, but this was one of the few major hurdles that I needed to overcome to get a nearly perfect parsing engine!



Friday, March 13, 2009

Verb Paradigms

As I promised, I've been working on verb paradigms. As of now, they're up and running. Just search for any Latin word and click "see the complete paradigm".

Most verbs show up just fine, but of course some irregular verbs will show odd glitches. Therefore the data is "beta" but the paradigms should still prove helpful.

Here are some caveats:
  • certain irregular verbs will have weird forms, for instance, the participles for esse (which didn't exist until late antiquity).
  • deponent verbs will show active forms. Remember that deponent verbs do have active participles, and the imperfect subjunctive is formed from the "reconstructed" active infinitive. I'm trying to imagine a way to "gray-out" the unused active forms, but I haven't decided fully on that yet.
  • as a result of deponent verbs having "active" forms, they are now stored in the dictionary in their active forms, although on flashcards they will still show their deponent forms. So for instance sequor will be searchable under sequo.
  • unusual forms, such as dic, duc, and fac, will show up as dice, duce, and face. I haven't implemented an "irregular forms" system yet, even though I've half mapped it out. UPDATE: It turns out that Plautus was fond of using forms like dice, duce, and face even though they were later rejected by Terence.
  • UPDATE: Some forms which are not known to exist (in other words, we don't have a record of them) but can logically be deduced will show up on the paradigm charts. For instance, the rare future active participle of volo, voliturus, shows up, and so does its non-extant future active infinitive, voliturus esse. Many grammar books will not show these forms simply because we don't have a record of them. Nonetheless, it is logical to assume they existed or would have been known to exist during Roman times (at least in theory).
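As for the dic, duc, and fac caveat, one way an "irregular forms" system could work is an override table that is checked before the regular rule. This is only a sketch of the idea, not what I've mapped out for the site; the function names and the `IRREGULAR_IMPERATIVES` table are hypothetical, and the regular rule here is simplified to "drop -re from the infinitive".

```python
# Hypothetical override table: infinitive -> irregular singular imperative.
IRREGULAR_IMPERATIVES = {
    "dicere": "dic",
    "ducere": "duc",
    "facere": "fac",
}


def regular_imperative_sg(infinitive):
    """Build the singular imperative by the regular rule (drop final -re)."""
    if not infinitive.endswith("re"):
        raise ValueError("expected a present active infinitive ending in -re")
    return infinitive[:-2]


def imperative_sg(infinitive):
    """Prefer a listed irregular form; otherwise fall back to the regular rule."""
    return IRREGULAR_IMPERATIVES.get(infinitive, regular_imperative_sg(infinitive))


print(regular_imperative_sg("dicere"))  # what the paradigm shows today
print(imperative_sg("dicere"))          # what an override would show
```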
So, work continues! Enjoy.



Monday, August 11, 2008

Paradigm Updates

I spent a few minutes today adding the demonstrative pronouns is, iste and idem, along with the paradigms for the R-type possessive pronouns noster and vester/voster. I also added the three defective verbs coepi, odi and memini. Finally, I added the eo and ferre verb conjugations.



Wednesday, August 6, 2008

Prettying up the Joint

So I haven't had as much free time as I wanted this week. I've been busy at my day job getting everything "taken care of" before my last day there on August 13th. After that, I'll just be contracting with them for 5 hours a week, because I'll be a full-time teaching and grading assistant for the University of New Mexico Foreign Languages and Literatures Department. I'm pretty excited about that!

Nevertheless, I had a few hours free tonight, so I did some sprucing up. I made some icons, fixed some style sheets and squashed some small bugs. There are a few things I want to include before the semester starts:
  • Add a few pronoun paradigms: is and iste for sure.
  • Add some verb paradigms: perhaps volo verbs.
  • Fix up the database backend, especially in the realm of update cascades (it's technical, and you're probably wondering what that means -- don't worry, it'll make things better).
  • Speed up the morphology lookup. It's not slow by any means, coming in at approximately 100 milliseconds per word. But still, I think I can get it down to 40ms. Every bit helps, especially if this site ever gets popular!
  • Make a new database and web server. Right now it's being graciously hosted at the place I work here on campus (Natural Heritage New Mexico). I've been the system admin there for about 5 years, but now -- as I wrote earlier in this post -- I won't be there for very much longer.
I'm sure there's more to update, but I think it's time to take a break for the evening. Enjoy!
