These are the main things we’re working on at the moment:

  • text digitization
    • History of Middle-earth (initial digitization of vols 1–9 done)
  • general markup
  • referencing/citation systems
  • initial modelling of named entities in Lord of the Rings
  • initial modelling of direct speech in Lord of the Rings and the Hobbit
  • initial modelling of time indicators in Lord of the Rings and the Hobbit along with visualizations of narrative time (Mythmoot VIII talk)
  • initial sentence tokenization, lemmatization, and dependency analysis of the Hobbit and Lord of the Rings
  • term-document matrices and other related analyses
    • The Hobbit, Lord of the Rings, and the Silmarillion (all mostly done)
    • see also TF-IDF Demo

Secondarily:

Also related, see: