Text Mining BootCamp

October 8th, 2010

Got a lot of digitized text? Not sure what to do with it? Try text mining!

I’d like to hold a Text Mining BootCamp for those interested in using computers to extract information from raw text. Starting one level above messy OCR, I will first introduce the terminology and the possibilities: when we talk about the “information in text,” what do we mean? What kinds of things has computational linguistics made it possible to extract from words, sentences, and document collections? To make it concrete, I will work with example scholarly questions from real humanists, and show how they are translated into computational terms.

Then the tools: I will introduce and demonstrate text mining toolkits accessible to scholars with no programming experience, and touch upon other tools suited to more experienced programmers.


4 Responses to “Text Mining BootCamp”

  1. Viola Lasmana on October 8, 2010 7:48 pm

    This sounds like exactly what I’m looking for!

  2. Carl Schmitz on October 8, 2010 9:15 pm

    Sounds perfect. I work with a wide variety of digital media and would be interested in generating better research query responses from scanned texts.

  3. Jeremy Kahn on October 9, 2010 12:12 am

    I’ll show up for this. I might even be able to chip in and offer some pointers to neat NLP and comp-ling research questions!

  4. » Things Learned About Natural Language Processing at THATcamp Bay Area MaisonBisson.com on October 10, 2010 7:36 pm

    […] first session I joined at THATcamp was Aditi Muralidharan‘s text mining boot camp, and the topic seemed to set my agenda for the rest of the event (though I wish Aditi had […]

