Text Mining BootCamp
Got a lot of digitized text? Not sure what to do with it? Try text mining!
I’d like to hold a Text Mining BootCamp for those interested in using computers to extract information from raw text. Starting one level above messy OCR, I will first introduce the terminology and the possibilities: when we talk about the “information in text,” what do we mean? What kinds of things has computational linguistics made it possible to extract from words, sentences, and document collections? To make this concrete, I will work with example scholarly questions from real humanists and show how they are translated into computational terms.
Then the tools: I will introduce and demonstrate the text mining toolkits accessible to scholars with no programming experience, and touch upon other tools suitable for more experienced programmers.
4 Responses to “Text Mining BootCamp”
This sounds like exactly what I’m looking for!
Sounds perfect. I work with a wide variety of digital media and would be interested in generating better research query responses from scanned texts.
I’ll show up for this. I might even be able to chip in and offer some pointers to neat NLP and comp-ling research questions!
[…] first session I joined at THATcamp was Aditi Muralidharan’s text mining boot camp, and the topic seemed to set my agenda for the rest of the event (though I wish Aditi had […]