Software/Data

Github Repository for NLP software

Computational Language Understanding (CLULAB) on Github
2016 onward

Explanations for Science Questions
University of Arizona, Stony Brook University & Allen Institute for Artificial Intelligence
This is the dataset for the paper What’s in an Explanation? Characterizing Knowledge and Inference Requirements for Elementary Science Exams (COLING’16). The data contains: gold explanation sentences supporting 363 science questions, relation annotation for a subset of those explanations, and a graphical annotation tool with annotation guidelines.
[ COLING2016_Explanations_Oct2016.zip ]

SISTA-QA Discourse-aware Question Answering System (co-author)
A state-of-the-art question answering system including shallow and deep (RST) discourse models.
[ http://nlp.sista.arizona.edu/releases/acl2014/ ]

Straw2Gold (co-author)
A package for training monolingual alignment and lexical semantic models using discourse structure.
[ http://clulab.cs.arizona.edu/software.php ]

NLP Processors (contributor)
A one-stop package for NLP processors and data structures, through a Scala API.
[ https://github.com/clulab/processors ]

SayWhen: Speech Onset Detection (co-author)
A state-of-the-art automated algorithm and interface for highly-accurate speech onset detection in
psycholinguistics and cognitive experiments.
[ http://cogsci.mcmaster.ca/peter/saywhen/ ]