Show simple item record

dc.contributor.authorKatiyar, Arzoo
dc.date.accessioned2019-10-15T16:51:58Z
dc.date.available2019-10-15T16:51:58Z
dc.date.issued2019-08-30
dc.identifier.otherKatiyar_cornellgrad_0058F_11700
dc.identifier.otherhttp://dissertations.umi.com/cornellgrad:11700
dc.identifier.otherbibid: 11050781
dc.identifier.urihttps://hdl.handle.net/1813/67794
dc.description.abstractExtracting information from text entails deriving a structured, and typically domain-specific, representation of entities and relations from unstructured text. The information thus extracted can potentially facilitate applications such as question answering, information retrieval, conversational dialogue and opinion analysis. However, extracting information from text in a structured form is difficult: it requires understanding words and the relations that exist between them in the context of both the current sentence and the document as a whole. In this thesis, we present our research on neural models that learn structured output representations comprised of textual mentions of entities and relations within a sentence. In particular, we propose the use of novel output representations that allow the neural models to learn better dependencies in the output structure and achieve state-of-the-art performance on both tasks. We also propose models which can learn nested variation of the problem of entity mentions and achieves state-of-the-art performance. We also present our recent work on expanding the input context beyond sentences by incorporating coreference resolution to learn entity-level rather than mention-level representations and show that these representations are important for improving relation extraction. We perform analysis to show that the entity-level representations which capture the information regarding the saliency of entities in the document are beneficial for relation extraction. We also briefly mention about incorporating biases into the neural network models and show improvements in the performance of information extraction.
dc.language.isoen_US
dc.subjectnatural language processing
dc.subjectneural networks
dc.subjectComputer science
dc.subjectEntities
dc.subjectHypergraphs
dc.subjectInformation Extraction
dc.subjectStructured Prediction
dc.titleLEARNING STRUCTURED INFORMATION FROM LANGUAGE
dc.typedissertation or thesis
thesis.degree.disciplineComputer Science
thesis.degree.grantorCornell University
thesis.degree.levelDoctor of Philosophy
thesis.degree.namePh.D., Computer Science
dc.contributor.chairCardie, Claire T.
dc.contributor.committeeMemberKleinberg, Robert David
dc.contributor.committeeMemberMimno, David
dcterms.licensehttps://hdl.handle.net/1813/59810
dc.identifier.doihttps://doi.org/10.7298/ad50-5p36


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

Statistics