id author title date pages extension mime words sentences flesch summary cache txt work_yrmvuqtjgjbyxdvawwajt7csp4 Alan Ritter Modeling Missing Data in Distant Supervision for Information Extraction 2013 12 .pdf application/pdf 6587 905 68 Modeling Missing Data in Distant Supervision for Information Extraction Distant supervision algorithms learn information extraction models given only large readily available databases and text collections. mentions a pair of entities (e1 and e2) that participate in a relation, r, is likely to express the proposition r(e1,e2), so we can treat it as a positive training model obtains a 27% increase in area under the precision recall curve on the sentence-level relation extraction task. entity pair, whereas we jointly model relation extraction and missing data in the text and KB. The model also makes the converse assumption: if Freebase contains the relation BIRTHLOCATION(Barack Obama, Honolulu), then we must extract it from at least one sentence. To learn the parameters of the sentence-level relation mention classifier, θ, we maximize the likelihood of the facts observed in Freebase conditioned missing data model corresponds to choosing the values of αMIT and αMID dynamically based on the entities and relations involved. ./cache/work_yrmvuqtjgjbyxdvawwajt7csp4.pdf ./txt/work_yrmvuqtjgjbyxdvawwajt7csp4.txt