Mining Knowledge From Text Using Information Extraction
Mining Knowledge From Text Using Information Extraction
[9] E. Brill. Transformation-based error-driven learning [21] M. J. Collins. Three generative, lexicalised models for
and natural language processing: A case study statistical parsing. In Proceedings of the 35th Annual
in part-of-speech tagging. Computational Linguistics, Meeting of the Association for Computational Linguis-
21(4):543–565, 1995. tics (ACL-97), pages 16–23, 1997.
[23] M. Craven and J. Kumlien. Constructing biological [36] T. Hasegawa, S. Sekine, and R. Grishman. Discovering
knowledge bases by extracting information from text relations among entities from large corpora. In Proceed-
sources. In Proceedings of the 7th International Con- ings of the 42nd Annual Meeting of the Association for
ference on Intelligent Systems for Molecular Biology Computational Linguistics (ACL-04), pages 416–423,
(ISMB-1999), pages 77–86, Heidelberg, Germany, 1999. Barcelona, Spain, July 2004.
[24] A. Culotta and J. Sorensen. Dependency tree kernels for
[37] N. Kushmerick, D. S. Weld, and R. B. Doorenbos.
relation extraction. In Proceedings of the 42nd Annual
Wrapper induction for information extraction. In Pro-
Meeting of the Association for Computational Linguis-
ceedings of the Fifteenth International Joint Conference
tics (ACL-04), Barcelona, Spain, July 2004.
on Artificial Intelligence (IJCAI-97), pages 729–735,
[25] DARPA, editor. Proceedings of the Seventh Message Nagoya, Japan, 1997.
Understanding Evaluation and Conference (MUC-98),
Fairfax, VA, Apr. 1998. Morgan Kaufmann. [38] J. Lafferty, A. McCallum, and F. Pereira. Conditional
random fields: Probabilistic models for segmenting and
[26] P. Domingos. Unifying instance-based and rule-based labeling sequence data. In Proceedings of 18th Interna-
induction. Machine Learning, 24:141–168, 1996. tional Conference on Machine Learning (ICML-2001),
pages 282–289, Williamstown, MA, 2001.
[27] R. B. Doorenbos, O. Etzioni, and D. S. Weld. A scalable
comparison-shopping agent for the World-Wide Web. [39] E. Marcotte, I. Xenarios, and D. Eisenberg. Mining lit-
In Proceedings of the First International Conference on erature for protein-protein interactions. Bioinformatics,
Autonomous Agents (Agents-97), pages 39–48, Marina Apr;17(4):359–363, 2001.
del Rey, CA, Feb. 1997.
[40] J. Mayfield, P. McNamee, and C. Piatko. Named entity
[28] C. D. Fellbaum. WordNet: An Electronic Lexical Data-
recognition using hundreds of thousands of features. In
base. MIT Press, Cambridge, MA, 1998.
Proceedings of the Seventh Conference on Natural Lan-
[29] D. Freitag. Toward general-purpose learning for infor- guage Learning (CoNLL-2003), Edmonton, Canada,
mation extraction. In Proceedings of the 36th Annual 2003.
Meeting of the Association for Computational Linguis-
tics and COLING-98 (ACL/COLING-98), pages 404– [41] A. McCallum and D. Jensen. A note on the unifica-
408, Montreal, Quebec, 1998. tion of information extraction and data mining using
conditional-probability, relational models. In Proceed-
[30] D. Freitag and N. Kushmerick. Boosted wrapper induc- ings of the IJCAI-2003 Workshop on Learning Statis-
tion. In Proceedings of the Seventeenth National Con- tical Models from Relational Data, Acapulco, Mexico,
ference on Artificial Intelligence (AAAI-2000), pages Aug. 2003.
577–583, Austin, TX, July 2000. AAAI Press / The
MIT Press. [42] A. McCallum, S. Tejada, and D. Quass, editors. Pro-
ceedings of the KDD-03 Workshop on Data Cleaning,
[31] D. Freitag and A. McCallum. Information extraction Record Linkage, and Object Consolidation, Washington,
with HMM structures learned by stochastic optimiza- DC, Aug. 2003.
tion. In Proceedings of the Seventeenth National Con-
ference on Artificial Intelligence (AAAI-2000), Austin, [43] F. D. Meulder and W. Daelemans. Memory-based
TX, 2000. AAAI Press / The MIT Press. named entity recognition using unannotated data. In
Proceedings of the Seventh Conference on Natural Lan-
[32] C. Friedman, P. Kra, H. Yu, M. Krauthammer, and
guage Learning (CoNLL-2003), Edmonton, Canada,
A. Rzhetsky. GENIES: A natural-language processing
2003.
system for the extraction of molecular pathways from
journal articles. Bioinformatics, 17:S74–S82, 2001. Sup-
[44] R. J. Mooney and L. Roy. Content-based book recom-
plement 1.
mending using learning for text categorization. In Pro-
[33] K. Fukuda, T. Tsunoda, A. Tamura, and T. Takagi. ceedings of the Fifth ACM Conference on Digital Li-
Information extraction: Identifying protein names from braries, pages 195–204, San Antonio, TX, June 2000.
biological papers. In Proceedings of the 3rd Pacific Sym-
posium on Biocomputing, pages 707–718, 1998. [45] S. H. Muggleton, editor. Inductive Logic Programming.
Academic Press, New York, NY, 1992.
[34] R. Ghani, R. Jones, D. Mladenić, K. Nigam, and
S. Slattery. Data mining on symbolic knowledge ex- [46] U. Y. Nahm. Text Mining with Information Extraction.
tracted from the Web. In D. Mladenić, editor, Proceed- PhD thesis, Department of Computer Sciences, Univer-
ings of the Sixth International Conference on Knowl- sity of Texas, Austin, TX, Aug. 2004.
[55] C. Perez-Iratxeta, P. Bork, and M. A. Andrade. As- [70] C. A. Thompson, M. E. Califf, and R. J. Mooney. Ac-
sociation of genes to genetically inherited diseases us- tive learning for natural language parsing and informa-
ing data mining. Nature Genetics, 31(3):316–319, July tion extraction. In Proceedings of the Sixteenth Inter-
2002. national Conference on Machine Learning (ICML-99),
pages 406–414, Bled, Slovenia, June 1999.
[56] J. R. Quinlan. C4.5: Programs for Machine Learning. [71] A. J. Viterbi. Error bounds for convolutional codes
Morgan Kaufmann, San Mateo,CA, 1993. and and asymptotically optimum decoding algorithm.
IEEE Transactions on Information Theory, 13(2):260–
[57] L. R. Rabiner. A tutorial on hidden Markov models and
269, 1967.
selected applications in speech recognition. Proceedings
of the IEEE, 77(2):257–286, 1989. [72] L. Wall, T. Christiansen, and R. L. Schwartz. Program-
ming Perl. O’Reilly and Associates, Sebastopol, CA,
[58] A. K. Ramani, R. C. Bunescu, R. J. Mooney, and 1996.
E. M. Marcotte. Consolidating the set of know human
protein-protein interactions in preparation for large- [73] D. Zelenko, C. Aone, and A. Richardella. Kernel meth-
scale mapping of the human interactome. Genome Bi- ods for relation extraction. Journal of Machine Learn-
ology, 6(5):r40, 2005. ing Research, 3:1083–1106, 2003.