Linguistics 715
Seminar: Data manipulation for parser improvement
Autumn 2011
Course goals This seminar will focus on working with the input to and output of syntactic parsers, without necessarily dealing with internal models. That is, the focus is on the linguistic representations of data given to & read from a parser. The specific topics will thus include: various linguistic modifications on the input side (adding (grant)parent annotation, horizontal markovization, projectivization/deprojectivization), parse error detection, parse revision, parser combination, optimal representations of coordination & other phenomena, interactions with semantic role labelers, and so forth. Part of that will depend upon student interest, though there will likely be a greater emphasis placed on working with parser output.
The focus on looking at the linguistic structures of parser input/output means that: a) the types of techniques we will look at need to be very general, i.e., not parser-specific, and b) techniques will have recourse to full parses.
We will examine both constituency & dependency parsing, as many techniques can be applied to either, with a slightly greater emphasis on dependency. Although the emphasis will be on investigating a range of literature on some at times disparate topics, students will be required to develop a hands-on project by the end of the semester.
Office: Memorial Hall (MM) 317
E-mail: md7@pancakeindiana.edu (remove the food)
M | 11:30am-12:30pm |
R | 11:00am-12:00pm |
or by appointment | |
Classroom: Memorial Hall (MH) 401
Course website: http://jones.ling.indiana.edu/~mdickinson/11/715/
Course notes will be posted to this website.
Course prerequisites: L545 and some programming experience are required. L645 or concurrent enrollment is recommended but not strictly required.
Readings: There will be weekly readings for discussion, most of which are available online. See the schedule and bibliography at the end for the full selection and see the course requirements below.
Participation | 15% | |
Discussion leading | 25% | |
Assignments | 20% | |
Final project | 40% | |
Academic Misconduct: Academic misconduct is not allowed in this course. The Indiana University Code of Student Rights, Responsibilities, and Conduct (http://dsa.indiana.edu/Code/) defines academic misconduct as “any activity that tends to undermine the academic integrity of the institution . . . Academic misconduct may involve human, hard-copy, or electronic resources . . . Academic misconduct includes, but is not limited to . . . cheating, fabrication, plagiarism, interference, violation of course rules, and facilitating academic misconduct” (II. G.1-6).
Students with Disabilities: Students who need an accommodation based on the impact of a disability should contact me to arrange an appointment as soon as possible to discuss the course format, to anticipate needs, and to explore potential accommodations.
I rely on Disability Services for Students for assistance in verifying the need for accommodations and developing accommodation strategies. Students who have not previously contacted Disability Services are encouraged to do so (812-855-7578; http://www.indiana.edu/~iubdss/).
(Tentative) Outline of Schedule: This will change a lot, based on what topics we find to be of most interest ...
Aug. 29, 31 | Markus | |
Sep. 7, 12, 14 | No class, I’m gone | |
Sep. 19, 21, 26 | Annotation conversions (1: .pdf, 2x3.pdf, 2: .pdf, 2x3.pdf, 3: .pdf, 2x3.pdf, 4: .pdf, 2x3.pdf) | Markus |
Sep. 28, Oct. 3 | Annotation effects | Sandra |
Oct. 5, 10, 12 | Treebank re-labeling | Chris |
Oct. 17, 19 | Discontinuities & Rules | Daniel |
Oct. 24, 26, 31 | Parse reranking | Joe |
Nov. 2, 7 | Parser bias | Yuyin |
Nov. 9, 14 | Parse combination | Shahab |
Nov. 16, 21 | Parse revision | Amber |
Nov. 28, 30 | Parse error detection/reliability | Amber |
Dec. 5, 7 | Deeper annotation / Semantics | Eric/Wren |
Wed., Dec. 14, 2:45-4:45pm | Project reports | All |
What to do while I’m gone (Sep. 7-14):
Assignments:
LaTeX tutorial: tutorial.tgz
Disclaimer
This syllabus is subject to change. All important changes will be made in writing, with ample time
for adjustment.
Takeshi Abekawa and Manabu Okumura. 2006. Japanese dependency parsing using
co-occurrence information and a combination of case elements. In Proceedings of the 21st
International Conference on Computational Linguistics and 44th Annual Meeting of the
Association for Computational Linguistics, pages 833–840. Association for Computational
Linguistics, Sydney, Australia. URL http://www.aclweb.org/anthology/P06-1105.
Eneko Agirre, Kepa Bengoetxea, Koldo Gojenola, and Joakim Nivre. 2011. Improving
dependency parsing with semantic classes. In Proceedings of the 49th Annual Meeting
of the Association for Computational Linguistics: Human Language Technologies, pages
699–703. Association for Computational Linguistics, Portland, Oregon, USA. URL
http://www.aclweb.org/anthology/P11-2123.
Bharat Ram Ambati. 2010. Importance of linguistic constraints in statistical dependency
parsing. In Proceedings of the ACL 2010 Student Research Workshop, pages 103–108.
Uppsala, Sweden. URL http://www.aclweb.org/anthology/P10-3018.
Giuseppe Attardi and Massimiliano Ciaramita. 2007. Tree revision learning for dependency
parsing. In Human Language Technologies 2007: The Conference of the North American
Chapter of the Association for Computational Linguistics; Proceedings of the Main
Conference, pages 388–395. Association for Computational Linguistics, Rochester, New
York. URL http://www.aclweb.org/anthology/N/N07/N07-1049.pdf.
Giuseppe Attardi and Felice Dell’Orletta. 2009. Reverse revision and linear
tree combination for dependency parsing. In Proceedings of Human Language
Technologies: The 2009 Annual Conference of the North American Chapter of
the Association for Computational Linguistics, Companion Volume: Short Papers,
pages 261–264. Association for Computational Linguistics, Boulder, Colorado. URL
http://www.aclweb.org/anthology/N/N09/N09-2066.pdf.
Timothy Baldwin, Mark Dras, Julia Hockenmaier, Tracy Holloway King, and Gertjan
van Noord. 2007. The impact of deep linguistic processing on parsing technology.
In Proceedings of the Tenth International Conference on Parsing Technologies, pages
36–38. Association for Computational Linguistics, Prague, Czech Republic. URL
http://www.aclweb.org/anthology/W/W07/W07-2205.pdf.
Anja Belz. 2001. Optimisation of corpus-derived probabilistic grammars.
In Proceedings of Corpus Linguistics 2001, pages 46–57. URL
http://www.itri.brighton.ac.uk/~Anja.Belz/Publications/optimisation-of-corpus-derived-grammars-cl2001.ps.gz.
Kepa Bengoetxea and Koldo Gojenola. 2009a. Application of feature propagation to
dependency parsing. In Proceedings of the 11th International Conference on Parsing
Technologies (IWPT’09), pages 142–145. Association for Computational Linguistics, Paris,
France. URL http://www.aclweb.org/anthology/W09-3822.
Kepa Bengoetxea and Koldo Gojenola. 2009b. Exploring
treebank transformations in dependency parsing. In Recent
Advances in Natural Language Processing (RANLP 2009). URL
https://ixa.si.ehu.es/Ixa/Argitalpenak/Artikuluak/1248190513/publikoak/ranlp_kepa-koldo-final.pdf.
Kepa Bengoetxea and Koldo Gojenola. 2010. Application of different techniques
to dependency parsing of Basque. In Proceedings of the NAACL HLT 2010
First Workshop on Statistical Parsing of Morphologically-Rich Languages, pages
31–39. Association for Computational Linguistics, Los Angeles, CA, USA. URL
http://www.aclweb.org/anthology/W10-1404.
Eckhard Bick. 2007. Hybrid ways to
improve domain independence in an ML dependency parser. In Proceedings of the CoNLL
Shared Task Session of EMNLP-CoNLL 2007, pages 1119–1123. Prague, Czech Republic.
URL http://www.aclweb.org/anthology/D/D07/D07-1120.pdf.
Johan Bos, Cristina Bosco, and Alessandro Mazzei. 2009. Converting a dependency
treebank to a categorial grammar treebank for Italian. In Proceedings of Proceedings of
the Eighth International Workshop on Treebanks and Linguistic Theories (TLT-8). Milan,
Italy. URL http://tlt8.unicatt.it/FullPaper/C_2.pdf.
Cristina Bosco, Simonetta Montemagni, Alessandro Mazzei, Vincenzo Lombardo,
Felice Dell’Orletta, Alessandro Lenci, Leonardo Lesmo, Giuseppe Attardi, Maria Simi,
Alberto Lavelli, Johan Hall, Jens Nilsson, and Joakim Nivre. 2010. Comparing
the influence of different treebank annotations on dependency parsing. In Nicoletta
Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani,
Jan Odijk, Stelios Piperidis, Mike Rosner, and Daniel Tapias, editors, Proceedings
of the Seventh conference on International Language Resources and Evaluation
(LREC’10). European Language Resources Association (ELRA), Valletta, Malta. URL
http://www.lrec-conf.org/proceedings/lrec2010/summaries/196.html.
Pierre Boullier and Benoît Sagot. 2005. Efficient and robust LFG parsing: SxLFG.
In Proceedings of the Ninth International Workshop on Parsing Technology, pages
1–10. Association for Computational Linguistics, Vancouver, British Columbia. URL
http://www.aclweb.org/anthology/W/W05/W05-1501.pdf.
Adriane Boyd. 2007. Discontinuity revisited: An improved conversion to context-free
representations. In Proceedings of the Linguistic Annotation Workshop, pages
41–44. Association for Computational Linguistics, Prague, Czech Republic. URL
http://www.aclweb.org/anthology/W/W07/W07-1506.pdf.
Eric Brill. 1993.
Automatic grammar induction and parsing free text: A transformation-based approach. In
Proceedings of Human Language Technology (HLT), pages 237–242. Plainsboro, NJ. URL
http://aclweb.org/anthology-new/H/H93/H93-1047.pdf.
Eric Brill and Barbora Hladká. 1998. Core natural language processing technology
applicable to multiple languages. the workshop 98 final report, chapter 5: The
superparser. Technical report, Johns Hopkins University, Baltimore, MD. URL
http://www.clsp.jhu.edu/ws98/projects/nlp/report/chapter5/9805.html.
Francis Brunet-Manquant. 2004. Syntactic parser combination for improved dependency
analysis. In Vincenzo Pallotta and Amalia Todirascu, editors, COLING 2004 3rd Workshop
on Robust Methods in Analysis of Natural Language Data, pages 24–31. COLING, Geneva,
Switzerland. URL http://aclweb.org/anthology-new/W/W04/W04-2004.pdf.
Aoife Cahill, Michael Burke, Ruth O’Donovan, Josef Van Genabith, and Andy Way.
2004. Long-distance dependency resolution in automatically acquired wide-coverage
pcfg-based lfg approximations. In Proceedings of the 42nd Meeting of the Association for
Computational Linguistics (ACL’04), Main Volume, pages 319–326. Barcelona, Spain. URL
http://www.aclweb.org/anthology/P04-1041.
David Campbell and Stephen Johnson. 2002. A transformational-based learner for
dependency grammars in discharge summaries. In Proceedings of the ACL-02 Workshop on
Natural Language Processing in the Biomedical Domain, pages 37–44. Phildadelphia. URL
http://www.aclweb.org/anthology/W02-0306.
Eugene Charniak and Mark Johnson. 2005. Coarse-to-fine n-best parsing and maxent
discriminative reranking. In Proceedings of ACL-05, pages 173–180. Ann Arbor, MI, USA.
URL http://aclweb.org/anthology-new/P/P05/P05-1022.pdf.
Wanxiang Che, Zhenghua Li, Yuxuan Hu, Yongqiang Li, Bing Qin, Ting Liu, and
Sheng Li. 2008. A cascaded syntactic and semantic dependency parsing system. In
CoNLL 2008: Proceedings of the Twelfth Conference on Computational Natural Language
Learning, pages 238–242. Coling 2008 Organizing Committee, Manchester, England. URL
http://www.aclweb.org/anthology/W08-2134.
Wenliang Chen,
Jun’ichi Kazama, Kiyotaka Uchimoto, and Kentaro Torisawa. 2009. Improving dependency
parsing with subtrees from auto-parsed data. In Proceedings of EMNLP-09, pages 570–579.
Singapore. URL http://www.aclweb.org/anthology/D/D09/D09-1060.pdf.
Wenliang Chen, Youzheng Wu, and Hitoshi Isahara. 2008. Learning reliable information
for dependency parsing adaptation. In Proceedings of the 22nd International Conference
on Computational Linguistics (Coling 2008), pages 113–120. Coling 2008 Organizing
Committee, Manchester, UK. URL http://www.aclweb.org/anthology/C08-1015.
Jinho D. Choi and Martha Palmer. 2010. Robust constituent-to-dependency
conversion for english. In Proceedings of the Ninth International Workshop on
Treebanks and Linguistic Theories (TLT-9), pages 55–66. Tartu, Estonia. URL
http://dspace.utlib.ee/dspace/bitstream/handle/10062/15934/tlt9_submission_3.pdf?sequence=1.
Grzegorz Chrupała and Josef van Genabith. 2006. Improving treebank-based
automatic lfg induction for spanish. In Proceedings of the LFG06 Conference. URL
http://www.lsv.uni-saarland.de/personalPages/gchrupala/papers/chrupala-vangenabith-2006b/paper.pdf.
Tagyoung Chung, Matt Post, and Daniel Gildea. 2010. Factors affecting the accuracy of
Korean parsing. In Proceedings of the NAACL HLT 2010 First Workshop on Statistical
Parsing of Morphologically-Rich Languages, pages 49–57. Association for Computational
Linguistics, Los Angeles, CA, USA. URL http://www.aclweb.org/anthology/W10-1406.
Massimiliano Ciaramita and Giuseppe
Attardi. 2007. Dependency parsing with second-order feature maps and annotated semantic
information. In Proceedings of the Tenth International Conference on Parsing Technologies,
pages 133–143. Association for Computational Linguistics, Prague, Czech Republic. URL
http://www.aclweb.org/anthology/W/W07/W07-2217.pdf.
Michael Collins and Terry Koo.
2005. Discriminative reranking for natural language parsing. Computational Linguistics,
31(1):25–69. URL http://aclweb.org/anthology-new/J/J05/J05-1003.pdf.
Brooke Cowan and Michael Collins. 2005. Morphology and reranking for the statistical
parsing of Spanish. In Proceedings of Human Language Technology Conference and
Conference on Empirical Methods in Natural Language Processing, pages 795–802.
Association for Computational Linguistics, Vancouver, British Columbia, Canada. URL
http://www.aclweb.org/anthology/H/H05/H05-1100.pdf.
Michael Daum, Kilian Foth, and Wolfgang Menzel. 2004. Automatic transformation
of phrase treebanks to dependency trees. In Proceedings of the 4th International
Conference on Language Resources and Evaluation, LREC-2004. Lisbon, Portugal. URL
http://nats-www.informatik.uni-hamburg.de/~menzel/papers/lrec2004.ps.gz.
Markus Dickinson. 2010. Detecting errors in automatically-parsed dependency relations.
In The 48th Annual Meeting of the Association for Computational Linguistics (ACL-10).
Uppsala, Sweden. URL
http://jones.ling.indiana.edu/~mdickinson/papers/dickinson10.html.
Pétr Dienes and Amit Dubey. 2003. Deep syntactic processing by combining shallow
methods. In Proceedings of the 41st Annual Meeting of the Association for Computational
Linguistics, pages 431–438. Association for Computational Linguistics, Sapporo, Japan.
URL http://www.aclweb.org/anthology/P03-1055.
Mark Dredze, John Blitzer, Partha Pratim Talukdar, Kuzman Ganchev, João Graca,
and Fernando Pereira. 2007. Frustratingly hard domain adaptation for dependency
parsing. In Proceedings of the CoNLL Shared Task Session of EMNLP-CoNLL 2007,
pages 1051–1055. Association for Computational Linguistics, Prague, Czech Republic. URL
http://www.aclweb.org/anthology/D/D07/D07-1112.pdf.
Mark Fishel and Joakim Nivre. 2009. Voting and
stacking in data-driven dependency parsing. In Proceedings of the 17th Nordic Conference
on Computational Linguistics NODALIDA’2009, pages 219–222. Odense, Denmark. URL
http://lepo.it.da.ut.ee/~fishel/doc/publ/vote-stack.nodalida09.pdf.
Kilian Foth, Ingo Schröder, and Wolfgang Menzel. 2000. A transformation-based parsing
technique with anytime properties.
In Proceedings of the 4th International Workshop on Parsing Technologies, pages 89–100.
URL http://nats-www.informatik.uni-hamburg.de/~menzel/papers/iwpt00.ps.gz.
Kilian A. Foth and Wolfgang Menzel. 2006. Hybrid parsing: Using probabilistic models
as predictors for a symbolic parser. In Proceedings of the 21st International Conference on
Computational Linguistics and 44th Annual Meeting of the Association for Computational
Linguistics, pages 321–328. Association for Computational Linguistics, Sydney, Australia.
URL http://www.aclweb.org/anthology/P06-1041.
Anette Frank and Jiri Semecky. 2004. Corpus-based Induction of an LFG
Syntax-Semantics Interface for Frame Semantic Processing. In Silvia Hansen-Schirra,
Stefan Oepen, and Hans Uszkoreit, editors, Proceedings of the 5th International
Workshop on Linguistically Interpreted Corpora. Geneva, Switzerland. URL
http://www.cl.uni-heidelberg.de/~frank/papers/linc04-frank-semecky-new.pdf.
Andrea Gesmundo, James Henderson, Paola Merlo, and Ivan Titov. 2009. A latent variable
model of synchronous syntactic-semantic parsing for multiple languages. In Proceedings of
the Thirteenth Conference on Computational Natural Language Learning (CoNLL 2009):
Shared Task, pages 37–42. Association for Computational Linguistics, Boulder, Colorado.
URL http://www.aclweb.org/anthology/W09-1205.
Daniel Gildea and Julia Hockenmaier. 2003. Identifying semantic roles using combinatory
categorial grammar. In Michael Collins and Mark Steedman, editors, Proceedings of the
2003 Conference on Empirical Methods in Natural Language Processing, pages 57–64. URL
http://www.aclweb.org/anthology/W03-1008.pdf.
Yoav Goldberg and Michael Elhadad. 2009. Hebrew dependency parsing: Initial
results. In Proceedings of the 11th International Conference on Parsing Technologies
(IWPT’09), pages 129–133. Association for Computational Linguistics, Paris, France. URL
http://www.aclweb.org/anthology/W09-3819.
Yoav Goldberg and Michael Elhadad. 2010. Inspecting the structural biases of
dependency parsing algorithms. In Proceedings of the Fourteenth Conference on
Computational Natural Language Learning, pages 234–242. Uppsala, Sweden. URL
http://www.aclweb.org/anthology/W10-2927.
Gholamreza Haffari, Marzieh Razavi, and Anoop Sarkar. 2011. An ensemble model that
combines syntactic and semantic clustering for discriminative dependency parsing. In
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics:
Human Language Technologies, pages 710–714. Association for Computational Linguistics,
Portland, Oregon, USA. URL http://www.aclweb.org/anthology/P11-2125.
Keith Hall and Václav Novák. 2005. Corrective modeling for non-projective dependency
parsing. In Proceedings of the Ninth International Workshop on Parsing Technology, pages
42–52. Association for Computational Linguistics, Vancouver, British Columbia. URL
http://www.aclweb.org/anthology/W/W05/W05-1505.pdf.
Tadayoshi Hara, Yusuke Miyao, and Jun’ichi Tsujii. 2009a. Descriptive and
empirical approaches to capturing underlying dependencies among parsing errors.
In Proceedings of the 2009 Conference on Empirical Methods in Natural Language
Processing, pages 1162–1171. Association for Computational Linguistics, Singapore. URL
http://www.aclweb.org/anthology/D/D09/D09-1121.pdf.
Tadayoshi Hara, Yusuke Miyao, and Jun’ichi Tsujii. 2009b. Effective analysis of causes and
inter-dependencies of parsing errors. In Proceedings of the 11th International Conference
on Parsing Technologies (IWPT’09), pages 180–191. Association for Computational
Linguistics, Paris, France. URL http://www.aclweb.org/anthology/W09-3828.
John C. Henderson and Eric Brill. 1999. Exploiting diversity in natural language
processing: Combining parsers. In Proceedings of the Conference on Empirical Methods in
Natural Language Processing and Very Large Corpora, pages 187–194. College Park, MD.
URL http://aclweb.org/anthology-new/W/W99/W99-0623.pdf.
Julia Hockenmaier. 2006. Creating a CCGbank and a wide-coverage CCG lexicon
for German. In Proceedings of the 21st International Conference on Computational
Linguistics and 44th Annual Meeting of the Association for Computational Linguistics,
pages 505–512. Association for Computational Linguistics, Sydney, Australia. URL
http://www.aclweb.org/anthology/P06-1064.
Julia Hockenmaier and Mark Steedman. 2002. Acquiring compact lexicalized
grammars from a cleaner treebank. In Proceedings of Third International
Conference on Language Resources and Evaluation. Las Palmas. URL
http://nlp.cs.illinois.edu/HockenmaierGroup/Papers/LREC2002/HockenmaierSteedmanLREC2002.ps.
Yu-Yin Hsu. 2010. Comparing conversions of discontinuity in pcfg
parsing. In Proceedings of the Ninth International Workshop on Treebanks
and Linguistic Theories (TLT-9), pages 103–113. Tartu, Estonia. URL
http://dspace.utlib.ee/dspace/bitstream/handle/10062/15954/tlt9_submission_32.pdf?sequence=1.
Liang Huang. 2008. Forest reranking: Discriminative parsing with non-local features. In
Proceedings of ACL-08: HLT, pages 586–594. Association for Computational Linguistics,
Columbus, Ohio. URL http://www.aclweb.org/anthology/P/P08/P08-1067.pdf.
Takashi Inui and Kentaro Inui. 2000. Committee-based decision making in probabiiistic
partial parsing. In Proceedings of COLING-00, pages 348–354. Saarbrücken, Germany. URL
http://aclweb.org/anthology-new/C/C00/C00-1051.pdf.
Wenbin Jiang and Qun Liu. 2009. Automatic adaptation of annotation standards
for dependency parsing — using projected treebank as source corpus. In
Proceedings of the 11th International Conference on Parsing Technologies (IWPT’09),
pages 25–28. Association for Computational Linguistics, Paris, France. URL
http://www.aclweb.org/anthology/W09-3803.
Richard Johansson and Pierre Nugues. 2007. Extended constituent-to-dependency
conversion for english. In Proceedings of NODALIDA 2007. Tartu, Estonia. URL
http://fileadmin.cs.lth.se/cs/Personal/Pierre_Nugues/Articles/nodalida2007/reg-Johansson-10.pdf.
Richard Johansson and Pierre Nugues. 2008. The effect of syntactic representation
on semantic role labeling. In Proceedings of the 22nd International Conference
on Computational Linguistics (Coling 2008), pages 393–400. Coling 2008 Organizing
Committee, Manchester, UK. URL http://www.aclweb.org/anthology/C08-1050.
Mark Johnson. 1998. Pcfg models of linguistic tree representations. Computational
Linguistics, 24(4):613–632. URL http://aclweb.org/anthology-new/J/J98/J98-4004.pdf.
Daisuke Kawahara and Kiyotaka Uchimoto. 2008. Learning reliability of parses for
domain adaptation of dependency parsing. In Proceedings of the Third International Joint
Conference on Natural Language Processing (IJCNLP-08), pages 709–714. Hyderabad,
India. URL http://aclweb.org/anthology/I/I08/I08-2097.pdf.
Lidia Khmylko, Kilian A. Foth, and Wolfgang Menzel. 2009. Co-parsing with competitive
models. In Proceedings of the 11th International Conference on Parsing Technologies
(IWPT’09), pages 99–107. Association for Computational Linguistics, Paris, France. URL
http://www.aclweb.org/anthology/W09-3816.
Dan Klein and Christopher D. Manning. 2003. Accurate unlexicalized parsing. In
Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics,
pages 423–430. Association for Computational Linguistics, Sapporo, Japan. URL
http://www.aclweb.org/anthology/P03-1054.
Terry Koo and Michael Collins. 2005. Hidden-variable models for discriminative
reranking. In Proceedings of Human Language Technology Conference and Conference
on Empirical Methods in Natural Language Processing, pages 507–514. Association
for Computational Linguistics, Vancouver, British Columbia, Canada. URL
http://www.aclweb.org/anthology/H/H05/H05-1064.pdf.
Sandra Kübler, Erhard Hinrichs, Wolfgang Maier, and Eva Klett. 2009. Parsing
coordinations. In Proceedings of the 12th Conference of the European Chapter of the ACL
(EACL 2009), pages 406–414. Association for Computational Linguistics, Athens, Greece.
URL http://www.aclweb.org/anthology/E09-1047.
Sandra Kübler, Erhard W. Hinrichs, and Wolfgang Maier. 2006. Is it really that difficult
to parse German? In Proceedings of the 2006 Conference on Empirical Methods in Natural
Language Processing, pages 111–119. Association for Computational Linguistics, Sydney,
Australia. URL http://www.aclweb.org/anthology/W/W06/W06-1614.pdf.
Sandra Kübler, Wolfgang Maier, Ines Rehbein, and Yannick Versley. 2008. How
to compare treebanks. In Proceedings of the Sixth International Conference on
Language Resources and Evaluation (LREC 2008). Marrakesh, Morocco. URL
http://jones.ling.indiana.edu/~skuebler/papers/german_parsing.pdf.
Sandra Kübler and Heike Telljohann. 2002. Towards a dependency-based evaluation
for partial parsing. In Beyond PARSEVAL – Towards Improved Evaluation Measures
for Parsing Systems (LREC 2002 Workshop). Las Palmas, Gran Canaria. URL
http://jones.ling.indiana.edu/~skuebler/papers/lrec_ws02.ps.
Roger Levy and Christopher Manning. 2004. Deep dependencies from context-free
statistical parsers: Correcting the surface dependency approximation. In Proceedings of the
42nd Meeting of the Association for Computational Linguistics (ACL’04), Main Volume,
pages 327–334. Barcelona, Spain. URL http://www.aclweb.org/anthology/P04-1042.
Dekang Lin. 1995. A dependency-based method
for evaluating broad-coverage parsers. In Proceedings of IJCAI-95, pages 1420–1425. URL
http://webdocs.cs.ualberta.ca/~lindek/papers/ijcai95.ps.
Wolfgang Maier. 2006. Annotation schemes and their influence on parsing
results. In Proceedings of the COLING/ACL 2006 Student Research Workshop,
pages 19–24. Association for Computational Linguistics, Sydney, Australia. URL
http://www.aclweb.org/anthology/P/P06/P06-3004.pdf.
Takuya Matsuzaki, Yusuke Miyao, and Junichi Tsujii. 2005. Probabilistic CFG with
latent annotations. In Proceedings of ACL-05, pages 75–82. Ann Arbor, MI, USA. URL
http://www.aclweb.org/anthology-new/P/P05/P05-1010.pdf.
Ryan McDonald and Joakim Nivre. 2007. Characterizing the errors of data-driven
dependency parsing models. In Proceedings of the 2007 Joint Conference on Empirical
Methods in Natural Language Processing and Computational Natural Language Learning
(EMNLP-CoNLL), pages 122–131. Association for Computational Linguistics, Prague,
Czech Republic. URL http://www.aclweb.org/anthology/D/D07/D07-1013.pdf.
Ryan McDonald and Joakim Nivre. 2011.
Analyzing and integrating dependency parsers. Computational Linguistics, 37(1):197–230.
URL http://aclweb.org/anthology-new/J/J11/J11-1007.pdf.
Yusuke Miyao, Takashi Ninomiya, and Jun’ichi Tsujii. 2004. Corpus-oriented grammar
development for acquiring a head-driven phrase structure grammar from the penn treebank.
In Natural Language Processing - IJCNLP 2004, pages 684–693. Hainan Island, China. URL
http://www-tsujii.is.s.u-tokyo.ac.jp/7Eyusuke/paper/ijcnlp04.pdf.
Jens Nilsson, Joakim Nivre, and Johan Hall. 2007. Generalizing tree transformations for
inductive dependency parsing. In Proceedings of the 45th Annual Meeting of the Association
of Computational Linguistics, pages 968–975. Association for Computational Linguistics,
Prague, Czech Republic. URL http://www.aclweb.org/anthology/P07-1122.
Zheng-Yu Niu, Haifeng Wang, and Hua Wu. 2009. Exploiting heterogeneous treebanks
for parsing. In Proceedings of the Joint Conference of the 47th Annual Meeting of the
ACL and the 4th International Joint Conference on Natural Language Processing of the
AFNLP, pages 46–54. Association for Computational Linguistics, Suntec, Singapore. URL
http://www.aclweb.org/anthology/P/P09/P09-1006.pdf.
Joakim Nivre.
2009. Non-Projective Dependency Parsing in Expected Linear Time.
In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP,
pages 351–359. Association for Computational Linguistics, Suntec, Singapore.
URL http://www.aclweb.org/anthology/P/P09/P09-1040.pdf.
Joakim Nivre, Johan Hall, Jens Nilsson, Gülşen Eryiǧit, and Svetoslav Marinov.
2006. Labeled pseudo-projective dependency parsing with support vector machines.
In Proceedings of the Tenth Conference on Computational Natural Language Learning
(CoNLL-X), pages 221–225. Association for Computational Linguistics, New York City.
URL http://www.aclweb.org/anthology/W/W06/W06-2933.pdf.
Joakim Nivre and Ryan McDonald.
2008. Integrating graph-based and transition-based dependency parsers. In Proceedings of
ACL-08: HLT, pages 950–958. Association for Computational Linguistics, Columbus, Ohio.
URL http://www.aclweb.org/anthology/P/P08/P08-1108.pdf.
Joakim Nivre and Jens Nilsson. 2005. Pseudo-projective
dependency parsing. In Proceedings of ACL-05, pages 99–106. Ann Arbor, MI, USA. URL
http://aclweb.org/anthology-new/P/P05/P05-1013.pdf.
Joakim Nivre, Laura Rimell, Ryan McDonald, and Carlos Gómez Rodríguez.
2010. Evaluation of dependency parsers on unbounded dependencies. In
Proceedings of the 23rd International Conference on Computational Linguistics (Coling
2010), pages 833–841. Coling 2010 Organizing Committee, Beijing, China. URL
http://www.aclweb.org/anthology/C10-1094.
Ruth O’Donovan, Michael Burke, Aoife Cahill, Josef Van Genabith, and Andy
Way. 2005. Large-scale induction and evaluation of lexical resources from the
Penn-II and Penn-III Treebanks. Computational Linguistics, 31(3):330–365. URL
http://aclweb.org/anthology-new/J/J05/J05-3003.pdf.
Lilja Øvrelid, Jonas Kuhn, and Kathrin Spreyer. 2009. Improving data-driven dependency
parsing using large-scale lfg grammars. In Proceedings of the ACL-IJCNLP 2009 Conference
Short Papers, pages 37–40. Association for Computational Linguistics, Suntec, Singapore.
URL http://www.aclweb.org/anthology/P/P09/P09-2010.pdf.
Lilja Øvrelid and Joakim Nivre. 2007. When word order and part-of-speech tags are not
enough – Swedish dependency parsing with rich linguistic features. In Proceedings of the
International Conference on Recent Advances in Natural Language Processing (RANLP),
pages 447–451. URL http://heim.ifi.uio.no/liljao/articles/ranlp07.pdf.
Slav Petrov, Leon Barrett, Romain Thibaux, and Dan Klein. 2006. Learning accurate,
compact, and interpretable tree annotation. In Proceedings of the 21st International
Conference on Computational Linguistics and 44th Annual Meeting of the Association
for Computational Linguistics, pages 433–440. Association for Computational Linguistics,
Sydney, Australia. URL http://www.aclweb.org/anthology/P06-1055.
Vasin Punyakanok, Dan Roth, and Wen tau Yih. 2008. The importance of syntactic
parsing and inference in semantic role labeling. Computational Linguistics, 34(2):257–287.
URL http://aclweb.org/anthology-new/J/J08/J08-2005.pdf, special Issue on Semantic
Role Labeling.
Owen Rambow. 2010. The simple truth about dependency and phrase structure
representations: An opinion piece. In Human Language Technologies: The 2010 Annual
Conference of the North American Chapter of the Association for Computational
Linguistics, pages 337–340. Association for Computational Linguistics, Los Angeles,
California. URL http://www.aclweb.org/anthology/N10-1049.
Ines Rehbein and Josef van Genabith. 2007a. Treebank annotation schemes and parser
evaluation for German. In Proceedings of the 2007 Joint Conference on Empirical
Methods in Natural Language Processing and Computational Natural Language Learning
(EMNLP-CoNLL), pages 630–639. Association for Computational Linguistics, Prague,
Czech Republic. URL http://www.aclweb.org/anthology/D/D07/D07-1066.pdf.
Ines Rehbein and Josef van Genabith. 2007b. Why is it so difficult to compare treebanks?
TIGER and TüBa-D/Z revisited. In Proceedings of TLT-07. Bergen, Norway. URL
http://tlt07.uib.no/papers/17.pdf.
Roi Reichart and Ari Rappoport. 2007. An ensemble method for selection of high quality
parses. In Proceedings of the 45th Annual Meeting of the Association of Computational
Linguistics, pages 408–415. Association for Computational Linguistics, Prague, Czech
Republic. URL http://www.aclweb.org/anthology/P07-1052.
Han Ren, Donghong Ji, Jing Wan, and Mingyao Zhang. 2009. Parsing syntactic and
semantic dependencies for multiple languages with a pipeline approach. In Proceedings of
the Thirteenth Conference on Computational Natural Language Learning (CoNLL 2009):
Shared Task, pages 97–102. Association for Computational Linguistics, Boulder, Colorado.
URL http://www.aclweb.org/anthology/W09-1215.
Kenji Sagae and
Alon Lavie. 2006. Parser combination by reparsing. In Proceedings of the Human Language
Technology Conference of the NAACL, Companion Volume: Short Papers, pages 129–132.
New York City, USA. URL http://www.aclweb.org/anthology/N/N06/N06-2033.pdf.
Kenji Sagae and Jun’ichi Tsujii. 2007. Dependency parsing and domain adaptation
with LR models and parser ensembles. In Proceedings of the CoNLL Shared Task
Session of EMNLP-CoNLL 2007, pages 1044–1050. Prague, Czech Republic. URL
http://www.aclweb.org/anthology/D/D07/D07-1111.pdf.
Federico Sangati, Willem Zuidema, and Rens Bod. 2009. A generative re-ranking model
for dependency parsing. In Proceedings of the 11th International Conference on Parsing
Technologies (IWPT’09), pages 238–241. Association for Computational Linguistics, Paris,
France. URL http://www.aclweb.org/anthology/W09-3839.
Girogion Satta and Eric Brill. 1996. Efficient transformation-based parsing. In Proceedings
of the 34th Annual Meeting of the Association for Computational Linguistics, pages
255–262. Association for Computational Linguistics, Santa Cruz, California, USA. URL
http://www.aclweb.org/anthology/P96-1034.
Michael Schiehlen. 2004. Annotation strategies for probabilistic parsing in German.
In Proceedings of Coling 2004, pages 390–396. COLING, Geneva, Switzerland. URL
http://aclweb.org/anthology-new/C/C04/C04-1056.pdf.
Natalie Schluter and Josef van Genabith. 2007. Preparing, restructuring, and augmenting
a french treebank: Lexicalised parsers or coherent treebanks? In Proceedings of the 10th
Conference of the Pacific Association for Computational Linguistics. Melbourne, Australia.
URL http://www.itu.dk/people/nael/Pacling2007Paper.pdf.
Natalie Schluter and Josef van Genabith. 2008. Automatic induction of probabilistic
lfg resources for french. In Proceedings of LREC08. Marrakesh, Morocco. URL
http://www.itu.dk/people/nael/Lrec2008Paper.pdf.
Djamé Seddah, Marie Candito, and Benoît Crabbé. 2009. Cross parser evaluation : a
French Treebanks study. In Proceedings of the 11th International Conference on Parsing
Technologies (IWPT’09), pages 150–161. Association for Computational Linguistics, Paris,
France. URL http://www.aclweb.org/anthology/W09-3824.
Wolfgang Seeker, Ines Rehbein, Jonas Kuhn, and Josef Van Genabith. 2010. Hard
constraints for grammatical function labelling. In Proceedings of the 48th Annual Meeting
of the Association for Computational Linguistics, pages 1087–1097. Uppsala, Sweden. URL
http://www.aclweb.org/anthology/P10-1111.
Libin Shen, Anoop Sarkar, and Aravind Joshi. 2003. Using LTAG based features in
parse reranking. In Michael Collins and Mark Steedman, editors, Proceedings of the 2003
Conference on Empirical Methods in Natural Language Processing, pages 89–96. URL
http://www.aclweb.org/anthology/W03-1012.pdf.
Xiaodong Shi. 2007. Nbest dependency parsing with linguistically rich models.
In Proceedings of the Tenth International Conference on Parsing Technologies, pages
80–82. Association for Computational Linguistics, Prague, Czech Republic. URL
http://www.aclweb.org/anthology/W/W07/W07-2210.pdf.
David A. Smith and Jason Eisner. 2009. Parser adaptation and projection with
quasi-synchronous grammar features. In Proceedings of the 2009 Conference on Empirical
Methods in Natural Language Processing, pages 822–831. Association for Computational
Linguistics, Singapore. URL http://www.aclweb.org/anthology/D/D09/D09-1086.pdf.
Mihai Surdeanu and Christopher D. Manning. 2010. Ensemble models for dependency
parsing: Cheap and good? In Human Language Technologies: The 2010 Annual Conference
of the North American Chapter of the Association for Computational Linguistics, pages
649–652. Los Angeles. URL http://www.aclweb.org/anthology/N10-1091.
Ivan Titov and James Henderson. 2007. Fast and robust multilingual dependency parsing
with a generative latent variable model. In Proceedings of the CoNLL Shared Task Session of
EMNLP-CoNLL 2007, pages 947–951. Association for Computational Linguistics, Prague,
Czech Republic. URL http://www.aclweb.org/anthology/D/D07/D07-1099.pdf.
André Filipe Torres Martins, Dipanjan Das, Noah A. Smith, and Eric P. Xing. 2008.
Stacking dependency parsers. In Proceedings of the 2008 Conference on Empirical Methods
in Natural Language Processing, pages 157–166. Association for Computational Linguistics,
Honolulu, Hawaii. URL http://www.aclweb.org/anthology/D08-1017.
Reut Tsarfaty. 2006. Integrated morphological and syntactic disambiguation for
modern Hebrew. In Proceedings of the COLING/ACL 2006 Student Research Workshop,
pages 49–54. Association for Computational Linguistics, Sydney, Australia. URL
http://www.aclweb.org/anthology/P/P06/P06-3009.pdf.
Reut Tsarfaty and Khalil Sima’an. 2007. Three-dimensional parametrization for parsing
morphologically rich languages. In Proceedings of the Tenth International Conference on
Parsing Technologies, pages 156–167. Association for Computational Linguistics, Prague,
Czech Republic. URL http://www.aclweb.org/anthology/W/W07/W07-2219.pdf.
Reut Tsarfaty and Khalil Sima’an. 2008. Relational-realizational parsing. In
Proceedings of the 22nd International Conference on Computational Linguistics (Coling
2008), pages 889–896. Coling 2008 Organizing Committee, Manchester, UK. URL
http://www.aclweb.org/anthology/C08-1112.
Reut Tsarfaty and Khalil Sima’an. 2010. Modeling morphosyntactic agreement in
constituency-based parsing of modern hebrew. In Proceedings of the NAACL HLT
2010 First Workshop on Statistical Parsing of Morphologically-Rich Languages, pages
40–48. Association for Computational Linguistics, Los Angeles, CA, USA. URL
http://www.aclweb.org/anthology/W10-1405.
Sumire
Uematsu and Jun’ichi Tsujii. 2009. Evaluating contribution of deep syntactic information to
shallow semantic analysis. In Proceedings of the 11th International Conference on Parsing
Technologies (IWPT’09), pages 85–88. Association for Computational Linguistics, Paris,
France. URL http://www.aclweb.org/anthology/W09-3814.
Tylman Ule. 2003.
Directed treebank refinement for pcfg parsing. In Proceedings of TLT-03. Växjö, Sweden.
URL http://www.sfb441.uni-tuebingen.de/a1/Publikationen/ule03tlt.pdf.
Yotaro Watanabe, Masakazu Iwatate, Masayuki Asahara, and Yuji Matsumoto. 2008.
A pipeline approach for syntactic and semantic dependency parsing. In CoNLL 2008:
Proceedings of the Twelfth Conference on Computational Natural Language Learning,
pages 228–232. Coling 2008 Organizing Committee, Manchester, England. URL
http://www.aclweb.org/anthology/W08-2132.
Fei Xia and Martha Palmer. 2001. Converting dependency structures to phrase structures.
In Proceedings of the First International Conference on Human Language Technology
Research (HLT-01). URL http://aclweb.org/anthology-new/H/H01/H01-1014.pdf.
Fei Xia and Martha Palmer. 2010. From treebank to tree-adjoining grammar.
In Srinivas Bangalore and Aravind K. Joshi, editors, Supertagging: Using Complex
Lexical Descriptions in Natural Language Processing, pages 35–72. MIT Press. URL
http://faculty.washington.edu/fxia/papers_from_penn/lextract_chapter_printout.pdf.
Nianwen Xue. 2007. Tapping the implicit information for the ps to ds conversion
of the chinese treebank. In Proceedings of TLT-07. Bergen, Norway. URL
http://tlt07.uib.no/papers/20.pdf.
Alexander Yates, Stefan Schoenmackers, and Oren Etzioni. 2006. Detecting parser errors
using
web-based semantic filters. In Proceedings of the 2006 Conference on Empirical Methods
in Natural Language Processing, pages 27–34. Association for Computational Linguistics,
Sydney, Australia. URL http://www.aclweb.org/anthology/W/W06/W06-1604.pdf.
Szu-ting Yi and Martha Palmer. 2005. The integration of syntactic parsing and semantic
role labeling. In Proceedings of the Ninth Conference on Computational Natural Language
Learning (CoNLL-2005), pages 237–240. Association for Computational Linguistics, Ann
Arbor, Michigan. URL http://www.aclweb.org/anthology/W/W05/W05-0639.pdf.
Kazuhiro Yoshida. 2005. Corpus-oriented development of japanese HPSG parsers.
In Proceedings of the ACL Student Workshop, pages 139–144. Ann Arbor, MI. URL
http://aclweb.org/anthology-new/P/P05/P05-2024.pdf.
Daniel Zeman. 2009. A simple generative pipeline approach to dependency
parsing and semantic role labeling. In Proceedings of the Thirteenth Conference
on Computational Natural Language Learning (CoNLL 2009): Shared Task, pages
120–125. Association for Computational Linguistics, Boulder, Colorado. URL
http://www.aclweb.org/anthology/W09-1219.
Daniel Zeman and Zdeněk Žabokrtský. 2005. Improving parsing accuracy by combining
diverse dependency parsers. In Proceedings of the Ninth International Workshop on Parsing
Technology, pages 171–178. Association for Computational Linguistics, Vancouver, British
Columbia. URL http://www.aclweb.org/anthology/W/W05/W05-1518.pdf.
Muhua Zhu, Jingbo Zhu, and Minghan Hu. 2011. Better automatic treebank
conversion using a feature-based approach. In Proceedings of the 49th Annual Meeting
of the Association for Computational Linguistics: Human Language Technologies, pages
715–719. Association for Computational Linguistics, Portland, Oregon, USA. URL
http://www.aclweb.org/anthology/P11-2126.
References