Archive for the ‘Computational Linguistics’ Category

UMD CMSC 723: Computational Linguistics I

Tuesday, February 21st, 2012

UMD CMSC 723: Computational Linguistics I

Twenty-five (25) posts by Hal Daume III as part of his course on computational linguistics. References, pointers, examples, explanations.

I haven’t read these in detail. As always, welcome your comments/suggestions.

It would be interesting to take the major university computational linguistics courses and create a topic map of the topics covered and recommended resources. Could be useful for students with different learning styles to find an approach that works for them.

Anyone care to hazard a list of say the top twenty (20) schools in computational linguistics? (without ranking, just in the top 20)

PS: The course homepage.

50th Annual Meeting of the Association for Computational Linguistics

Saturday, December 3rd, 2011

50th Annual Meeting of the Association for Computational Linguistics

Important dates:

January 15, 2012 (11:59pm PST) : Long Submission Deadline
March 11, 2012 : Long Notification
April 30, 2012 : Long Camera Ready Deadline
March 18, 2012 (11:59pm PST) : Short Submission Deadline
April 23, 2012 : Short Notification
May 7, 2012 : Short Camera Ready Deadline
July 9, 2012 : Conference Starts – July 14, 2012 : Conference Ends

From the call for papers:

The 50th Annual Meeting of the Association for Computational Linguistics and the Human Language Technologies conference will be organized as a single event to be held at the International Convention Center Jeju, Jeju, Korea, on July 8-14, 2012. The conference will cover a broad spectrum of technical areas related to natural language and computation. ACL 2012 will include full papers, short papers, oral presentations, poster presentations, demonstrations, tutorials, and workshops. The conference is organized by the Association for Computational Linguistics.

The conference invites the submission of papers on original and unpublished research on all aspects of computational linguistics, including but not limited to:

1. Discourse, Dialogue, and Pragmatics
2. Information Extraction
3. Information Retrieval
4. Language Resources
5. Lexical Semantics
6. Lexicon and ontology development
7. Machine Translation
8. Multilinguality
9. Multimodal representations and processing
10. Social Media
11. Natural Language Processing Applications
12. Phonology/Morphology, Tagging and Chunking, Word Segmentation
13. Question Answering
14. Sentiment Analysis and Opinion Mining
15. Spoken Language Processing
16. Statistical and Machine Learning Methods
17. Summarization and Generation
18. Syntax and Parsing
19. Text Classification
20. Text Mining
21. User Studies and Evaluation Methods

Detecting Structure in Scholarly Discourse

Saturday, December 3rd, 2011

Detecting Structure in Scholarly Discourse (DSSD2012)

Important Dates:

March 11, 2012 Submission Deadline
April 15, 2012 Notification of acceptance
April 30, 2012 Camera-ready papers due
July 12 or 13, 2012 Workshop

From the Call for Papers:

The detection of discourse structure in scientific documents is important for a number of tasks, including biocuration efforts, text summarization, error correction, information extraction and the creation of enriched formats for scientific publishing. Currently, many parallel efforts exist to detect a range of discourse elements at different levels of granularity and for different purposes. Discourse elements detected include the statement of facts, claims and hypotheses, the identification of methods and protocols, and as the differentiation between new and existing work. In medical texts, efforts are underway to automatically identify prescription and treatment guidelines, patient characteristics, and to annotate research data. Ambitious long-term goals include the modeling of argumentation and rhetorical structure and more recently narrative structure, by recognizing ‘motifs’ inspired by folktale analysis.

A rich variety of feature classes is used to identify discourse elements, including verb tense/mood/voice, semantic verb class, speculative language or negation, various classes of stance markers, text-structural components, or the location of references. These features are motivated by linguistic inquiry into the detection of subjectivity, opinion, entailment, inference, but also author stance and author disagreement, motif and focus.

Several workshops have been focused on the detection of some of these features in scientific text, such as speculation and negation in the 2010 workshop on Negation and Speculation in Natural Language Processing and the BioNLP’09 Shared Task, and hedging in the CoNLL-2010 Shared Task Learning to detect hedges and their scope in natural language textM. Other efforts that have included a clear focus on scientific discourse annotation include STIL2011 and Force11, the Future of Research Communications and e-Science. There have been several efforts to produce large-scale corpora in this field, such as BioScope, where negation and speculation information were annotated, and the GENIA Event corpus.

The goal of the 2012 workshop Detecting Structure in Scholarly Discourse is to discuss and compare the techniques and principles applied in these various approaches, to consider ways in which they can complement each other, and to initiate collaborations to develop standards for annotating appropriate levels of discourse, with enhanced accuracy and usefulness.

This conference is being held in conjunction with ACL 2012.

Scala Tutorial – Tuples, Lists, methods on Lists and Strings

Monday, October 3rd, 2011

Scala Tutorial – Tuples, Lists, methods on Lists and Strings

I mention this not only because it looks like a good Scala tutorial series but also because it is being developed in connection with a course on computational linguistics at UT Austin (sorry, University of Texas at Austin, USA).

The cross-over between computer programming and computational linguistics illustrates the artificial nature of the divisions we make between disciplines and professions.

Graph-based Clustering for Computational Linguistics: A Survey

Thursday, March 17th, 2011

Graph-based Clustering for Computational Linguistics: A Survey

Slides by Zheng Chen and Heng Ji, City University of New York, July 2010.

A very concise summary of graph methods with citations to the literature.

You won’t be able to run off and become a hairy-chested graph warrior with these slides but you will have a better idea of why graphs are important.

Association for Computational Linguistics: Human Language Technologies (2011 Portland)

Monday, March 14th, 2011

Association for Computational Linguistics: Human Language Technologies (49th annual meeting)

The time for submitting papers is past but a quick look at the list of accepted papers gives plenty of reasons to attend.

To be held at the Portland Marriott Downtown Waterfront in Portland, Oregon, USA, June 19-24, 2011.

So you don’t miss 2012, it will be held on Jeju Island, Republic of Korea. I have been to Jeju Island. It is awesome!