NLP @ 精品SM在线影片

"The idea of giving computers the ability to process human language is as old as the idea of computers themselves. This vibrant interdisciplinary enterprise has many names corresponding to its many facets, names like speech and language processing, human lanquage technology, natural lanquage processing and computational linquistics. The goal of this exciting field is to provide scientific insights into the nature of human language and to enable human-machine communication and improve human-human communication."


锛峆rofessor Jim Martin


Daniel Jurafsky and James H. Martin, Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics and Speech Recognition (2ed.), Prentice Hall 2009

The NLP Process

Training computers to accurately deal with languages is a complex process that intricately weaves together linguistic insights and computational models that reference real world contexts. The process can begin with linguistic analysis, computational models, or a combination of the two. After it鈥檚 begun, however, it usually cycles in the following manner.

An infographic describing the NLP process

The NLP Ecosystem

The NLP ecosystem is comprised of linguists, computer scientists, and domain experts, as well as the computational linguists who link these three groups together.

An infographic about the NLP ecosystem

If this entire process seems interesting to you, why not become a computational linguist?

Apply Now

Featured Projects

Our faculty are engaged in research projects ranging from language documentation and morphological analysis to semantic analysis and biomedical informatics. We are also currently working on an autonomous conversational agent in a junior high through college classroom setting. Featured below are some of the projects we are most proud of, both past and present.听

听Ongoing

Jan听28th

DARPA AIDA听Program

Autonomous Interperation of听Disparate听Alternatives

Project leads

Martha Palmer

Martha Palmer

Susan Brown

Susan Brown

Jim Martin

Jim Martin听 听 听

Chris Heckman

Chris Heckman

Our goal is to听automatically analyze the content of written documents and extract听key pieces of information about the events they describe, including where听different news sources contradict each other.

Problem

We can鈥檛 possibly keep track of everything that is happening day to day - in the news, in medicine, in financial markets, on social media, etc.

Solution

Natural Language Processing can automatically extract key events, along with who is participating in them and the order in which they happen,听to help make our job of keeping on top of things much more tractable.

Techniques Used

  • Deep Learning听
  • Graph Embeddings听
  • Coreference Resolution听
  • Type Matching听
  • Entity听& Event Annotation听&听Recognition 听
  • Ontology Construction听&听Mapping

听听 听

听Ongoing

Jan 28th

THYME

Temporal History of Your Medical Events听 听听

Project leads

Martha Palmer

Martha听Palmer

Jim Martin
听 听 听
Jim听Martin听 听 听 听听

Kristin Wright-Bettener

Kristin听Wright-Bettener

Our goal is automatically extracting the timeline of a disease and its treatment from patient records. This benefits individual patients and their doctors by providing quick, accurate summaries of a patient鈥檚 history covering several years. Moreover, aggregating together timelines for large numbers of patients can also aid in analyzing the effectiveness of alternative treatments and the development of new treatments, benefitting all patients.

Problem

Ever increasing amounts of electronic clinical data and medical subspecialization hinder the ability of doctors听and patients听to stay on top of all aspects of a patient鈥檚 medical history.

Solution

Natural Language Processing can automatically process thousands of patient records in seconds. This allows automatic identification of salient diseases, signs, symptoms, and treatments, while preserving the timeline of the patient鈥檚 medical history.

Techniques Used

  • Annotation of Temporal Relations Between Events
  • Annotation and Parsing of Abstract Meaning Representations
  • Coreference Annotation and Resolution听
  • Entity & Event Annotation & Recognition

听听 听

听Ongoing

Jan 28th

Universal NLP

听听

Project leads

Professor Kann

Katharina Kann

Alexis Palmer

Alexis Palmer听 听 听 听听

NLP is making immense contributions to the听English and听Chinese speaking worlds. Automating teaching to give children access to education and automatic machine translation increasing access to healthcare are听just two examples.听For the rest of the world to benefit from NLP, it needs to function in their languages听too.

Problem

The majority of the world's听7000听languages have limited data available for Natural Language Processing.

Solution

When we don鈥檛 have enough data to use classical NLP, there are approaches that can make up for this lack.

Techniques Used

- Transfer Learning听
- Pre-training听
- Multi-task Training听
- Meta Learning