NLP @ 精品SM在线影片
"The idea of giving computers the ability to process human language is as old as the idea of computers themselves. This vibrant interdisciplinary enterprise has many names corresponding to its many facets, names like speech and language processing, human lanquage technology, natural lanquage processing and computational linquistics. The goal of this exciting field is to provide scientific insights into the nature of human language and to enable human-machine communication and improve human-human communication."
锛峆rofessor Jim Martin
Daniel Jurafsky and James H. Martin, Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics and Speech Recognition (2ed.), Prentice Hall 2009
听
The NLP Process
Training computers to accurately deal with languages is a complex process that intricately weaves together linguistic insights and computational models that reference real world contexts. The process can begin with linguistic analysis, computational models, or a combination of the two. After it鈥檚 begun, however, it usually cycles in the following manner.
听
听
The NLP Ecosystem
The NLP ecosystem is comprised of linguists, computer scientists, and domain experts, as well as the computational linguists who link these three groups together.
听
听
听
Featured Projects
Our faculty are engaged in research projects ranging from language documentation and morphological analysis to semantic analysis and biomedical informatics. We are also currently working on an autonomous conversational agent in a junior high through college classroom setting. Featured below are some of the projects we are most proud of, both past and present.听
听
听 听Ongoing
Jan听28th
DARPA AIDA听Program
Autonomous Interperation of听Disparate听Alternatives
Our goal is to听automatically analyze the content of written documents and extract听key pieces of information about the events they describe, including where听different news sources contradict each other.
Problem
We can鈥檛 possibly keep track of everything that is happening day to day - in the news, in medicine, in financial markets, on social media, etc.
Solution
Natural Language Processing can automatically extract key events, along with who is participating in them and the order in which they happen,听to help make our job of keeping on top of things much more tractable.
Techniques Used
- Deep Learning听
- Graph Embeddings听
- Coreference Resolution听
- Type Matching听
- Entity听& Event Annotation听&听Recognition 听
- Ontology Construction听&听Mapping
听
听
听 听Ongoing
Jan 28th
THYME
Temporal History of Your Medical Events听 听听
Our goal is automatically extracting the timeline of a disease and its treatment from patient records. This benefits individual patients and their doctors by providing quick, accurate summaries of a patient鈥檚 history covering several years. Moreover, aggregating together timelines for large numbers of patients can also aid in analyzing the effectiveness of alternative treatments and the development of new treatments, benefitting all patients.
听
Problem
Ever increasing amounts of electronic clinical data and medical subspecialization hinder the ability of doctors听and patients听to stay on top of all aspects of a patient鈥檚 medical history.
Solution
Natural Language Processing can automatically process thousands of patient records in seconds. This allows automatic identification of salient diseases, signs, symptoms, and treatments, while preserving the timeline of the patient鈥檚 medical history.
Techniques Used
- Annotation of Temporal Relations Between Events
- Annotation and Parsing of Abstract Meaning Representations
- Coreference Annotation and Resolution听
- Entity & Event Annotation & Recognition
听
听
听 听Ongoing
Jan 28th
Universal NLP
听听
NLP is making immense contributions to the听English and听Chinese speaking worlds. Automating teaching to give children access to education and automatic machine translation increasing access to healthcare are听just two examples.听For the rest of the world to benefit from NLP, it needs to function in their languages听too.
听
Problem
The majority of the world's听7000听languages have limited data available for Natural Language Processing.
Solution
When we don鈥檛 have enough data to use classical NLP, there are approaches that can make up for this lack.
Techniques Used
- Transfer Learning听
- Pre-training听
- Multi-task Training听
- Meta Learning