|
| Assistant Professor |
| Stony Brook University (SUNY Stony Brook) |
| 1422 Computer Science |
| Stony Brook, NY 11794-4400 |
| (phone) 631-632-8457 |
| (fax) 631-632-8334 |
 |
|
Office Hours: TBD for Fall 2013
Teaching:
News:
- 2 papers at ACL 2013: one on connotation
lexicon, another on new
image-text parallel corpus
- New media coverage: our work on connotation lexicon is
featured by FastCompany
- 1 journal to appear at TPAMI 2013
- Invited speaker at Vision+NLP
Workshop at NAACL 2013
- Panel speaker at Student
Research Workshop
at NAACL 2013
- Interview with News for New York @ WNBC on deception cues in product reviews
- Area chair for EMNLP 2012
- Area chair for NAACL 2012
Research Projects:
|
- Integrative Models for Natural Language and Images
Web data today is increasingly multi-modal, opening up opportunities as well as the need
for integrative models to bridge Natural Language Processing
with Computer Vision.
Our recent explorations include
- Generating natural language descriptions of images
by guiding object detection with language prior
[cvpr11],
by predicting likely action verbs from language-driven world knowledge
[conll11],
and by composing phrases retrieved by partial image matching
[acl12].
- Understanding
characteristics of visual descriptions
[naacl12].
- Constructing a new image-text parallel corpus by reducing information misalignment between images and text
[acl13s].
|
- Statistical Stylometry and Forensic Language Technologies
Language is a window into people's minds. We explore data-driven approaches to
statistical stylometry (i.e., the study of linguistic styles),
and forensic language technologies (e.g., authorship verification, obfuscation,
deception detection).
This research is naturally interdisciplinary with
broad connections to Psychology, Social Science, Cognitive Science, Psycholinguistics, and
Literature.
Our recent development includes
- Detecting socio-cognitive identities, such as
authorship [FBC12e],
gender [SGC11], and nationality.
- Uncovering (hidden) intent of the authors, such as deception
[OCCH11,
FBC12a,
FXGC12],
and textual vandalism [HHSJC11].
|
|
|
- Learning Connotation from a Network of Words
Our recent work presents algorithms to learn subtle, nuanced connotation of
words using a large-scale constraint optimization on a network of words [acl13,
emnlp11].
Check out our new connotation
lexicon [here].
Our past explorations on opinion analysis include
- Lexicon induction and adaptation for sentiment analysis
[CC09,
FBC11].
- Analysis in light of compositional semantics
[CC08,
CFGJMP10].
- Fine-grained opinion analysis
[CC10,
BCC07,
CBC06,
CCRP05].
Some of our work has been adopted by
Appinions.com
|
Publications:
- Recent Papers:
2013
Generalizing Image Captions for Image-Text Parallel Corpus.
Polina Kuznetsova, Vicente Ordonez, Alexander Berg, Tamara Berg and Yejin Choi.
Association for Computational Linguistics (ACL), 2013.
Connotation Lexicon: A Dash of Sentiment Beneath the Surface
Meaning.
Song Feng, Jun Seok Kang, Polina Kuznetsova and Yejin Choi.
Association for Computational Linguistics (ACL), 2013.
Featured in Fast Company
BabyTalk: Understanding and Generating Simple Image Descriptions.
Girish Kulkarni, Visruth Premraj, Vicente Ordonez, Sagnik Dhar, Siming Li,
Yejin Choi, Alexander C. Berg, Tamara L Berg.
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2013.
2012
Characterizing Stylistic Elements in Syntactic
Structure.
Song Feng, Ritwik Banerjee and Yejin Choi.
Empirical Methods in Natural Language Processing (EMNLP), 2012.
Syntactic Stylometry for Deception Detection.
Song Feng, Ritwik Banerjee and Yejin Choi.
Association for Computational Linguistics (ACL), 2012. short paper
Collective Generation of Natural Image Descriptions.
Polina Kuznetsova, Vicente Ordonez, Alexander Berg, Tamara Berg and Yejin Choi.
Association for Computational Linguistics (ACL), 2012.
Distributional Footprints of Deceptive Product Reviews.
Song Feng, Longfei Xing, Anupam Gogar and Yejin Choi.
International AAAI Conference on Weblogs and Social Media. (ICWSM), 2012.
Best paper runner up
Featured in
Investor's Business Daily;
MIT's Technology Review;
Business Insider;
Computer World;
Consumerist;
Detecting Visual Text.
Jesse Dodge, Amit Goyal, Xufeng Han, Alyssa Mensch, Margaret Mitchell, Karl Stratos, Kota Yamaguchi, Yejin Choi, Hal Daume' III, Alexander C. Berg, Tamara L. Berg.
North American Chapter of the Association for Computational Linguistics. (NAACL), 2012.
- Complete list of publications:
By Year
By Topic
Students:
Short Bio:
Yejin Choi received her Ph.D. in Computer Science at Cornell University,
and BS in Computer Science and Engineering at Seoul National University.
She spent the summer of 2009 as a research intern at Yahoo! Research and joined the faculty of Computer Science Department at Stony Brook University in Sep 2010.