Youngja Park  Youngja Park photo       

contact information

Natural Language Processing, Text Mining, Information Extraction, Machine Learning, Information Security
Thomas J. Watson Research Center, Yorktown Heights, NY USA
  +1dash914dash945dash1421

links



2016

Tri-Modularization of Firewall Policies
Haining Chen, Omar Chowdhury, Ninghui Li, Warut Khern-am-nuai, Suresh Chari, Ian Molloy, Youngja Park
Proceedings of the 21st ACM on Symposium on Access Control Models and Technologies (SACMAT), 2016, pp. 37--48

A platform and analytics for usage and entitlement analytics
Suresh Chari, Ted Habeck, Ian Molloy, Youngja Park, Josyula R. Rao, Wilfried Teiken
IBM Journal of Research and Development 60(4), 7, 2016

Data classification and sensitivity estimation for critical asset discovery
Youngja Park, Wilfried Teiken, Josyula R. Rao, Suresh Chari
IBM Journal of Research and Development 60(4), 2, 2016

Comparing Password Ranking Algorithms on Real-World Password Datasets
Weining Yang, Ninghui Li, Ian M. Molloy, Youngja Park, Suresh N. Chari
21st European Symposium on Research in Computer Security (ESORICS), September 26-30, 2016, pp. 69--90

Graph Analytics for Real-time Scoring of Cross-channel Transactional Fraud
Ian Molloy, Suresh Chari, Ulrich Finkler, Mark Wiggerman, Coen Jonker, Ted Habeck, Youngja Park, Frank Jordens, Ron van Schaik
Financial Cryptography and Data Security, 2016

DinTucker: Scaling up Gaussian process models on large multidimensional arrays
Shandian Zhe, Yuan Qi, Youngja Park, Zenglin Xu, Ian Molloy, Suresh Chari
AAAI, 2016


2015

Scalable Nonparametric Multiway Data Analysis
Shandian Zhe, Zenglin Xu, Xinqi Chu, Yuan Qi, Youngja Park
Proceedings of the 18th International Conference on Artificial Intelligence and Statistics (AISTATS), 2015

Learning from Others: User Anomaly Detection UsingAnomalous Samples from Other Users
Youngja Park, Ian M. Molloy, Suresh N. Chari, Zenglin Xu, Chris Gates, Ninghi Li
To appear in Proceedings of the 20th European Symposium on Research in Computer Security (ESORICS), 2015


2014

Learning from a Neighbor: Adapting a Japanese Parser for Korean through Feature Transfer Learning
Hiroshi Kanayama, Youngja Park, Yuta Tsuboi, Dongmook Yi
Proceedings of the EMNLP 2014 Workshop: Language Technology for Closely Related Languages and Language Variants, pp. 2-12

Hetero-Labeled LDA: A Partially Supervised Topic Model with Heterogeneous Labels
Dongyeop Kang, Youngja Park, Suresh N. Chari
Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases (ECML-PKDD), 2014

Detecting Insider Information Theft Using Features from File Access Logs
Christopher Gates, Ninghui Li, Zenglin Xu, Suresh N. Chari, Ian Molloy, Youngja Park
Proceedings of the 19th European Symposium on Research in Computer Security (ESORICS), 2014


2013

PAKDD'12 best paper: generating balanced classifier-independent training samples from unlabeled data
Youngja Park, Zijie Qi, Suresh N. Chari, Ian M. Molloy
Journal of Knowledge and Information Systems (KAIS), Springer London, 2013

A bigData platform for analytics on access control policies and logs
Suresh Chari, Ted Habeck, Ian Molloy, Youngja Park, Wilfried Teiken
Proceedings of the 18th ACM symposium on Access control models and technologies (SACMAT), pp. 185-188, 2013

Ensuring continuous compliance through reconciling policy with usage
Suresh Chari, Ian Molloy, Youngja Park, Wilfried Teiken
Proceedings of the 18th ACM symposium on Access control models and technologies (SACMAT), pp. 49-60, 2013

Estimating Asset Sensitivity by Profiling Users
Youngja Park, Christopher S. Gates, Stephen C. Gates
Proceedings of the 18th European Symposium on Research in Computer Security (ESORICS),, pp. 94-110, 2013


2012

Incremental Subspace Based Classi cation for Sensitive Business Emails
Min Li, Youngja Park, Rui Ma, He Yuan Huang
International Conference on Pattern Recognition (ICPR), 2012

Generative Models for Access Control Policies: Applications to Role Mining Over Logs with Attribution
Ian Molloy, Youngja Park, Suresh N. Chari
Proceedings of the 17th ACM symposium on Access control models and technologies (SACMAT) (Best Paper Runners-Up Award), 2012

Generating Balanced Classifier-Independent Training Samples from Unlabeled Data
Youngja Park, Zijie Qi, Suresh N Chari, Ian M Molloy
Proceedings of Pacific Asia Knowledge Discovery and Data Mining (PAKDD) (Best Paper Runners-Up Award), 2012


2011

An Experimental Study on the Measurement of Data Sensitivities
Youngja Park, Stephen C Gates, Wifried Teiken, Pau-Chen Cheng
Proceedings of Workshop on Building Analysis Datasets and Gathering Experience Returns for Security (BADGERS), pp. 68--75, 2011

Automatic Call Quality Monitoring Using Cost-Sensitive Classification
Youngja Park
Proceedings of Interspeech, 2011

System for automatic estimation of data sensitivity with applications to access control and other applications
Youngja Park, Stephen C Gates, Wifried Teiken, Suresh N Chari
Proceedings of The ACM Symposium on Access Control Models and Technologies (SACMAT), 2011


2010

A Text Mining Approach to Confidential Document Detection for Data Loss Prevention
Youngja Park
IBM Research Technical Report RC25055, 2010


2009

Low-Cost Call Type Classification for Contact Center Calls Using Partial Transcripts
Youngja Park, Wilfried Teiken, Stephen C Gates
Proceedings of Interspeech, 2009

Towards Real-Time Measurement of Customer Satisfaction Using Automatically Generated Call Transcripts
Youngja Park, Stephen C Gates
Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM), pp. 1387--1396, 2009


2008

Automatically Constructing Blue Pages For Characters In Instructional Videos
Ying Li, Youngja Park
Proceedings of IEEE International Conference on Multimedia and Expo (ICME), pp. 1409--1412, 2008

An Empirical Analysis of Word Error Rate and Keyword Error Rates
Youngja Park, Siddharth Patwardhan, Karthik Visweswariah, Stephen C Gates
Proceedings of Interspeech, 2008

Semi-automated logging of contact center telephone calls
Roy J. Byrd, Mary S. Neff, Wilfried Teiken, Youngja Park, Keh-Shin F Cheng, Stephen C Gates, Karthik Visweswariah
Proceedings of the 17th ACM Conference on Information and Knowledge Management (CIKM), pp. 133--142, 2008


2007

Semantic Analysis for Topical Segmentation of Videos
Youngja Park, Ying Li
Proceedings of IEEE International Conference on Semantic Computing, pp. 161--168, 2007

Automatic call section segmentation for contact-center calls
Youngja Park
Proceedings of the 16th ACM Conference on Information and Knowledge Management (CIKM), pp. 117--126, 2007


2006

Extracting Salient Keywords from Instructional Videos Using Joint Text, Audio and Visual Cues
Youngja Park, Ying Li
Proceedings of Conference of the North American Chapter of the Association for Computational Linguistics  Human Language Technologies (NAACL-HLT) (Short Paper), pp. 109--112, 2006

MAGICAL demonstration: system for automated metadata generation for instructional content
Chitra Dorai, Robert Farrell, Amy Katriel, Galina Kofman, Ying Li, Youngja Park
Proceedings of ACM Multimedia Conference (Demo session), pp. 492, ACM, 2006

Atomic Topical Segments Detection for Instructional Videos
Ying Li, Youngja Park, Chitra Dorai
Proceedings of ACM Multimedia Conference, pp. 56, 2006


2004

Glossary Extraction and Utilization for IBM Technical Support Information Search and Delivery System
Lev Kozakov, Youngja Park, T Fin, Y Drissi, Y Doganata, T Cofino
IBM Systems Journal 43(3), 2004

GlossOnt: A concept-focused ontology building tool
Youngja Park
Proceedings of Knowledge Representation and Reasoning (KR04), pp. 498--506, 2004


2003

Towards Ontologies On Demand
Youngja Park, Roy J Byrd, Branimir K Boguraev
Proceedings of Semantic Web Workshop on Semantic Web Technologies for Scientific Search and Information Retrieve, 2003


2002

Identification of Probable Real Words: An Entropy-based Approach
Youngja Park
Proceedings of ACL Workshop on Unsupervised Lexical Acquisition, pp. 1--8, 2002

Automatic Glossary Extraction: Beyond Terminology Identification
Youngja Park, Roy J Byrd, Branimir K Boguraev
Proceedings of the 19th International Conference on Computational Linguistics (COLING), pp. 772--778, 2002


2001

Hybrid Text Mining for Matching Abbreviations and their Definitions
Youngja Park, Roy J Byrd
Proceedings of Empirical Methods in Natural Language Processing (EMNLP), pp. 126--133, 2001


1998

A Genetic Algorithm for Clustering
Youngja Park, ManSuk Song
Proceedings of Genetic Programming Conference (GP), 1998


1997

Automatic Classification of Word Senses Using a Genetic Algorithm
Youngja Park, Seok-Kyung Chung, ManSuk Songg
Proceedings of 17th International Conference on Computer Processing of Oriental Language (ICCPOL), 1997

Genetic Programming Approach to Sense Clustering in Natural Language Processing
Youngja Park, ManSukSong
Proceedings of Genetic Programming Conference (GP), 1997

Estimating Similarity of Word Senses by a Fuzzy Relation on a Large Dictionary
Youngja Park, ManSukSong
Proceedings of Recent Advances in NLP (RANLP), 1997