Jay Pujara's Publications

Publications

Theses

Probabilistic Models for Scalable Knowledge Graph Construction. Jay Pujara. (2016) Ph.D. thesis, University of Maryland, College Park.
bib doi pdf

Fundamental Properties of Feature Selection in fMRI Data. Jay Pujara. (2005) Master's thesis, Carnegie Mellon University.
bib doi pdf

Journals and Magazines

Gendered citation patterns among the scientific elite. Kristina Lerman, Yulin Yu, Fred Morstatter, and Jay Pujara. (2022) Proceedings of the National Academy of Sciences 119(40).
bib doi pdf

Artificial intelligence for modeling complex systems: taming the complexity of expert models to improve decision making. Yolanda Gil, Daniel Garijo, Deborah Khider, Craig Knoblock, Varun Ratnakar, Maximiliano Osorio, Hernan Vargas, Minh Pham, Jay Pujara, Basel Shbita, Binh Vu, Yao-Yi Chiang, Dan Feldman, Yijun Lin, Hayley Song, Vipin Kumar, Ankush Khandelwal, Michael Steinbach, Kshtij Tayal, Shaoming Xu, Suzanne A. Pierce, Lissa PEarson, Daniel Hardesty-Lewis, Ewa Deelman, Rafael Ferreira Da Silva, Rajiv Mayani, Armen R. Kemanian, Yuning Shi, Lorne Leonard, Scott Peckham, and Maria Stoica. (2021) ACM Transactions on Interactive Intelligent Systems 11(2).
bib doi pdf

Learning Cell Embeddings for Understanding Table Layouts. Majid Ghasemi-Gol, Jay Pujara, and Pedro Szekely. (2020) Knowledge and Information Systems 1(64).
bib doi pdf

Generating and Understanding Personalized Explanations in Hybrid Recommender Systems. Pigi Kouki, James Schaffer, Jay Pujara, John O'Donovan, and Lise Getoor. (2020) ACM Transactions on Interactive Intelligent Systems 10(4).
bib doi pdf

Collective Entity Resolution in Multi-relational Familial Networks. Pigi Kouki, Jay Pujara, Christopher Marcum, Laura Koehly, and Lise Getoor. (2018) Knowledge and Information Systems 61(3).
bib doi pdf

Using Semantics & Statistics to Turn Data into Knowledge. Jay Pujara, Hui Miao, Lise Getoor, and William W. Cohen. (2015) AI Magazine 36(1).
bib doi pdf

Refereed Conferences

XMD: An End-to-End Framework for Interactive Explanation-Based Debugging of NLP Models. Dong-Ho Lee, Akshen Kadakia, Brihi Joshi, Aaron Chan, Ziyi Liu, Kiran Narahari, Takashi Shibuya, Ryosuke Mitani, Toshiyuki Sekiya, Jay Pujara, and Xiang Ren. (2023) Association for Computational Linguistics.
bib doi pdf

seq2seq-SC: End-to-End Semantic Communication Systems with Pre-Trained Language Models. Ju-Hyung Lee, Dong-Ho Lee, Eunsoo Sheen, Thomas Choi, Jay Pujara, and Joongheon Kim. (2023) Asilomar Conference on Signals, Systems, and Computers.

AutoTriggER: Label-Efficient and Robust Named Entity Recognition with Auxiliary Trigger Extraction. Dong-Ho Lee, Ravi Kiran Selvam, Sheikh Muhammad Sarwar, Bill Yuchen Lin, Fred Morstatter, Jay Pujara, Elizabeth Boschee, James Allan, and Xiang Ren. (2023) European Chapter of the Association for Computational Linguistics.
bib doi pdf

Temporal Knowledge Graph Forecasting Without Knowledge Using In-Context Learning. Dong-Ho Lee, Kian Ahrabian, Woojeong Jin, Fred Morstatter, and Jay Pujara. (2023) Conference on Empirical Methods in Natural Language Processing.
arxiv bib code doi pdf

Making Large Language Models Better Data Creators. Dong-Ho Lee, Jay Pujara, Mohit Sewak, Ryen White, and Sujay Jauhar. (2023) Conference on Empirical Methods in Natural Language Processing.
arxiv bib code doi pdf

Comparison of Knowledge Graph Representations for Consumer Scenarios. Ana Iglesias-Molina, Kian Ahrabian, Filip Ilievski, Jay Pujara, and Oscar Corcho. (2023) International Semantic Web Conference.
bib pdf

Analyzing Norm Violations in Live-Stream Chat. Jihyung Moon, Dong-Ho Lee, Hyundong Cho, Woojeong Jin, Chan Park, Minwoo Kim, Jonathan May, Jay Pujara, and Sungjoon Park. (2023) Conference on Empirical Methods in Natural Language Processing.
arxiv bib doi pdf

Learn Your Tokens: Word-Pooled Tokenization for Language Modeling. Avijit Thawani, Saurabh Ghanekar, Xiaoyuan Zhu, and Jay Pujara. (2023) Findings of the Association for Computational Linguistics: EMNLP.
arxiv bib doi pdf

I Cast Detect Thoughts: Learning to Converse and Guide with Intents and Theory-of-Mind in Dungeons and Dragons. Pei Zhou, Andrew Zhu, Jennifer Hu, Jay Pujara, Xiang Ren, Chris Callison-Burch, Yejin Choi, and Prithviraj Ammanabrolu. (2023) Association for Computational Linguistics.
bib doi pdf

FETA: A benchmark for few-sample task transfer in open-domain dialogue. Alon Albalak, Yi-Lin Tuan, Pegah Jandaghi, Connor Pryor, Luke Yoffe, Deepak Ramachandran, Lise Getoor, Jay Pujara, and William Yang Wang. (2022) Conference on Empirical Methods in Natural Language Processing.
bib doi pdf

Does Wikidata Support Analogical Reasoning?. Filip Ilievski, Jay Pujara, and Kartik Shenoy. (2022) Iberoamerican Knowledge Graphs and Semantic Web Conference.
bib doi pdf

Leveraging Visual Knowledge in Language Tasks: An Empirical Study on Intermediate Pre-training for Cross-Modal Knowledge Transfer. Woojeong Jin, Dong-Ho Lee, Chenguang Zhu, Jay Pujara, and Xiang Ren. (2022) Association for Computational Linguistics.
bib doi pdf

Good Examples Make A Faster Learner: Simple Demonstration-based Learning for Low-resource NER. Dong-Ho Lee, Akshen Kadakia, Kangmin Tan, Mahak Agarwal, Xinyu Feng, Takashi Shibuya, Ryosuke Mitani, Toshiyuki Sekiya, Jay Pujara, and Xiang Ren. (2022) Association for Computational Linguistics.
bib doi pdf

Assessing Scientific Research Papers with Knowledge Graphs. Kexuan Sun, Zhiqiang Qiu, Abel Salinas, Yuzhong Huang, Dong-Ho Lee, Daniel Benjamin, Fred Morstatter, Xiang Ren, Kristina Lerman, and Jay Pujara. (2022) ACM Conference on Research and Development in Information Retrieval (SIGIR).
bib doi pdf

Think Before You Speak: Explicitly Generating Implicit Commonsense Knowledge for Response Generation. Pei Zhou, Karthik Gopalakrishnan, Behnam Hedayatnia, Seokhwan Kim, Jay Pujara, Xiang Ren, Yang Liu, and Dilek Hakkani-Tur. (2022) Association for Computational Linguistics.
bib doi pdf

Reflect, Not Reflex: Inference-Based Common Ground Improves Dialogue Response Quality. Pei Zhou, Hyundong Cho, Pegah Jandaghi, Dong-Ho Lee, Bill Yuchen Lin, Jay Pujara, and Xiang Ren. (2022) Conference on Empirical Methods in Natural Language Processing.
bib doi pdf

Lawyers are Dishonest? Quantifying Representational Harms in Commonsense Knowledge Resources. Nina Mehrabi, Pei Zhou, Fred Morstatter, Jay Pujara, Xiang Ren, and Aram Galstyan. (2021) Conference on Empirical Methods in Natural Language Processing.
arxiv bib pdf

SPADE: A Semi-supervised Probabilistic Approach for Detecting Errors in Tables. Minh Pham, Craig Knoblock, Muhao Chen, Binh Vu, and Jay Pujara. (2021) International Joint Conference on Aritificial Intelligence (IJCAI).
bib doi pdf

A Hybrid Probabilistic Approach for Table Understanding. Kexuan Sun, Harsha Rayudu, and Jay Pujara. (2021) Conference on Artificial Intelligence (AAAI).
bib doi pdf

Tabular Functional Block Detection with Embedding-based Agglomerative Cell Clustering. Kexuan Sun, Fei Wang, Muhao Chen, and Jay Pujara. (2021) Conference on Information and Knowledge Management.
bib doi pdf

Numeracy enhances the Literacy of Language Models. Avijit Thawani, Jay Pujara, and Filip Ilievski. (2021) Conference on Empirical Methods in Natural Language Processing.
bib code pdf

Representing Numbers in NLP: a Survey and a Vision. Avijit Thawani, Jay Pujara, Pedro Szekely, and Filip Ilievski. (2021) Conference of the North American Chapter of the Association for Computational Linguistics (NAACL).
bib doi pdf

A Graph-based Approach for Inferring Semantic Descriptions of Wikipedia Tables. Binh Vu, Craig Knoblock, Pedro Szekely, Jay Pujara, and Minh Pham. (2021) International Semantic Web Conference.
bib doi pdf

Table-based Fact Verification With Salience-aware Learning. Fei Wang, Kexuan Sun, Jay Pujara, Pedro Szekely, and Muhao Chen. (2021) Findings of the Association for Computational Linguistics: EMNLP 2021.
arxiv bib code pdf

Retrieving Complex Tables with Multi-Granular Graph Representation Learning. Fei Wang, Kexuan Sun, Muhao Chen, Jay Pujara, and Pedro Szekely. (2021) ACM Conference on Research and Development in Information Retrieval (SIGIR).
bib doi pdf

Commonsense-Focused Dialogues for Response Generation An Empirical Study. Pei Zhou, Karthik Gopalakrishnan, Behnam Hedayatnia, Seokhwan Kim, Jay Pujara, Xiang Ren, Yang Liu, and Dilek Hakkani-Tur. (2021) Proceedings of the Special Interest Group on Discourse and Dialogue.
bib code pdf video

RICA: Evaluating Robust Inference Capabilities Based on Commonsense Axioms. Pei Zhou, Rahul Khanna, Seyeon Lee, Bill Yuchen Lin, Daniel Ho, Jay Pujara, and Xiang Ren. (2021) Conference on Empirical Methods in Natural Language Processing.
arxiv bib code pdf

Probing Commonsense Explanation in Dialogue Response Generation. Pei Zhou, Pegah Jandaghi, Hyundong Cho, Bill Yuchen Lin, Jay Pujara, and Xiang Ren. (2021) Findings of the Association for Computational Linguistics: EMNLP 2021.
arxiv bib code pdf

Tabular Cell Classification Using Pre-Trained Cell Embeddings. Majid Ghasemi-Gol, Jay Pujara, and Pedro Szekely. (2019) International Conference on Data Mining.
bib doi pdf

Personalized Explanations for Hybrid Recommender Systems. Pigi Kouki, James Schaffer, Jay Pujara, John O'Donovan, and Lise Getoor. (2019) ACM International Conference on Intelligent User Interfaces.
bib doi pdf

Learning Data Transformations with Minimal User Effort. Minh Pham, Craig Knoblock, and Jay Pujara. (2019) IEEE BigData Conference.
bib pdf

A Common Framework for Developing Table Understanding Models. Jay Pujara, Arunkumar Rajendran, Majid Ghasemi-Gol, and Pedro Szekely. (2019) International Semantic Web Conference - Posters.
bib code pdf video

T2WML: Table To Wikidata Mapping Langauge. Pedro Szekely, Daniel Garijo, Divij Bhatia, Jiasheng Wu, Yixiang Yao, and Jay Pujara. (2019) ACM International Conference on Knowledge Capture (K-CAP).
bib doi pdf

D-REPR: A Language for Describing and Mapping Diversely-Structured Data Sources to RDF. Binh Vu, Craig Knoblock, and Jay Pujara. (2019) ACM International Conference on Knowledge Capture (K-CAP).
bib doi pdf

Learning Semantic Models of Data Sources Using Probabilistic Graphical Models. Binh Vu, Craig Knoblock, and Jay Pujara. (2019) The Web Conference.
bib doi pdf

Scalable Probabilistic Causal Structure Discovery. Dhanya Sridhar, Jay Pujara, and Lise Getoor. (2018) International Joint Conference on Artificial Intelligence.
bib code doi pdf

Probabilistic Visitor Stitching on Cross-Device Web Logs. Sungchul Kim, Nikhil Kini, Jay Pujara, Eunyee Koh, and Lise Getoor. (2017) World Wide Web Conference.
bib doi pdf

Collective Entity Resolution in Familial Networks. Pigi Kouki, Jay Pujara, Christopher Marcum, Laura Koehly, and Lise Getoor. (2017) IEEE International Conference on Data Mining.
bib code doi pdf

User Preferences for Hybrid Explanations. Pigi Kouki, James Schaffer, Jay Pujara, John O'Donovan, and Lise Getoor. (2017) ACM Conference on Recommender Systems.
bib doi pdf

Sparsity and Noise: Where Knowledge Graph Embeddings Fall Short. Jay Pujara, Eriq Augustine, and Lise Getoor. (2017) Conference on Empirical Methods in Natural Language Processing (EMNLP).
bib code doi pdf

Disambiguating Energy Disaggregation: A Collective Probabilistic Approach. Sabina Tomkins, Jay Pujara, and Lise Getoor. (2017) International Joint Conference on Artificial Intelligence.
bib code doi pdf

Unsupervised Models for Predicting Strategic Relations between Organizations. Shachi Kumar, Jay Pujara, Lise Getoor, David Mares, Dipak Gupta, and Ellen Riloff. (2016) International Conference on Advances in Social Networks Analysis and Mining.
bib doi pdf

RELLY: Inferring Hypernym Relationships Between Relational Phrases. Adam Grycner, Gerhard Weikum, Jay Pujara, James Foulds, and Lise Getoor. (2015) Conference on Empirical Methods in Natural Language Processing.
bib pdf

Budgeted Online Collective Inference. Jay Pujara, Ben London, and Lise Getoor. (2015) Uncertainty and Artificial Intelligence (UAI).
bib code pdf

Knowledge Graph Identification. Jay Pujara, Hui Miao, Lise Getoor, and William W. Cohen. (2013) International Semantic Web Conference (ISWC).
bib code doi pdf slides video

Using Classifier Cascades for Scalable E-Mail Classification. Jay Pujara, Hal Daume III, and Lise Getoor. (2011) Collaboration, Electronic Messaging, Anti-Abuse and Spam Conference.
bib doi pdf slides

Refereed Workshops and Symposia

Graph-Based Structure Aware Citation Intent Classification. Xinwei Du, Kian Ahrabian, Arun Baalaaji Sankar Ananthan, Richard Delwin Myloth, and Jay Pujara. (2023) Workshop on Scientific Document Understanding at AAAI.
bib pdf

Identifying Quantifiably Verifiable Statements from Text. Pegah Jandaghi and Jay Pujara. (2023) ACL Workshop on Matching From Unstructured and Structured Data .
bib doi pdf

Is Dynamicity All You Need?. Richard Delwin Myloth, Kian Ahrabian, Arun Baalaaji Sankar Ananthan, Xinwei Du, and Jay Pujara. (2023) Workshop on Scientific Document Understanding at AAAI.
bib pdf

Low-Resource Financial QA with Case-based Reasoning. Kexuan Sun and Jay Pujara. (2023) KDD Workshop on Robust NLP for Finance.
bib pdf

Changes in Research Collaborations During the Pandemic. Ziao Wang, Kian Ahrabian, Casandra Rusti, Jay Pujara, and Kristina Lerman. (2023) International Society of Scientometrics and Informetrics Conference.
bib pdf

Visual Sudoku Puzzle Classification: A Suite of Collective Neuro-Symbolic Tasks. Eriq Augustine, Connor Pryor, Charles Dickens, Jay Pujara, William Yang Wang, and Lise Getoor. (2022) Workshop on Neural-Symbolic Learning and Reasoning.
bib pdf

Understanding Narratives through Dimensions of Analogy. Thiloshon Nagarajah, Filip Ilievski, and Jay Pujara. (2022) IJCAI Workshop on Qualitative Reasoning.
bib pdf

Estimating Numbers Without Regression. Avijit Thawani, Jay Pujara, and Ashwin Kalyan. (2022) NeurIPS workshop on MathAI.
bib pdf

Story Generation with Commonsense Knowledge Graphs and Axioms. Filip Ilievski, Jay Pujara, and Hanzhi Zhang. (2021) AKBC Workshop on Commonsense Reasoning and Knowledge Bases.
bib pdf

Finding Pragmatic Differences Between Disciplines. Lee Kezar and Jay Pujara. (2021) NAACL Workshop on Scholarly Document Processing.
bib pdf

AutoTriggER: Named Entity Recognition with Auxiliary Trigger Extraction. Dong-Ho Lee, Ravi Kiran Selvam, Sheikh Muhammad Sarwar, Bill Yuchen Lin, Mahak Agarwal, Fred Morstatter, Jay Pujara, Elizabeth Boschee, James Allan, and Xiang Ren. (2021) NAACL Workshop on Trustworthy Natural Langugage Processing.
bib pdf

Human-like Time Series Summaries via Trend Utility Estimation. Pegah Jandaghi and Jay Pujara. (2020) Ninth International Workshop on Statistical Relational AI.
bib pdf

Collective Alignment of Large-scale Ontologies. Varun Embar, Jay Pujara, and Lise Getoor. (2019) AKBC Workshop on Federated KBs and the Open Knowledge Network.
bib pdf

An Intelligent Interface for Integrating Climate, Hydrology, Agriculture, and Socioeconomic Models. Daniel Garijo, Deborah Khider, Varun Ratnakar, Yolanda Gil, Ewa Deelman, Rafael Ferreira Silva, Craig Knoblock, Yao-Yi Chiang, Minh Pham, Jay Pujara, Binh Vu, Dan Feldman, Rajiv Mayani, Kelly Cobourn, Chris Duffy, Armen Kemanian, Lele Shu, Vipin Kumar, Ankush Khandelwal, Kshitij Tayal, Scott Peckham, Maria Stoica, Anna Dabrowski, Daniel Hardesty-Lewis, and Suzanne Pierce. (2019) Proceedings of the 24th International Conference on Intelligent User Interfaces: Companion.

Enterprise OKN: A Federated Knowledge Graph for Financial Data. Jay Pujara, Louiqa Raschid, Gerard Hoberg, Gordon Phillips, and Craig Knoblock. (2019) AKBC Workshop on Federated KBs and the Open Knowledge Network.
bib pdf

Parsing, Representing, and Transforming Units of Measure. Basel Shbita, Arunkumar Rajendran, Jay Pujara, and Craig Knoblock. (2019) Modeling the World's Systems.
bib pdf

Entity linking to knowledge graphs to infer column types and properties.. Avijit Thawani, Minda Hu, Erdong Hu, Husain Zafar, Naren Teja Divvala, Amandeep Singh, Ehsan Qasemi, and Jay Pujara. (2019) The Semantic Web Challenge on Tabular Data to Knowledge Graph Matching at ISWC.

Extensible and Scalable Entity Resolution for Financial Datasets Using RLTK. Yixiang Yao, Pedro Szekely, and Jay Pujara. (2019) SIGMOD Workshop on Data Science for Macro-modeling with Financial and Economic Datasets.
bib pdf

Aligning Product Categories using Anchor Products. Varun Embar, Golnoosh Farnadi, Jay Pujara, and Lise Getoor. (2018) WSDM Workshop on Knowledge Base Construction, Reasoning and Mining.
bib pdf

Feature Selection Methods For Understanding Business Competitor Relationships. Rahul Gupta, Jay Pujara, Craig A. Knoblock, Shushyam M. Sharanappa, Bharat Pulavarti, Gerard Hoberg, and Gordon Phillips. (2018) Fourth International Workshop on Data Science for Macro-Modeling with Financial and Economic Datasets.
bib doi pdf

Hybrid Link Prediction for Competitor Relationships. Jay Pujara. (2018) Fourth International Workshop on Data Science for Macro-Modeling with Financial and Economic Datasets.
bib doi pdf

Using Noisy Extractions to Discover Causal Knowledge. Dhanya Sridhar, Jay Pujara, and Lise Getoor. (2018) Sixth Workshop on Automated Knowledge Base Construction.
bib pdf

Extracting Knowledge Graphs from Financial Filings. Jay Pujara. (2017) Third International Workshop on Data Science for Macro-Modeling with Financial and Economic Datasets.
bib pdf

Adaptive Neighborhood Graph Construction for Inference in Multi-Relational Networks. Shobeir Fakhraei, Dhanya Sridhar, Jay Pujara, and Lise Getoor. (2016) 12th International Workshop on Mining and Learning with Graphs (MLG).
bib pdf

Generic Statistical Relational Entity Resolution in Knowledge Graphs. Jay Pujara and Lise Getoor. (2016) Sixth International Workshop on Statistical Relational AI.
bib pdf

Online Inference for Knowledge Graph Construction.. Jay Pujara, Ben London, Lise Getoor, and William W. Cohen. (2015) Fifth International Workshop on Statistical Relational AI.
bib pdf

A Unified Probabilistic Approach for Semantic Clustering of Relational Phrases. Adam Grycner, Gerhard Weikum, Jay Pujara, James Foulds, and Lise Getoor. (2014) Fourth Workshop on Automated Knowledge Base Construction.
bib pdf

Building Dynamic Knowledge Graphs. Jay Pujara and Lise Getoor. (2014) Fourth Workshop on Automated Knowledge Base Construction.
bib pdf

Probabilistic Models for Collective Entity Resolution Between Knowledge Graphs. Jay Pujara, Kevin Murphy, Xin Luna Dong, and Curtis Janssen. (2014) Bay Area Machine Learning Symposium.
bib pdf

Ontology-Aware Partitioning for Knowledge Graph Identification. Jay Pujara, Hui Miao, Lise Getoor, and William W. Cohen. (2013) Third Workshop on Automatic Knowledge Base Construction.
bib pdf slides

Extended Abstract: Large-Scale Knowledge Graph Identification using PSL. Jay Pujara, Hui Miao, Lise Getoor, and William W. Cohen. (2013) AAAI Fall Symposium on Semantics for Big Data.
bib pdf

Large-Scale Knowledge Graph Identification using PSL. Jay Pujara, Hui Miao, Lise Getoor, and William W. Cohen. (2013) Workshop on Structured Learning.
bib pdf

Joint Judgments with a Budget: Strategies for Reducing the Cost of Inference. Jay Pujara, Hui Miao, and Lise Getoor. (2013) Workshop on Machine Learning with Test-Time Budgets.
bib pdf

Social Group Modeling with Probabilistic Soft Logic. Bert Huang, Stephen H. Bach, Eric Norris, Jay Pujara, and Lise Getoor. (2012) Workshop on Social Network and Social Media Analysis: Methods, Models, and Applications.
bib pdf

Large-Scale Hierarchical Topic Models. Jay Pujara and Peter Skomoroch. (2012) Workshop on Big Learning.
bib pdf

Facilitating Medication Reconciliation with Animation and Spatial Layout. Leo Claudino, Sameh Khamis, Ran Liu, Ben London, Jay Pujara, Catherine Plaisant, and Ben Shneiderman. (2011) Workshop on Interactive Healthcare Systems.
bib pdf

Reducing Label Cost by Combining Feature Labels and Crowdsourcing. Jay Pujara, Ben London, and Lise Getoor. (2011) Workshop on Combining Learning Strategies to Reduce Label Cost.
bib pdf slides

Coarse-to-Fine, Cost-Sensitive Classification of E-Mail. Jay Pujara and Lise Getoor. (2010) Workshop on Coarse-to-Fine Processing.
bib pdf slides

Patents

User Trustworthiness. Jay Pujara, Vishwanath Ramarao, Xiaopeng Xi, Martin Zinkevich, Anirban Dasgupta, Belle Tseng, Wei Chu, and Gareth Shue. (2016) Patent 9519682.

Real-Time Ad-Hoc Spam Filtering of E-Mail. Jay Pujara. (2011) Patent 8069128.

Employing Pixel Density to Detect a Spam Image. Ke Wei, Hao Zheng, and Jay Pujara. (2011) Patent 7882177.

Identifying IP Addresses for Spammers. Jaesik Choi, Jay Pujara, Vishwanath Ramarao, and Ke Wei. (2010) Patent 7849146.