For a downloadable version of most of these papers, see the IR Lab Publications Page
Publications
- Sajad Sotudeh, Nazli Goharian, "TSTR: Too Short to Represent, Summarize with Details! Intro-Guided Extended Summary Generation", Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: (NAACL 2022)
- Hrishikesh Kulkarni, Sean MacAvaney, Nazli Goharian, Ophir Frieder, "TBD3: A Thresholding-Based Dynamic Depression Detection from Social Media for Low-Resource Users", Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022)
- Sajad Sotudeh, Nazli Goharian, Zachary Young, "MentSum: A Resource for Exploring Summarization of Mental Health Online Posts", Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022)
- Sean MacAvaney, Sergy Feldman, Nazli Goharian, Doug Downey, Arman Cohan, "ABNIRML: Analyzing the Behavior of Neural IR Models", Transactions of the Association for Computational Linguistics (TACL); 2021
- Sajad Sotudeh, Hanieh Deilamsalehy, Franck Dernoncourt, and Nazli Goharian, "TLDR9+: A Large Scale Resource for Extreme Summarization of Social Media Posts", Proceedings of the Workshop on New Frontiers in Summarization (2021)
- Sean MacAvaney, Andrew Yates, Sergey Feldman, Doug Downey, Arman Cohan, and Nazli Goharian, "Simplified Data Wrangling with ir_datasets", ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2021)
- Tong Xiang, Sean MacAvaney, Eugene Yang, Nazli Goharian, "ToxCCIn: Toxic Content Classification with Interpretability", 11th Workshop on Computational Approaches to Subjectivity, Sentiment & Social Media Analysis (WASSA 2021)
- Sajad Sotudeh, Arman Cohan, and Nazli Goharian, "On Generating Extended Summaries of Long Documents", The AAAI-21 Workshop on Scientific Document Understanding (SDU 2021)
- Sean MacAvaney, Franck Dernoncourt, Walter Chang, Nazli Goharian, and Ophir Frieder, "Interaction Matching for Long-Tail Multi-Label Classification", The AAAI-21 Workshop on Scientific Document Understanding (SDU 2021)
- Sajad Soutdeh, Arman Cohan, and Nazli Goharian, "GUIR @ LongSumm 2020: Learning to Generate Long Summaries from Scientific Documents", Workshop on Scholarly Document Processing (SDP 2020)
- Sean MacAvaney, Arman Cohan, and Nazli Goharian, "SLEDGE-Z: A Zero-Shot Baseline for COVID-19 Literature Search",Empirical Methods in Natural Language Processing (EMNLP 2020)
- Sajad Sotudeh, Tong Xiang, Hao-Ren Yao, Sean MacAvaney, Eugene Yang, Nazli Goharian, Ophir Frieder
, "GUIR at SemEval-2020 Task 12: Domain-Tuned Contextualized Models for Offensive Language Detection", International Workshop on Semantic Evaluation (SemEval 2020)
- Sean MacAvaney, Franco Maria Nardini, Raffaele Perego, Nicola Tonellotto, Nazli Goharian, Ophir Frieder, "Efficient Document Re-Ranking for Transformers by Precomputing Term Representations", SIGIR (July 2020)
- Sean MacAvaney, Franco Maria Nardini, Raffaele Perego, Nicola Tonellotto, Nazli Goharian, Ophir Frieder, "Training Curricula for Open Domain Answer Re-Ranking", SIGIR (July 2020)
- Sean MacAvaney, Franco Maria Nardini, Raffaele Perego, Nicola Tonellotto, Nazli Goharian, Ophir Frieder, "Expansion via Prediction of Importance with Contextualization", SIGIR (July 2020) (short)
- Sajad Sotudeh, Nazli Goharian, Ross W. Filice, "Attend to Medical Ontologies: Content Selection for Clinical Abstractive Summarization", ACL (July 2020). (short)
- Sean MacAvaney, Luca Soldaini, and Nazli Goharian, "Teaching a New Dog Old Tricks: Resurrecting Multilingual Retrieval Using Zero-shot Learning",European Conference on Information Retrieval (ECIR) 2020.
- Sean MacAvaney, Arman Cohan, Nazli Goharian, and Ross Filice, "Ranking Significant Discrepancies in Clinical Reports", European Conference on Information Retrieval (ECIR) 2020.
- Sean MacAvaney, Hao-Ren Yao, Eugene Yang, Katina Russell, Nazli Goharian, and Ophir Frieder, "Hate speech detection: Challenges and solutions", PLoS ONE, 2019.
- Sean MacAvaney, Andrew Yates, Arman Cohan, and Nazli Goharian, "CEDR: Contextualized Word Representations for Document Re-Ranking", SIGIR 2019. (short)
- Sean MacAvaney*, Sajad Sotudeh Gharebagh*, Arman Cohan, Nazli Goharian, Ross Filice, and Ish Talati, "Ontology-Aware Clinical Abstractive Summarization", SIGIR 2019. *Equal contribution (short)
- Sean MacAvaney, Andrew Yates, Arman Cohan, Luca Soldaini, Kui Hui, Nazli Goharian, Ophir Frieder, "Overcoming Low-Utility Facets for Complex Answer Retrieval", Information Retrieval Journal, 2018
- Arman Cohan, Nazli. Goharian, "Scientific Document Summarization via Citation Contextualization and Scientific Discourse", International Journal on Digital Libraries, Special Issue on Bibliometric-Enhanced Information Retrieval and Natural Language Processing, Springer. (online publication: May 2017; Print: Volume 19, Issue 2-3, September 2018).
- Ziling Fan, Arman Cohan, Luca Soldaini, Nazli Goharian, "Relation Extraction for Protein-protein Interactions Affected by Mutations", The 9th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics (ACM BCB), Aug 2018.
- Arman Cohan*, Bart Desmet*, Andrew Yates*, Luca Soldaini, Sean MacAvaney and Nazli Goharian, "SMHD: a Large-Scale Resource for Exploring Online Language Usage for Multiple Mental Health Conditions", COLING 2018; Area Chair Favorite
[*Equal contribution]
- Sean MacAvaney, Andrew Yates, Arman Cohan, Luca Soldaini, Kai Hui, Nazli Goharian, and Ophir Frieder, "Characterizing Question Facets for Complex Answer Retrieval", SIGIR 2018.
- Sean MacAvaney, Luca Soldaini, Arman Cohan, and Nazli Goharian, "Tree-LSTMs for Scientific Relation Classification",SemEval 2018
- Arman Cohan, Franck Dernoncourt, Doo Soon Kim, Trung Bui, Seokhwan Kim, Walter Chang, and Nazli Goharian, "A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents", NAACL-HLT 2018.
- Luca Soldaini, Timothy Walsh, Arman Cohan, Julien Han, and Nazli Goharian, "Helping or Hurting? Predicting Changes in Users' Risk of Self-Harm Through Online Community Interactions", CLPsych 2018.
- Sean MacAvaney, Bart Desmet, Arman Cohan, Luca Soldaini, Andrew Yates, Ayah Zirikly, and Nazli Goharian, "RSDD-Time: Temporal Annotation of Self-Reported Mental Health Diagnoses", CLPsych 2018.
- Andrew Yates*, Arman Cohan*, Nazli Goharian, "Depression and Self-Harm Risk Assessment in Online
Forums", Conference on Empirical Methods in Natural Language Processing (EMNLP), Sept 2017. Best Long Paper Award [* Equal Contribution]
- Andrew Yates*, Arman Cohan*, Nazli Goharian, "Depression and Self-Harm Risk Assessment in Online
Forums", Conference on Empirical Methods in Natural Language Processing (EMNLP), Sept 2017. EMNLP'17 Best Long Paper Award. [* Equal Contribution]
- Arman Cohan, and Nazli Goharian, "Contextualizing Citations for Scientific Summarization using Word Embeddings and Domain Knowledge", ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2017), Aug 2017. (short).
- Luca Soldaini, Nazli Goharian, Andrew Yates, "Learning to Reformulate Long Queries for Clinical Decision Support", Journal of the Association for Information Science and Technology (JASIST) Special Issue on Biomedical Information Retrieval, DOI: 10.1002/asi.23924, 2017.
- Arman Cohan, Allen Fong, Raj Ratwani, Nazli Goharian, "Identifying Harm Events in Clinical Care through Medical Narratives", The 8th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics (ACM BCB), Aug 2017.
- Sean MacAvaney, Arman Cohan, and Nazli Goharian "A Framework for Cross-Domain Clinical Temporal Information Extraction", International Workshop on Semantic Evaluation (SemEval 2017), Aug 2017.
- Arman Cohan, Nazli. Goharian, "Scientific Document Summarization via Citation Contextualization and Scientific Discourse", International Journal of Digital Libraries, Special Issue on Bibliometric-Enhanced Information Retrieval and Natural Language Processing, Springer. May 2017.
- A. Cohan, S. Young, A. Yates, N. Goharian, "Triaging Content Severity in Online Mental-Health Forums", Journal of the Association for Information Science and Technology (JASIST), Special Issue, DOI:10.1002/asi.23865, 2017.
- A. Cohan, A. Fong, N. Goharian, and R. Ratwani, "A Neural Attention Model for Categorizing Patient Safety Events", In Proceedings of the 39th European Conference on Information Retrieval (ECIR '17), April 2017.
- L. Soldaini and N. Goharian, "Learning to Rank for Consumer Health Search: a Semantic Approach", In Proceedings of the 39th European Conference on Information Retrieval (ECIR '17), April 2017.
- L. Soldaini and N. Goharian, "QuickUMLS: a fast, unsupervised approach for medical concept extraction", In Proceedings of the Medical Information Retrieval (MedIR) workshop at SIGIR 2016, July 2016.
- A. Cohan, K. Meurer, N. Goharian, "Temporal Information Processing for Clinical Narratives", NAACL HLT 10th international workshop on semantic evaluation Semantic Analysis: Clinical TempEval (SemEval'16), June 2016.
- A. Cohan, S. Young, N. Goharian, "Triaging Mental Health Forum Posts", NAACL HLT 3rd Computational Linguistics and Clinical Psychology - From Linguistic Signal to Clinical Reality Workshop (CLPsych'16), June 2016.
- A. Cohan and N. Goharian, "Revisiting Summarization Evaluation for Scientific Articles", In Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC'16), May 2016.
- A.Yates, A. Kolcz, N. Goharian and O. Frieder, "Effects of Sampling on Twitter Trend Detection", In Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC'16), May 2016.
- A. Yates, J. Joselow, N. Goharian, "The news cycle's influence on social media activity", International AAAI Conference on Web and Social Media (ICWSM), May 2016.
- A. Yates, N. Goharian, O. Frieder, "Learning the relationships between drug, symptom, and medical condition mentions in social media", International AAAI Conference on Web and Social Media (ICWSM). May 2016.
- A. Cohan, L. Soldaini, N. Goharian, A. Fong, R. Filice, R. Ratwani, "Identifying Significance of Discrepancies in Radiology Reports", SIAM International Conference on Data Mining (SDM) Workshop on data Mining for Medicine and Healthcare, May 2016.
- A. Cohan, and N. Goharian , "Scientific Article Summarization Using Citation-Context and Article's Discourse Structure", Empirical Methods in Natural Language Processing (EMNLP' 15), 2015.
- L. Soldaini, A. Yates, E. Yom-Tov, O. Frieder, and N. Goharian, "Enhancing web search in the medical domain via query clarification," Information Retrieval Journal, July 2015, doi:10.1007/s10791-015-9258-y.
- A. Cohan, L. Soldaini, N. Goharian, "Matching Citation Text and Cited Spans in Biomedical Literature: a Search-Oriented Approach" North American Chapter of the Association for Computational Linguistics . Human Language Technologies (NAACL HLT 2015).
- L. Soldaini, A. Cohan, A. Yates, N. Goharian, O. Frieder, "Retrieving Medical Literature for Clinical Decision Support", 37th European Conference on Information Retrieval (ECIR 2015), 2015.
- A. Yates, N. Goharian, O. Frieder, "Extracting Adverse Drug Reactions from Social Media", Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence (AAAI-15), 2015.
- J. Parker, A. Yates, N. Goharian, and O. Frieder, "Health Related Hypothesis Generation using Social Media Data", Journal of Social Network Analysis and Mining, Springer, 2015.
- A. Cohan, L. Soldaini, A. Yates, N. Goharian, O. Frieder, "On Clinical Decision Support System", 5th ACM conference on Bioinformatics, Computational Biology, and Health Informatics, 2014.
- A. Yates, J. Parker, N. Goharian, and O. Frieder, "A Framework for Public Health Surveillancei", In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC.14), May 2014.
- A. Yates, N. Goharian, O. Frieder, "Relevance-Ranked Domain-Specific Synonym Discoveryi", in 36th European Conference on Information Retrieval (ECIR), April 2014.
- A. Yates, N. Goharian, W. Yee, "Semi-supervised Sentiment Analysis: Merging Labeled Sentences with Unlabeled Reviews to Identify Sentiment", American Society for Information Science and Technology (ASIST), Nov. 2013.
- E. W. Burger, H. Federoff, O. Frieder, N. Goharian, A. Yates, Social Media Communications Networks and Pharmacovigilance: SequelAE-2.0'), IEEE 15th International Conference on e-Health Networking, Applications and Services (Healthcom), Oct 2013, (short).
- J. Parker, Y. Wei, A. Yates, O. Frieder, N. Goharian, A Framework for Detecting Public Health Trends with Twitter., The 2013 IEEE/ACM International Conference on Advances in Social Network Analysis and Mining, Aug. 2013.
- A. Yates, N. Goharian, O. Frieder, "Extracting Adverse Drug Reactions from Forum Posts and Linking them to Drugs", SIGIR Workshop on Health Search and Discovery, July-Aug 2013.
- Y. Zhu and N. Goharian, To Follow or Not to Follow: A Feature Evaluation, 22nd International Conference on World Wide Web (WWW), May 2013 (short).
- A. Yates, N. Goharian, and O. Frieder, Graded Relevance Ranking for Synonym Discovery, 22nd International Conference on World Wide Web (WWW), May 2013 (short).
- A. Yates and N. Goharian, Detecting Expected and Unexpected Adverse Drug Reactions from User Reviews on Social Media Sites, in 35th European Conference on Information Retrieval (ECIR), 2013. (short).
- Z. Tan, N. Goharian, M. Sherr, $100,000 Prize Jackpot. Call Now! Identifying the Pertinent Features of SMS Spam, In proceedings of ACM 35th Conference on Research and Development in Information Retrieval (SIGIR), August 2012.
- J. Parker, A. Yates, N. Goharian, Efficient Estimation of Aspect Weights, In proceedings of ACM 35th Conference on Research and Development in Information Retrieval (SIGIR), August 2012.
- N. Goharian, S. Mengle, "Networked Hierarchies for Web Directories", 20th International World Wide Web conference (WWW), March 2011.
- N. Goharian, S. Mengle, Context Aware Query Classification Using Dynamic Query Window and Relationship Net, In proceedings of ACM 33rd Conference on Research and Development in Information Retrieval (SIGIR), July 2010.
- S. Mengle and N. Goharian, Detecting Relationships among Categories using Text Classification, Journal of American Society for Information Science and Technology (JASIST), 61(5), May 2010.
- N. Goharian, O. Frieder, W. G. Yee, J. Mudrawala, Enriching Peer-to-Peer File Descriptors Using Association Rules on Query Logs, 32nd European Conference On Information Retrieval (ECIR), March 2010.
- S. Mengle, N. Goharian, Mining Temporal Relationships Among Categories, ACM 25th Symposium on Applied Computing (SAC), March 2010.
- S. Mengle and N. Goharian, Ambiguity Measure Feature Selection Algorithm., Journal of American Society for Information Science and Technology (JASIST), 60(5), April 2009.
- S. Mengle and N. Goharian, Passage Detection Using Text Classification, Journal of American Society for Information Science and Technology (JASIST), 60 (4), March 2009.
- J. Urbain, O. Frieder, N. Goharian, Passage relevance models for genomics search, BMC Bioinformatics, 10(Suppl 3):S3 (19 March 2009).
- A. Platt, S. Mengle, N. Goharian, Improving Classification Based Off-topic Search Detection via Category Relationships., ACM 24th Symposium on Applied Computing (SAC), March 2009.
- S. Liu, Y. Mehrav, W. G. Yee, N. Goharian, and O. Frieder, "A Sentence Level Probabilistic Model for Evolutionary Theme Pattern Mining for News Corpora", ACM 24th Symposium on Applied Computing (SAC), March 2009.
- J. Urbain, O. Frieder, and N. Goharian, "A Dimensional Retrieval Model for Integrating Semantics and Statistical Evidence in Context for Genomics Literature Search," Journal- Computers in Biology and Medicine, 39(1), January 2009.
- J. Urbain, O. Frieder, and N. Goharian, "Passage Relevance Models for Genomics Search., ACM Second International Workshop on Data and Text Mining in Bioinformatics, Napa Valley, California, October 2008.
- J. Urbain, N. Goharian, and O. Frieder, "Probabilistic Passage Models for Semantic Search of Genomics Literature," Journal of the American Society of Information Science and Technology (JASIST), 59(12), September 2008.
- N. Goharian, S. Mengle, On Document Splitting in Passage Detection, In Proceedings of 31st Conference on Research and Development in Information Retrieval (SIGIR), July 2008.
- S. Mengle, N. Goharian, .Using Ambiguity Measure Feature Selection Algorithm for Support Vector Machine Classifier., ACM 23rd Symposium on Applied Computing (SAC), March 2008
- S. Mengle, N. Goharian, Detecting Hidden Passages from Documents. In proceedings of SIAM Conference on Data Mining (SDM 2008) Workshop, April 2008.
- S. Mengle, N. Goharian, A. Platt, Discovering Relationships among Categories using Misclassification Information., ACM 23rd Symposium on Applied Computing (SAC), March 2008.
- J. Urbain, N. Goharian O. Frieder, Combining Semantics, Context, and Statistical Evidence in Genomics Literature Search., The 7th IEEE International Conference on Bioinformatics and Bioengineering (BIBE), 2007.
- S. Mengle, N. Goharian, A. Platt , FACT: Fast Algorithm for Categorizing Text, IEEE 5th International Conference on Intelligence and Security Informatics (ISI), May 2007.
- N. Goharian and A. Platt, DOTS: Detection of Off-Topic Search Via Result Clustering, IEEE 5th International Conference on Intelligence and Security Informatics (ISI), May 2007.
- A. Platt, N. Goharian, S. Mengle, Using User Query Sequence to Detect Off-Topic Search, ACM 22nd Symposium on Applied Computing (SAC), March 2007.
- N. Goharian, A. Platt, Detection Using Clustering Query Results. IEEE International Conference on Intelligence and Security Informatics (ISI), May 2006.
- N. Goharian, L. Ma, Off-Topic Access Detection In Information Systems, ACM 14th Conference on Information and Knowledge Management (CIKM), November 2005.
- N. Goharian, L. Ma, C. Meyer, Detecting Misuse of Information Retrieval Systems Using Data Mining Techniques, IEEE International Conference on Intelligence and Security Informatics (ISI), May 2005.
- S. Argamon, N. Goharian, D. Grossman, O. Frieder, N. Raju, A Specialization in Information and Knowledge Management Systems for the Undergraduate Computer Science Curriculum, IEEE International Conference on Information Technology: Coding & Computing (ITCC 2005), April 2005.
- N. Goharian, L. Ma, "Query Length Impact on Misuse Detection in Information Retrieval Systems", ACM 20th Symposium on Applied Computing (SAC), March 2005
- N. Goharian, L. Ma, "Using Relevance Feedback to Detect Misuse in Information Retrieval Systems" ACM 13th Conference on Information and Knowledge Management (CIKM), November 2004.
- S. Beitzel, E. Jensen, A. Chowdhury, D. Grossman, O. Frieder, N. Goharian," On Fusion of Effective Retrieval Strategies in the Same Information Retrieval System", Journal of American Society for Information Science and Technology (JASIST), 2004.
- N. Goharian, D. Grossman, O. Frieder, N. Raju, "Migrating Information Retrieval from the Graduate to the Undergraduate Curriculum", Journal of Information Systems Education, vol 15(1), April 2004.
- N. Goharian, D. Grossman, N. Raju "Extending the Undergraduate Computer Science Curriculum To Include Data Mining", IEEE International Conference on Information Techniques on: Coding & Computing (ITCC 2004), Las Vegas, Nevada, April 2004.
- ,P. Jain, N. Goharian, A. Weiser, S. Kimm, S. Kim, J. Stern, J. Pazona, C. Wambi, R. Yap, L. Blunt, and R. Nadler, "Efficiency and Safety of the Healthtronics LithoTron® Lithotripter ,Journal of EndoUrology, 18 (1), January/February 2004.
- R. Cathey, L. Ma, N. Goharian, D. Grossman, "Misuse Detection for Information Retrieval Systems", ACM 12th Conference on Information and Knowledge Management (CIKM), November 2003
- L. Ma, N. Goharian, A. Chowdhury, M. Chung, "Extracting Unstructured Data From Template Generated Web Documents", ACM 12th Conference on Information and Knowledge Management (CIKM), November 2003
- E. Jensen, S. Beitzel, A. Pilotto, N. Goharian, O. Frieder "Parallelizing the Buckshot Algorithm for Efficient Document Clustering", ACM Conference on Information and Knowledge Management (CIKM), November 2002.
TREC (Text Retrieval conference) & TAC (Text Analysis Conference) Participation:
- A. Cohan, L. Soldaini, S. Mengle, N. Goharian, .Towards Citation-Based Summarization of Biomedical Literature., Proc. of Text Analysis Conference (TAC Summarization Track), 2014.
- L. Soldaini, A. Cohan, A. Yates, N. Goharian, O. Frieder, "Query Reformulation for Clinical Decision Support Search", Proc of the 23rd Text REtrieval Conference Proceedings (TREC - Medical Track), 2014.
- A. Yates , D. DeBoer , H. Yang , N. Goharian , S. Kunath , O. Frieder, (Not Too) Personalized Learning to Rank for Contextual Suggestion, TREC 2012, Contexual Track.
-
D. Guan, H. Yang, N. Goharian, Effective Structured Query Formulation for Session Search, TREC 2012. Query Session Track.
- J. Urbain, N. Goharian, O. Frieder, IIT TREC-2007 Genomics Track: Using Concept-based Semantics in Context for Genomics Literature Passage Retrieval, TREC 2007.
- J. Urbain, N. Goharian, O. Frieder, .IIT TREC-2006: Genomics Track., Proceedings of the Fifth Text REtrieval Conference, November 2006.
- J. Urbain, N. Goharian, O. Frieder, .IIT TREC-2005: Genomics Track., Proceedings of the Fourteenth Text REtrieval Conference, November 2005.