Doug Oard's Research Page



This page contains a mix of peer reviewed and unrefereed journal articles, book chapters, and conference and workshop papers, and an edited book, organized by subject and listed most-recent-first within a subject. Many of the papers are available here as PDF. Sometimes the PDF here is the initially submitted version rather than the version finally published (this should be clear from formatting); links to the publisher's Web site are provided for journal articles when possible. This page is sometimes updated less frequently than I would like, so if there's something specific that you are looking for that is not yet here let me know and I'll do my best to get it posted.

Papers Written for a Broad Audience

This is a mix of overviews of a topic that were prepared for various venues and position papers that describe a specific interest that are sometimes prepared as a basis for discussion at a workshop.
  1. James Mayfield, Eugene Yang, Dawn Lawrie, Sean MacAvaney, Paul McNamee, Douglas W. Oard, Luca Soldaini, Ian Soboroff, Orion Weller, Efsun Kayi, Kate Sanders, Marc Mason and #Noah Hibbler, On the Evaluation of Machine-Generated Reports, Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, Washington DC, Perspectives paper, 12 pages, 2024. PDF
  2. Petra Galuščáková and Douglas W. Oard and Suraj Nair, Cross-Language Information Retrieval, CoRR abs/2111.05988, 49 pages, 2022. (PDF preprint)
  3. Tetsuya Sakai, Douglas W. Oard, and Noriko Kando, Evaluating Information Retrieval and Access Tasks: NTCIR's Legacy of Research Impact, Springer, 2020. (Publisher Open Access)
  4. Douglas W. Oard, The Future of Information Retrieval Evaluation, in Evaluating Information Retrieval and Access Tasks: NTCIR's Legacy of Research Impact, Springer, 2020. (PDF preprint)
  5. Ben Carterette, Hussein Suleman and Douglas W. Oard, Report on the 1st ACM SIGIR/SIGKDD Africa School on Machine Learning for Data Mining and Search. SIGIR Forum 53(1): 3-13 (2019). (PDF)
  6. Mihai Lupu, Atsushi Fujii, Douglas W. Oard, Makoto Iwayama, and Noriko Kando, Patent-Related Tasks at NTCIR, in Mihai Lupu et al, Current Challenges in Patent Information Retrieval (Second Edition), pp.77-111, Springer-Verlag, 2017. (PDF preprint) (Publisher)
  7. Douglas W. Oard. The Moonwalkers Who Could Have Been, Quest: The History of Spaceflight Quarterly, 23(3), 51-53, 2016. (PDF)
  8. Douglas W. Oard, Amalia S. Levi, Ricardo L. Punzalan and Robert Warren, "Bridging Communities of Practice: Emerging Technologies for Content-Centered Linking," in 18th Annual Museums and the Web Conference, 10 pages, Baltimore, MD, 2014. (PDF)
  9. Douglas W. Oard and Joseph Malionek, "The Apollo Archive Explorer," in Joint Conference on Digital Libraries, 2 page demonstration description, Indianapolis, IN, 2013. (PDF)
  10. Douglas W. Oard and William Webber, "Information Retrieval for E-Discovery," Foundations and Trends in Information Retrieval, 7(2-3)100-237, 2013. (PDF) (Publisher)
  11. Douglas W. Oard, ``Can Automatic Speech Recognition Replace Manual Transcription?,'' Oral History in the Digital Age (Web resource), 2012. (HTML)
  12. Douglas W. Oard, "A Whirlwind Tour of Automated Language Processing for the Humanities and Social Sciences," in Working Together or Apart: Promoting Digital Scholarship, Council on Library and Information Resources, 2009. (PDF)
  13. Douglas W. Oard, "Multilingual Information Access," in Encyclopedia of Library and Information Sciences, 3rd Ed., edited by Marcia J. Bates, Editor, and Mary Niles Maack, Associate Editor, Taylor & Francis, 2009. (PDF)
  14. Douglas W. Oard, "Unlocking the Potential of the Spoken Word," Science, 321(5897)1787-1788, 2008. (Publisher)
  15. Franciska de Jong, Douglas W. Oard, Willemijn Heeren and Roeland Ordelman, "Access to Recorded Interviews: A Research Agenda," ACM Journal on Computing and Cultural Heritage, 1(1)1-27, 2008. (PDF), (Publisher)
  16. Franciska de Jong, Douglas W. Oard, Roeland Ordelman and Stephan Raaijmakers, "Searching Spontaneous Conversational Speech," workshop report in SIGIR Forum, 41(2)104-108, 2007. (PDF)
  17. Douglas W. Oard, "Transcending the Tower of Babel: Supporting Access to Multilingual Information with Cross-Language Information Retrieval," in Robert Popp and John Yen, ed., Emergent Information Technologies and Enabling Policies for Counter-Terrorism, Prentice Hall, Chapter 15, pp. 299-314, 2006. (PDF)
  18. Douglas W. Oard, "Towards Analysis Tools for a Multilingual Blogsphere," in AAAI Spring Symposium on Computational Approaches to Analyzing Weblogs, Stanford, CA, 3 pages, 2006. (PDF)
  19. Jerry Goldman, Steve Renals, Steven Bird, Franciska de Jong, Marcello Federico, Carl Fleischhauer, Mark Kornbluh, Lori Lamel, Douglas W. Oard, Fabrizio Sebastiani, Claire Stewart and Richard Wright, "Transforming Access to the Spoken Word," International Journal on Digital Libraries, 5(4)287-298, 2005. (PDF), (Publisher)
  20. Douglas W. Oard, "The SIGIR Workshop Program," SIGIR Forum, 39(2)15-16, 2005. (PDF)
  21. Douglas W. Oard, "The Surprise Language Exercises," ACM Transactions on Asian Language Information Processing, 2(2)79-84, 2003. (PDF) (Publisher)
  22. Douglas W. Oard, "Coping with Surprise: Responsive Language Technology", Team TIDES, p. 2, October 2003. (PDF)
  23. Douglas W. Oard, "Surprise: It's Cebuano!", Team TIDES, pp. 2-3, April 2003. (PDF)
  24. Douglas W. Oard, "Interactive Cross-Language Information Retrieval," workshop report in SIGIR Forum, 35(1)1-3, 2001. (PDF)
  25. Judith Klavans, Eduard Hovy, Christian Fluhr, Robert Frederking, Douglas Oard, Akitoshi Okumura, Kai Ishikawa, and Kenji Satoh, "Multilingual (or Cross-Lingual) Information Retrieval" in Multilingual Information Management: Current Levels and Future Abilities, Eduard Hovy, Nancy Ide, Robert Frederking, Joseph Mariani, Antonio Zampolli (eds.), Chapter 2, pp. 35-56, 2001. (HTML)
  26. Douglas W. Oard and Anne R. Diekema, "Cross-Language Information Retrieval," in Martha Williams (ed.), in Annual Review of Information Science and Technology, Volume 33, Chapter 6, pp. 223-256, 1998. (ASCII)
  27. Douglas Oard, Carol Peters, Miguel Ruiz, Robert Frederking, Judith Klavans, and Paraic Sheridan, "Multilingual Information Discovery and Access (MIDAS): A Joint ACM DL '99 / ACM SIGIR '99 Workshop," D-Lib Magazine, October, 1999. (HTML)
  28. Douglas W. Oard, "Extending Cross-Language Information Retrieval to a Global Scale," NSF Workshop on Multilingual Information Management, pp. 24-25, Granada, Spain, 1998. (PDF)
  29. Douglas W. Oard, "The State of the Art in Text Filtering." User Modeling and User Adapted Interaction, 7(3)141-178, 1997. (PDF)
  30. Douglas W. Oard, "Serving Users In Many Languages: Cross-Language Information Retrieval for Digital Libraries" D-Lib Magazine, December, 1997. (HTML)
  31. Douglas W. Oard, "Alternative Approaches for Cross-Language Text Retrieval," in AAAI Symposium on Cross-Language Text and Speech Retrieval, pp. 131-139, Palo Alto CA, 1997. (PDF)
  32. Douglas W. Oard, "Speech-Based Information Retrieval for Digital Libraries," AAAI Symposium on Cross-Language Text and Speech Retrieval, Palo Alto, CA, 1997. (PDF)
  33. Douglas W. Oard, "Cross-Language Text Retrieval Research in the USA," Third DELOS Workshop: Cross-Language Information retrieval, pp. 7-16, Zurich, 1997. (PDF)
  34. Douglas W. Oard and Bonnie J. Dorr, "A Survey of Multilingual Text Retrieval," University of Maryland Computer Science Department, 31 pp., CS-TR-3615, 1996. (PDF)
  35. Christos Faloutsos and Douglas Oard, "A Survey of Information Retrieval and Filtering Methods," University of Maryland Computer Science Department, 23 pp., CS-TR-3514, 1995. (PDF)

TREC NeuCLIR Track (2022-2024)

These are track overview and track description papers that resulted from my work as a track coordinator in the Text Retroeval Conference (TREC) Neural Cross-Language Infromation Retrieval (NeuCLIR) Track. These papers describe evaluation design issues for information retrieval systems that are designed to support a search using math. My own research on information retrieval techniques using these evaluation designs can be found below in the Muntlingual Information Access section.
  1. Dawn Lawrie, Sean MacAvaney, James Mayfield, Paul McNamee, Douglas W. Oard, Luca Soldaini and Eugene Yang, Overview of the TREC 2023 NeuCLIR Track, TREC, 2023. (PDF)
  2. Dawn Lawrie, Sean MacAvaney, James Mayfield, Paul McNamee, Douglas W. Oard, Luca Soldaini and Eugene Yang, Overview of the TREC 2022 NeuCLIR Track, TREC, 2022. (PDF)

CLEF ARQMath Lab (2020-2022)

These are track overview and track description papers that resulted from my work as a track coordinator in the Copnferences and Labs of the Evaluation Forum (CLEF) Answer Retrieval for Questions on Math (ARQMATH) lab. These papers describe evaluation design issues for information retrieval systems that are designed to support a search using math. My own research on information retrieval techniques using these evaluation designs can be found below in the Math Search section.
  1. Behrooz Mansouri, Vit Novotny, Anurag Agrawal, Douglas W. Oard and Richard Zanibbi, Overview of ARQMath-3 (2022): Third CLEF Lab on Answer Retrieval for Questions on Math (Working Notes Version). Working Notes of CLEF, pp. 1-25, 2022. (PDF)
  2. Behrooz Mansouri, Vit Novotny, Anurag Agrawal, Dougl0as W. Oard and Richard Zanibbi, Overview of ARQMath-3 (2022): Third CLEF Lab on Answer Retrieval for Questions on Math, CLEF, Springer LNCS, 2022. (PDF)
  3. Behrooz Mansouri, Anurag Agarwal, Douglas W. Oard and Richard Zanibbi, Advancing Math-Aware Search: The ARQMath-3 Lab at CLEF 2022, 8 pages, ECIR, 2022. (PDF)
  4. Behrooz Mansouri, Richard Zanibbi, Douglas W. Oard and Anurag Agarwal, Overview of ARQMath-2 (2021): Second CLEF Lab on Answer Retrieval for Questions on Math (Working Notes Version). Working Notes of CLEF, pp. 1-24, 2021. (PDF)
  5. Behrooz Mansouri, Richard Zanibbi, Douglas W. Oard, Anurag Agarwal, Overview of ARQMath-2 (2021): Second CLEF Lab on Answer Retrieval for Questions on Math, CLEF, Springer LNCS, pp. 215-238, 2021. (PDF)
  6. Behrooz Mansouri, Anurag Agarwal, Douglas W. Oard and Richard Zanibbi, Advancing Math-Aware Search: The ARQMath-2 Lab at CLEF 2021, ECIR, 7 pages, 2021. (PDF)
  7. Richard Zanibbi, Behrooz Mansouri, Anurag Agarwal and Douglas W. Oard, ARQMath: A new benchmark for math-aware CQA and math formula retrieval. SIGIR Forum 54(2): 4:1-4:9, 2020. (PDF)
  8. Richard Zanibbi, Douglas W. Oard, Anurag Agarwal and Behrooz Mansouri, Overview of ARQMath 2020: CLEF Lab on Answer Retrieval for Questions on Math (Updated Working Notes Version with Eratta 2 incorporated), Working Notes of CLEF, 27 pages, 2020, corrected in 2021. (PDF)
  9. Richard Zanibbi, Douglas W. Oard, Anurag Agarwal, and Behrooz Mansouri, Overview of ARQMath 2020: CLEF Lab on Answer Retrieval for Questions on Math, CLEF, Springer LNCS, 2020. (PDF)
  10. Behrooz Mansouri, Anurag Agarwal, Douglas W. Oard, Richard Zanibbi, Finding Old Answers to New Math Questions: The ARQMath Lab at CLEF 2020, ECIR, pp. 564-571, 2020. (PDF)

FIRE Track Overviews (2011-2013)

These are track overview papers that resulted from my work as a track coordinator in the Forum for Information Retrieval Evaluation (FIRE). These papers describe evaluation design issues for information retrieval systems that are designed to support a search for digital evidence in a litigation context. My own research on information retrieval techniques using these evaluation designs can be found below in the Document Image Retrieval and Speech sections.
  1. Douglas W. Oard, Jerome White, Jaiul Paik, Rashmi Sankepally and Aren Jansen, "The FIRE 2013 Question Answering for the Spoken Web Task," Fifth Forum for Information Retrieval Evaluation, 8 pages, New Delhi, India, 2013. (PDF)
  2. Utpal Garain, Jiaul Paik, Tamaltaru Pal, Prasenjit Majumder, David Doermann and Douglas W. Oard, "Overview of the FIRE 2011 RISOT Task," Third Forum for Information Retrieval Evaluation, pp.~159--163, Mumbai, India, 2011. (PDF)

TREC Legal Track Overviews (2006-2011)

These are track overview papers that resulted from my work as a track coordinator in the Text Retrieval Conference (TREC). These papers describe evaluation design issues for information retrieval systems that are designed to support a search for digital evidence in a litigation context. My own research on information retrieval techniques using these evaluation designs can be found below in the Document Image Retrieval and Email sections.
  1. Maura R. Grossman, Gordon V. Cormack, Bruce Hedin and Douglas W. Oard, "Overview of the TREC 2011 Legal Track," in Proceedings of the Twentieth Text Retrieval Conference, 20 pages, Gaithersburg, MD, 2011. (PDF)
  2. Douglas W. Oard, Jason R. Baron, Bruce Hedin, David D. Lewis and Stephen Tomlinson, "Evaluation of Information Retrieval for E-Discovery," Artificial Intelligence and Law, 18(4)347-386, 2010. (PDF) (Publisher)
  3. Gordon V. Cormack, Maura R. Grossman, Bruce Hedin, and Douglas W. Oard, "Overview of the TREC-2010 Legal Track," in Working Notes of the Nineteenth Text Retrieval Conference, pp. 30-38, Gaithersburg, MD, 2010. (PDF)
  4. William Webber, Douglas W. Oard, Falk Scholer and Bruce Hedin, "Assessor Error in Stratified Evaluation," in The 18th ACM International Conference on Information and Knowledge Management, 10 pages, Toronto, Canada, 2010. (PDF)
  5. Bruce Hedin, Stephen Tomlinson, Jason R. Baron and Douglas W. Oard, "Overview of the TREC 2009 Legal Track,'' in Proceedings of the Eighteenth Text Retrieval Conference," 40 pages, Gaithersburg, MD, 2009. (PDF)
  6. Bruce Hedin and Douglas W. Oard, "Replication and Automation of Expert Judgments: Information Engineering in Legal E-Discovery," in IEEE Conference on Systems, Man and Cybernetics, 6 pages, San Antonio, TX, 2009. (PDF)
  7. Douglas W. Oard, Bruce Hedin, Stephen Tomlinson and Jason R. Baron, "Overview of the TREC 2008 Legal Track," in The Seventeenth Text Retrieval Conference, Gaithersburg, MD, 45 pages, 2008. (PDF)
  8. Stephen Tomlinson, Douglas W. Oard, Jason R. Baron and Paul Thompson, "Overview of the TREC 2007 Legal Track," in The Sixteenth Text Retrieval Conference, Gaithersburg, MD, 34 pages, 2007. (PDF)
  9. Jason R. Baron, David D. Lewis and Douglas W. Oard, "The TREC-2006 Legal Track" in The Fifteenth Text Retrieval Conference, Gaithersburg, MD, 20 pages, 2006. (PDF)

CLEF Cross-Language Speech Retrieval Track Overviews (2005-2007)

These are track overview papers that resulted from my work as a track coordinator in the Cross-Language Evaluation Forum (CLEF). These papers describe evaluation design issues for information retrieval from spontaneous speech, regardless of the query language. My own research on information retrieval techniques using these evaluation designs can be found below in the Speech Retrieval section.
  1. Pavel Pecina, Petra Hoffmannova, Gareth J.F. Jones, Ying Zhang and Douglas W. Oard, "Overview of the CLEF-2007 Cross-Language Speech Retrieval Track," in Advances in Multilingual and Multimodal Information Retrieval, Revised Selected Papers, CLEF 2007, Springer-Verlag, LNCS (5152), Budapest, pp. 674-686, 2007. (PDF)
  2. Douglas W. Oard, Jianqiang Wang, Gareth G.F. Jones, Ryen White, Pavel Pecina, Dagobert Soergel, Xiaoli Huang, Izhak Shafran, "Overview of the CLEF-2006 Cross-Language Speech Retrieval Track," in Evaluation of Multilingual and Multi-modal Information Retrieval, Revised Selected Papers, CLEF-2006, Springer-Verlag, LNCS (4730), Alicante, Spain, 12 pages, 2006. (PDF)
  3. Ryen W. White, Douglas W. Oard, Gareth J.F. Jones, Dagobert Soergel and Xiaoli Huang, "Overview of the CLEF-2005 Cross-Language Speech Retrieval Track," in Multilingual Information Repositories, Revised Selected Papers, CLEF-2005, Springer-Verlag, LNCS (4022), Vienna, Austria, pp. 744-759, 2005. (PDF)

CLEF Interactive Track Overviews (2002-2004)

These are track overview papers that resulted from my work as a track coordinator in the Cross-Language Evaluation Forum (CLEF). These papers describe evaluation design issues for user-in-the-loop systems that are designed to support Multilingual Information Access (MLIA). My own research on information retrieval techniques using these evaluation designs can be found below in the MLIA section.
  1. Julio Gonzalo and Douglas W. Oard, "iCLEF 2004 Track Overview: Pilot Experiments in Interactive Cross-Language Question Answering," in Multilingual Information Access for Text, Speech and Images, Fifth Workshop of the Cross-Language Evaluation Forum, CLEF 2004, Revised Selected Papers Series, Springer-Verlag, LNCS (3491), Bath, UK, pp. 310-322, 2004. (PDF)
  2. Julio Gonzalo and Douglas W. Oard, "The CLEF-2003 Interactive Track," in Comparative Evaluation of Multilingual Information Access Systems, Fourth Workshop of the Cross-Language Evaluation Forum, Revised papers, Springer-Verlag LNCS (3237), Trondheim, Norway, 2003. (PDF)
  3. Douglas Oard and Julio Gonzalo, "The CLEF-2002 Interactive Track," in Advances in Cross-Language Information Retrieval Third Workshop of the Cross-Language Evaluation Forum, CLEF 2002, Revised papers, Springer-Verlag LNCS (2785), pp. 245-254, Rome, Italy, 2002. (PDF)
  4. Douglas W. Oard and Julio Gonzalo, "The CLEF 2001 Interactive Track," in Evaluation of Cross-Language Information Retrieval Systems, Second Workshop of the Cross-Language Evaluation Forum, CLEF 2001 Revised Papers, Springer-Verlag LNCS (2406), Darmstadt, Germany, pp. 308-319, 2001. (PDF)

TREC Arabic CLIR Track Overviews (2001-2002)

These are track overview papers and other papers that resulted from my work as a track coordinator in the Text Retrieval Conference (TREC). These papers describe evaluation design issues for information retrieval from Arabic, regardless of the query language. My own research on information retrieval techniques using these evaluation designs can be found below in the Multilingual Information Access section.
  1. Douglas W. Oard and Frederic C. Gey, "The TREC-2002 Arabic-English CLIR Track," in The Eleventh Text Retrieval Conference, Gaithersburg, MD, pp. 17-26, 2002. (PDF)
  2. Douglas W. Oard, Fredric C. Gey and Bonnie J. Dorr, "Evaluating Arabic Retrieval from English or French Queries," in LREC Workshop on Arabic Language Resources and Evaluation, Las Palmas, Spain, pp. 5-10, 2002. (PDF)
  3. Fredric C. Gey and Douglas W. Oard, "The TREC-2001 Cross-Language Information Retrieval Track: Searchi Arabic Queries," in The Tenth Text Retrieval Conference, pp. 114-121, Gaithersburg, MD, 2001. (PDF)
  4. Douglas W. Oard and Fredric C. Gey, "The TREC-2001 Arabic Information Retrieval Evaluation," in ACL Workshop on Arabic Language Processing, pp. 95-96, Toulouse, France, 2001. (PDF)

Other Evaluation Design

These papers report on evaluation design research conducted outside the scope of a shared-task evaluation that I helped to coordinate.
  1. Elizabeth Salesky, Matthew Weisner, Jacob Bremerman, Roldano Cattoni, Matteo Negri, Marco Turchi, Douglas W. Oard, Matt Post, Multilingual TEDx Corpus for Speech Recognition and Translation. Interspeech, 5 pp., 2021. (PDF)
  2. Jacob Bremerman, Huda Khayrallah, Douglas W Oard and Matt Post, On the Evaluation of Machine Translation n-best Lists, EMNLP Workshop on Evaluation and Comparison of NLP Systems, 9 pages, 2020. (PDF)
  3. Jacob Bremerman, Dawn J. Lawrie, James Mayfield and Douglas W. Oard, Two Test Collections for Retrieval Using Named Entity Markup. CIKM, pp. 3265-3268, 2020. (PDF)
  4. Douglas W. Oard, Tetsuya Sakai and Noriko Kando, Celebrating 20 Years of NTCIR: The Book, 1 page, EVIA, Tokyo, Japan, 2019. (PDF)
  5. Ning Gao, Mossaab Bagdouri and Douglas W. Oard, Pearson Rank: A Head-Weighted Gap-Sensitive Score-Based Correlation Coefficient," in 39th International ACM SIGIR Conference on Research and Development in Information Retrieval, 4 pages, Pisa, Italy, 2016. (PDF)
  6. Ning Gao and Douglas W. Oard, "A Head-Weighted Gap-Sensitive Correlation Coefficient," in Proceedings of the 28th Annual ACM SIGIR Conference on Research and Development in Information Retrieval, Santiago, Chile, 2015. (PDF)
  7. Ning Gao, William Webber and Douglas W. Oard, "Reducing Reliance on Relevance Judgments for System Comparison by Using Expectation-Maximization," in Proceedings of the of the 36th European Conference on Information Retrieval, 12 pages, Amsterdam, The Netherlands, 2014. (PDF)
  8. Dina Demner-Fushman, Daqing He and Douglas W. Oard, "Exploring Interactive Relevance Feedback With a Two-Pass Study Design," Technical Report CS-TR-4621, University of Maryland Computer Science Department, 2004. (PDF)
  9. Bonnie Dorr, Christof Monz, Douglas Oard, David Zajic and Richard Schwartz, "Extrinsic Evaluation of Automatic Metrics for Summarization," Technical Report CS-TR-4610, University of Maryland Computer Science Department, 2004. (PDF)

Multilingual Information Access

These papers address the problem of finding documents that are written in one language (e.g., Chinese) using requests that are written in a different language (e.g., English). This problem is often referred to as "Cross-Language Information Retrieval" (CLIR), but Multilingual Information Access (MLIA) is a more inclusive term that better describes the scope of the work described here. Papers that address MLIA for spoken or scanned content can be found in those sections, interspersed with my other papers that address those topics. TREC and CLEF track overview papers that address evaluation design for some specific MLIA problems that have been the focus of international evaluation venues can be found in the evaluation design sections above.
  1. Eugene Yang, Suraj Nair, Dawn Lawrie, James Mayfield, Douglas W. Oard and Kevin Duh, Effectiveness=Efficiency Tradeoff of Probabilistic Structured Queries for Cross-Language Information Retrieval, arXiv preprint, arXiv:2404.18797, 11 pages, 2024. (PDF Preprint)
  2. Eugene Yang, Dawn Lawrie, James Mayfield, Douglas Oard and Scott Miller, Translate-Distill: Learning Cross-Language Dense Retrieval by Translation and Distillation, European Conference on Information Retrieval, Glasgow, UK, 17 pages, 2024. (PDF)
  3. Suraj Nair and Douglas W. Oard, BLADE: The University of Maryland at the TREC 2023 NeuCLIR Track, TREC, 2023. (PDF)
  4. Suraj Nair, Eugene Yang, Dawn Lawrie, James Mayfield and Douglas Oard, BLADE: Combining Vocabulary Pruning and Intermediate Pretraining for Scaleable Neural CLIR, Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, Taipei, 11 pages, 2023. (PDF)
  5. Dawn Lawrie, James Mayfield, Douglas Oard, Eugene Yang, #Suraj Nair and Petra Galuščáková, HC3: A Suite of Test Collections for CLIR Evaluation over Informal Text, Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, Taipei, 10 pages, 2023. (PDF)
  6. Dawn Lawrie, James Mayfield, Suraj Nair, Douglas W. Oard and Eugene Yang, Neural Methods for Cross-Language Information Retrieval, SIGIR 2023 Tutorial Abstract, Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, Taipei, 2 pages, 2023. (PDF)
  7. Dawn Lawrie, Eugene Yang, James Mayfield and Douglas W. Oard, Neural Approaches to Multilingual Information Retroeval, European Conference on Infromation Retrieval, 2023. (PDF)
  8. Eugene Yang, Suraj Nair, Dawn Lawrie, James Mayfield and Douglas W. Oard, Parameter-Efficient Zero-Shot Transfer for Cross-Language Dense Retrieval with Adapters, 15 pages, ArXiv preprint arXiv:2212,10448. (PDF)
  9. Suraj Nair and Douglas W. Oard, Probabilistic Structured Queries: The University of Maryland at the TREC 2022 NeuCLIR Track, TREC, 2022. (PDF)
  10. Inkyung Choi, Wan-Chen Lee, Ying-Hsang Liu, Hsinlinag Chen, Douglas W. Oard and Chi Young Oh, Cross-Cultural Information Access (Panel Summary), 4 pages, ASIS&T, 2022. (PDF)
  11. Suraj Nair, Eugene Yang, Dawn Lawrie, James Mayfield and Douglas W. Oard, Learning a Sparse Representation Model for Neural CLIR, 12 pages, DESIRES, 2022. (PDF)
  12. Eugene Yang, Suraj Nair, Ramraj Chandradevan, Rebecca Iglesias-Flores and Douglas W. Oard, C3: Continued Pretraining with Contrastive Weak Supervision for Cross-Language Ad-Hoc Retrieval, 6 pages, SIGIR, 2022. (PDF)
  13. Suraj Nair, Eugene Yang, Dawn Lawrie, Kevin Duh, Paul McNamee, Kenton Murray, James Mayfield and Douglas W. Oard, Transfer Learning Approaches for Building Cross-Language Dense Retrieval Models, 15 pages, ECIR, 2022. (PDF)
  14. Dawn Lawrie, James Mayfield, Douglas W. Oard and Eugene Yang, HC4: A New Suite of Tst Collections for Ad Hoc CLIR, 16 pages, ECIR, 2022. (PDF)
  15. Yanda Chen, Chris Kedzie, Suraj Nair, Petra Galuščáková, Rui Zhang, Douglas W. Oard and Kathleen McKeown, Cross-language Sentence Selection via Data Augmentation and Rationale Training, Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, pp. 3881-3895, 2021. (PDF)
  16. Petra Galuščáková and Douglas W. Oard, Supporting Global Knowledge Sharing using Cross-Language Information Retrieval, NASA AI and Data Science Workshop for Earth and Space Science, 2 page poster paper, 2021. (PDF)
  17. Suraj Nair, Petra Galuščáková and Douglas W. Oard, Combining contextualized and non-contextualized query translations to improve CLIR, 4 pages, SIGIR, 2020. (PDF)
  18. Petra Galuščáková, Douglas W. Oard, Joe Barrow, Suraj Nair, Han-Chin Shing, Elena Zotkina, Ramy Eskander, Rui Zhang, MATERIALizing Cross-Language Information Retrieval: A Snapshot, LREC Workshop on Cross-Language Search and Summarization of Text and Speech, pp. 14-21, 2020. (PDF)
  19. Han-Chin Shing, Joe Barrow, Petra Galuščáková, Douglas W. Oard, Philip Resnik, Unsupervised System Combination for Set-Based Retrieval with Expectation Maximization, CLEF, pp. 191-197, Lugano, Switzerland, 2019. (PDF)
  20. Douglas Oard, Marine Carpuat, Petra Galuščáková, Joseph Barrow, Suraj Nair, Xing Niu, Han-Chin Shing, Weijia Xu, Elena Zotkina, Kathleen McKeown, Smaranda Muresan, Efsun Kayi, Ramy Eskander, Chris Kedzie, Yan Virin, Dragomir Radev, Rui Zhang, Mark Gales, Anton Ragni and Kenneth Heafield, Surprise Languages: Rapid-Response Cross-Language IR. Proceedings of the Ninth International Workshop on Evaluating Information Access (EVIA 2019), 5 pages, Tokyo Japan, 2019. (PDF)
  21. Sungho Kim, Youngjoong Ko and Douglas W. Oard, "Combining Lexical and Statistical Translation Evidence for Cross-Language Information Retrieval," Journal of the Association for Information Science and Technology (JASIST), 66(1)23-39, 2015. (preprint: PDF) (Publisher)
  22. Mossaab Bagdouri, Douglas W. Oard, and Vittorio Castelli, "CLIR for Informal Content in Arabic Forum Posts," in ACM International Conference on Information and Knowledge Management, 4 pages, Shanghai, China, 2014. (PDF)
  23. Yejun Wu and Douglas W. Oard, "English and Chinese Bilingual Topic Aspect Classification: Examining Similarity Measures, Optimal LSA Dimensions, and Centroid Correction of Translated Training Examples," in 76th Annual Conference of the American Society for Information Science and Technology, contributed paper, 12 pages, Montreal, Canada, 2013. (PDF)
  24. Ferhan Ture, Jimmy Lin and Douglas W. Oard, "Combining Statistical Translation Techniques for Cross-Language Information Retrieval," in 24th International Conference on Computational Linguistics, 17 pages, Mumbai, India, 2012. (PDF)
  25. Ferhan Ture, Douglas W. Oard and Philip Resnik, "Encouraging Consistent Translation Choices," in Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics, pp. 417-426, Montreal, Canada, 2012. (PDF)
  26. Ferhan Ture, Jummy Lin and Douglas W. Oard, "Looking Inside the Box: Context-Sensitive Translation for Cross-Language Information Retrieval," in 35th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, 2 pages, Portland, OR, 2012. (PDF)
  27. Jianqiang Wang and Douglas W. Oard, "Matching Meaning for Cross-Language Information Retrieval." Information Processing and Management, 48(4)631-653, 2012. (PDF) (Publisher)
  28. Douglas W. Oard, Carl Madson, Joseph Olive, John McCary and Caitlin Christianson (eds.), "Operational Engines," in Joseph Olive, Caitlin Christianson and John McCary (eds.), Handbook of Natural Language Processing and Machine Translation: DARPA Global Autonomous Language Exploitation, pp. 845-932, Springer, 2011. (Publisher)
  29. Tan Xu and Douglas W. Oard "FIRE-2008 at Maryland," in Working Notes of the Forum for Information Retrieval Evaluation, 12 pages, Kolkata, India, 2008. (PDF)
  30. Yejun Wu and Douglas W. Oard, "Bilingual Aspect Classification Based on Cross-language Text Classification," 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 203-210, Singapore, 2008. (PDF)
  31. Douglas W. Oard, Daqing He and Jianqiang Wang, "User-Assisted Query Translation for Cross-Language Information Retrieval," Information Processing and Management, 44(1)181-211, 2008. (PDF) (Publisher)
  32. Pengyi Zhang, Lynne Plettenberg, Judith Klavans, Douglas W. Oard and Dagobert Soergel, "Task-Based Interaction with an Integrated Multilingual Multimedia Information System: A Formative Evaluation," Joint Conference on Digital Libraries, pp. 117-126, Vancouver, BC, Canada, 2007. (PDF)
  33. Jianqiang Wang and Douglas W. Oard, "Combining Bidirectional Translation and Synonymy for Cross-Language Information Retrieval," in 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 202-209, Seattle, 2006. (PDF)
  34. Daqing He, Douglas W. Oard, and Lynne Plettenberg, "Studying the Use of Interactive Multilingual Information Retrieval", in ACM SIGIR Workshop on New Directions in Multilingual Information Access, Amsterdam, 5 pages, 2006. (PDF)
  35. Gina-Anne Levow, Douglas W. Oard and Philip Resnik, "Dictionary-Based Cross-Language Retrieval," Information Processing and Management, 41(3)523-547, 2005. (PDF)
  36. J. Scott Olsson, Douglas W. Oard and Jan Hajic, "Cross-Language Text Classification," in Proceedings of the 28th Annual ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 645-646, 2005. (PDF)
  37. Daqing He, Jianqiang Wang, Jun Luo and Douglas W. Oard, "iCLEF 2004 at Maryland: Summarization Design for Interactive Cross-Language Question Answering," in Multilingual Information Access for Text, Speech and Images, Fifth Workshop of the Cross-Language Evaluation Forum, CLEF 2004, Revised Selected Papers Series, Springer-Verlag, LNCS (3491), Bath, UK, pp. 340-362, 2004. (PDF)
  38. Douglas W. Oard, Julio Gonzalo, Mark Sanderson, Fernando Lopez-Ostenero and Jianqiang Wang, "Interactive Cross-Language Document Selection," Information Retrieval, 7(1-2)205-228, 2004. (PDF)
  39. H. Ma, D. Doermann, B. Karagol-Ayan, D. Oard, and J. Wang, "Parsing and Tagging of Bilingual Dictionaries," Traitement Automatique des Langues, 44(2)125-150, 2004. (PDF)
  40. Tamer Elsayed, Douglas W. Oard, David Doermann and Gary Kuhn, "TDT- 2004: Adaptive Topic Tracking at Maryland," in Working Notes of the TDT-2004 Workshop, Gaithersburg, MD, 5 pages, 2004. (PDF)
  41. Kareem Darwish and Douglas W. Oard, "Probabilistic Structured Query Methods," in Twenty-Sixth International ACM-SIGIR Conference on Research and Development in Information Retrieval, pp. 338-344, Toronto, Canada, 2003. (PDF)
  42. Daqing He, Douglas W. Oard, Jianqiang Wang, Jun Luo, Dina Demner- Fushman, Kareem Darwish, Philip Resnik, Sanjeev Khudanpur, Michael Nossal, Michael Subotin and Anton Leuski, "Making MIRACLEs: Interactive Translingual Search for Cebuano and Hindi," ACM Transactions on Asian Language Information Processing, 2(3)219-244, 2003. (PDF), (Publisher)
  43. Dina Demner-Fushman and Douglas W. Oard, "The Effect of Bilingual Term List Size on Dictionary-Based Cross-Language Information Retrieval," in Hawaii International Conference on System Sciences, 10 pages, Kona, HI, 2003. (PDF)
  44. Douglas W. Oard and Franz Josef Och, "Rapid-Response Machine Translation for Unexpected Languages," 7 pages, Machine Translation Summit IX, New Orleans, 2003. (PDF)
  45. Douglas W. Oard, David Doermann, Bonnie Dorr, Daqing He, Philip Resnik, Amy Weinberg, William Byrne, Sanjeev Khudanpur, David Yarowsky, Anton Leuski, Philipp Koehn and Kevin Knight, "Desperately Seeking Cebuano," in Third Conference on Human Language Technologies, short paper (3 pages), Edmonton, Canada, 2003. (PDF)
  46. Abdessamad Echicabi, Douglas W. Oard, Daniel Marcu and Ulf Hermjakob, "Answering Spanish Questions from English Documents," in Comparative Evaluation of Multilingual Information Access Systems, Fourth Workshop of the Cross-Language Evaluation Forum, Revised papers, Springer-Verlag LNCS (3237), Trondheim, Norway, pp. 514-522, 2003. (PDF)
  47. Bonnie J. Dorr, Daqing He, Jun Luo, Douglas W. Oard, Richard Schwartz, Jianqiang Wang and David Zajic, "iCLEF-2003 at Maryland: Headline Generation and Interactive Query Formulation," in Comparative Evaluation of Multilingual Information Access Systems, Fourth Workshop of the Cross-Language Evaluation Forum, Revised papers, Springer-Verlag LNCS (3237), Trondheim, Norway, pp. 435-449, 2003. (PDF)
  48. Daqing He, Jianqiang Wang, Douglas W. Oard and Michael Nossal, "Comparing User-Assisted and Automatic Query Translation," in Advances in Cross-Language Information Retrieval Third Workshop of the Cross-Language Evaluation Forum, CLEF 2002, Revised papers, Springer-Verlag LNCS (2785), pp. 267-278, Rome, Italy, 2002. (PDF)
  49. Gina-Anne Levow and Douglas W. Oard, "Signal Boosting for Translingual Topic Tracking" in Allen, James, ed. Topic Detection and Tracking: Event-Based Information Organization, Chapter 9, pp. 175-195, Kluwer Academic, 2002. (PDF)
  50. Douglas W. Oard and Funda Ertunc, "Translation-Based Indexing for Cross-Language Information Retrieval," in 24th BCS-IRSG European Colloquium on IR Research, pp. 324-333, Glasgow, UK, 2002. (PDF)
  51. David Doermann, Huanfeng Ma, Burcu Karagol-Ayan and Douglas W. Oard, "Translation Lexicon Acquisition from Bilingual Dictionaries," in Proceedings of the Ninth SPIE Symposium on Document Recognition and Retrieval, pp. 37-48, San Jose, CA, 2002.
  52. Kareem Darwish and Douglas W. Oard, "CLIR Experiments at Maryland for TREC-2002: Evidence Combination for Arabic-English Retrieval," in The Eleventh Text Retrieval Conference, Gaithersburg, MD, pp. 703-711, 2002. (PDF)
  53. Daqing He, Hyuk Ro Park, G. Craig Murray, Michael Subotin and Douglas W. Oard, "TDT-2002: Topic Tracking at Maryland: First Experiments with the Lemur Toolkit," in Working Notes of the Topic Detection and Tracking Workshop, 7 pages (online proceedings), Gaithersburg, MD, 2002. (PDF)
  54. Kareem Darwish, David Doermann, Ryan Jones, Douglas Oard, and Mika Rautiainen, "TREC-10 Experiments at Maryland: CLIR and Video," in The Tenth Text Retrieval Conference, pp. 552-564, Gaithersburg, MD, 2001. (PDF)
  55. Gina Levow, Douglas Oard, Philip Resnik, and Clara Cabezas, "Rapidly Retargetable Interactive Translingual Retrieval," in Proceedings of the First International Conference on Human Language Technology, pp. 294-298, San Diego, 2001. (PDF)
  56. Philip Resnik, Douglas Oard and Gina Levow, "Improved Cross-Language Retrieval using Backoff Translation," in Proceedings of the First International Conference on Human Language Technology, pp. 153-155, San Diego, 2001. (PDF)
  57. Jianqiang Wang and Douglas W. Oard, "iCLEF 2001 at Maryland: Comparing Word-for-Word Gloss and MT," in Evaluation of Cross-Language Information Retrieval Systems, Second Workshop of the Cross-Language Evaluation Forum, CLEF 2001 Revised Papers, Springer-Verlag LNCS (2406), Darmstadt, Germany, pp. 336-354, 2001. (PDF)
  58. Douglas W. Oard and Jianqiang Wang, "NTCIR-2 ECIR Experiments at Maryland: Comparing Structured Queries and Balanced Translation," in Proceedings of the Second NTCIR Workshop on Evaluation of Japanese and Chinese Text Retrieval and Text Summarization, pp. 97-104, Tokyo, 2001. (PDF)
  59. Douglas W. Oard, "Evaluating Interactive Cross-Language Document Retrieval: Document selection," Proceedings of the First Cross-Language Evaluation Forum, pp. 57-71, Lisbon, 2000. (PDF)
  60. Douglas W. Oard, Gina-Anne Levow and Clara Cabezas, "CLEF Experiments at Maryland: Statistical stemming and backoff Translation," in Cross-Language Information Retrieval and Evaluation, Workshop of Cross-Language Evaluation Forum, CLEF 2000, Revised Papers Springer-Verlag, LNCS (2069), pp. 176-187, Lisbon, 2000. (PDF)
  61. Ruth Sperer and Douglas W. Oard, "Structured Translation for Cross-Language Information Retrieval," in Proceedings of the 23rd Annual ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 120-127, Athens, Greece, 2000. (PDF)
  62. Paul G. Hackett and Douglas W. Oard, "Comparison of Word-Based and Syllable-Based Retrieval for Tibetan," Poster paper, in Fifth International Workshop on Information retrieval with Asian Languages, pp. 197-198, Hong Kong, 2000. (PDF)
  63. Douglas W. Oard, Gina-Anne Levow and Clara Cabezas, "TREC-9 Experiments at Maryland: Interactive CLIR," in The Ninth Text Retrieval Conference, pp. 543-550, Gaithersburg, MD, 2000. (PDF)
  64. Gina-Anne Levow and Douglas W. Oard, "Translingual Topic Tracking: Applying Lessons from the MEI Project," in Working notes of the Topic Detection and Tracking Workshop, (5 pages), Gaithersburg, MD, 2000. (PDF)
  65. Gina-Anne Levow and Douglas W. Oard, "Translingual Topic Tracking With PRISE," in Working notes of the Topic Detection and Tracking Workshop, pp. 175-180, Tysons Corner, VA, 2000. (PDF)
  66. Douglas W. Oard and Jianqiang Wang, "NTCIR CLIR Experiments at the University of Maryland," in Proceedings of the First NTCIR Workshop on Research in Japanese Text Retrieval and Term Recognition, pp. 157-161, Tokyo, 1999. (PDF)
  67. Douglas W. Oard, Jianqiang Wang, Dekang Lin, and Ian Soboroff, "TREC-8 Experiments at Maryland: CLIR, QA and Routing," in The Eighth Text Retrieval Conference, pp. 623-636, Gaithersburg, MD, 1999. (PDF)
  68. Douglas W. Oard and Philip Resnik, "Support for Interactive Searching in Cross-Language Information Retrieval," Information Processing and Management, 35(3)363-379, 1999. (PDF) (Publisher)
  69. Gina-Anne Levow and Douglas W. Oard, "Evaluating Lexicon Coverage for Cross-Language Information Retrieval" in Proceedings of the Workshop on Multilingual Information Processing and Asian Language Processing, pp. 69-74, Beijing, 1999. (PDF)
  70. Douglas W. Oard and Jianqiang Wang, "Effects of Term Segmentation in Chinese/English Cross-Language Information Retrieval," in Proceedings of the Symposium on String Processing and Information Retrieval, pp. 149-157, Cancun, Mexico, 1999. (PDF)
  71. Douglas W. Oard, "Topic Tracking with the PRISE Information Retrieval System," in Proceedings of the DARPA Broadcast News Workshop, pp. 209-211, Reston, VA, 1999. (PDF)
  72. Douglas W. Oard, "Resources for Chinese/English Cross-Language IR," 25 pp., University of Maryland, 1999. (PDF) [the greek letters are misrendered versions of unfilled, half-filled, and completely filled circles that broke when Microsoft updated the character set for Word]
  73. Douglas W. Oard, "A Comparative Study of Query and Document Translation for Cross-Language Information Retrieval," in Proceedings of the Third Conference of the Association for Machine Translation in the Americas, pp. 472-483, Philadelphia, PA, 1998. (PDF)
  74. Bonnie J. Dorr and Douglas W. Oard, "Evaluating Resources for Query Translation in Cross-Language Information Retrieval," in Proceedings of the First International Conference on Language Resource Evaluation, Volume II, pp. 759-764, Granada, Spain, 1998. (PDF)
  75. Douglas W. Oard and Bonnie J. Dorr, "Evaluating Cross-Language Text Filtering Effectiveness," in Gregory Grefenstette (ed.), Cross-Language Information Retrieval, Chapter 12, pp. 151-161, Kluwer Academic, 1998. [This is essentially the same as the SIGIR 96 workshop paper below.]
  76. Douglas W. Oard, "TREC-7 Experiments at the University of Maryland," in The Seventh Text Retrieval Conference, pp. 541-545, Gaithersburg, MD, 1998. (PDF)
  77. Douglas W. Oard and Paul Hackett, "Document Translation for Cross-Language Text Retrieval at the University of Maryland," in The Sixth Text Retrieval Conference, pp. 687-696, Gaithersburg MD, 1997. (PDF)
  78. Douglas W. Oard, "Adaptive Filtering of Multilingual Document Streams," in Fifth RIAO Conference on Computer Assisted Information Searching on the Internet, Volume 1, pp. 233-254, Montreal, Canada, 1997. (PDF)
  79. Douglas W. Oard, "Alignment of Spanish and English TREC Topic Descriptions," in The Fifth Text Retrieval Conference, pp. 547-553, Gaithersburg MD, 1996. (PDF)
  80. Douglas W. Oard, "Adaptive Vector Space Text Filtering for Monolingual and Cross-Language Applications," Ph.D. Dissertation, University of Maryland, College Park, 1996. (PDF)
  81. Douglas W. Oard and Bonnie J. Dorr, "Evaluating Cross-Language Text Filtering Effectiveness," in Proceedings of Cross-Linguistic Multilingual Information Retrieval Workshop, ACM SIGIR Conference, pp. 8-14, Zurich, 1996. (PDF)
  82. Douglas W. Oard, Nicholas DeClaris, Bonnie J. Dorr and Christos Faloutsos, "On Automatic Filtering of Multilingual Texts," Proceedings of IEEE International Conference on Systems, Man and Cybernetics, pp. 1645-1650, San Antonio, TX, 1994. (PDF)

Speech Retrieval

These papers address techniques for searching spoken content based on written or spoken queries.
  1. Douglas W. Oard, Christopher Bearman, David Baker, Susannah Paletz and Johanne Trippas, Operational Disconnect Detection in Mission Control, ICASSP Fearless Steps Apollo Workshop, Seoul, South Korea, 2 pages, 2024. (PDF)
  2. Petra Galuščáková, Suraj Nair and Douglas W. Oard: Combine and Re-Rank: The University of Maryland at the TREC 2020 Podcasts Track, TREC (notebook paper), 9 pages, 2020. (PDF)
  3. Suraj Nair, Anton Ragni, Ondrej Klejch, Petra Galuščáková and Douglas Oard, Experiments with Cross-Language Speech Retrieval for Lower-Resource Languages, Asia Information Retrieval Symposium, pp. 145-157, Hong Kong, China, 2019. (PDF)
  4. Ning Gao, Gregory Sell, Douglas Oard, Mark Dredze, Leveraging Side Information for Speaker Identification with the Enron Conversational Telephone Speech Collection, IEEE Automatic Speech Recognition and Understanding Workshop, 7 pages, 2017. (PDF)
  5. Ning Gao, Douglas W. Oard and Mark Dredze, Support for Interactive Identification of Mentioned Entities in Conversational Speech, 40th International ACM SIGIR Conference on Research and Development in Information Retrieval 4 pages, Tokyo, Japan, 2017. (PDF)
  6. Tiffany Jachja and Douglas W. Oard, "Goal-Directed Information Seeking in Time-Synchronized and Topic-Linked Records of the Apollo Lunar Missions," The ACM Conference on Human Information Interaction and Retrieval, 4 pages, Oslo, Norway, 2017. (PDF)
  7. Douglas W. Oard, John H.L. Hansen, Abhijeet Sangawan, Bryan Toth, Lakshmish Kaushik and Chengzhu Yu, "Toward Access to Multi-Perspective Archival Spoken Word Content," International Conference on Asian Digital Libraries, 6 pages, Tsukuba, Japan, 2016. (PDF)
  8. Douglas W. Oard, Rashmi Sankepally, Jerome White and Craig Harman, "Vapor Engine: Demonstrating an Early Prototype of a Language-Independent Search Engine for Speech," in ACM SIGIR Conference on Human Information Interaction and Retrieval, 4 pages, Chapel Hill, NC, 2016. (PDF)
  9. Douglas W. Oard, Rashmi Sankepally, Jerome White, Aren Jansen and Craig Harman, "A Test Collection for Spoken Gujarati Queries," Proceedings of the 28th Annual ACM SIGIR Conference on Research and Development in Information Retrieval, Santiago, Chile, 2015. (PDF)
  10. Jerome White, Douglas Oard, Aren Jansen, Jiaul Paik and Rashmi Sankepally, Using Zero-Resource Spoken Term Discovery for Ranked Retrieval, Annual Conference of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies, Denver, CO, 2015. (PDF)
  11. Ali Ziaei, Lakshmish Kaushik, Abhijeet Sangwan, John H.L. Hansen and Doug Oard, "Speech Activity Detection for NASA Apollo Space Missions: Challenges and Solutions," in 15th Annual Conference of the International Speech Communication Association, 5 pages, Singapore, 2014. (PDF)
  12. Chengzhu Yu, John Hansen and Douglas W. Oard, "Houston, We have a Solution: A Case study of the Analysis of Astronaut Speech during NASA Apollo 11 for Long-term Speaker Modeling," in 15th Annual Conference of the International Speech Communication Association, 4 pages, Singapore, 2014. (PDF)
  13. Jerome White, Douglas W. Oard, Nitendra Rajput and Marion Zalk, "Simulating Early-Termination Search for Verbose Spoken Queries," Emperical Methods in Natural Language Processing, 11 pages, Seattle, WA, 2013. (PDF)
  14. Abhijeet Sangwan, Lakshmish Kaushik, Chengzhu Yu, John H.L. Hansen and Douglas W. Oard, "Houston, We Have a Solution: Using NASA Apollo Program to Advance Speech and Language Processing Technology," INTERSPEECH, pp. 1135-1139, Lyon, France, 2013. (PDF)
  15. Douglas W. Oard, Abhijeet Sangwan and John H.L. Hansen, "Reconstruction of Apollo Mission Control Center Activity," in SIGIR Workshop on Exploration, Navigation and Retrieval of Information in Cultural Heritage (ENRICH), 4 pages, Dublin, Ireland, 2013. (PDF)
  16. Joseph Malionek, Douglas W. Oard, John Hansen and Abhijeet Sangwan, "Linking Transcribed Conversational Speech," in 36th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, 4 pages, Dublin, Ireland, 2013. (PDF)
  17. Douglas W. Oard, "Query By Babbling: A Research Agenda," in CIKM Workshop on Information and Knowledge Management for Developing Regions, 5 pages, Maui, HI, 2012. (PDF)
  18. J. Scott Olsson and Douglas W. Oard, "Combining Evidence from LVCSR and Ranked Utterance Retrieval for Robust Domain-Specific Ranked Retrieval," Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, Boston, 2009. (PDF)
  19. J. Scott Olsson and Douglas W. Oard, "Phrase-Based Query Degradation Modeling for Vocabulary-Independent Ranked Utterance Retrieval" Proceedings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics Human Language Technology Conference, Boulder, 2009. (PDF)
  20. J. Scott Olsson and Douglas W. Oard, "Combining Speech Retrieval Results with Generalized Additive Models," Association for Computational Linguistics-Human Language Technology Conference, pp. 461-469, Columbus, OH, 2008. (PDF)
  21. J. Scott Olsson and Douglas W. Oard, "Improving Text Classification for Oral History Archives with Temporal Domain Knowledge," 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 623-630, Amsterdam, 2007. (PDF)
  22. Pavel Ircing, Douglas W. Oard and Jan Hoideker, "First Experiments Searching Spontaneous Czech Speech," in 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Amsterdam, pp. 835-836, 2007. (PDF)
  23. Baolong Liu and Douglas W. Oard, "One-Sided Measures for Evaluating Ranked Retrieval Effectiveness with Spontaneous Conversational Speech," in 29th Annual International ACM SIGIR Conference on Research and Development on Information Retrieval, Seattle, pp. 673-674, 2006. (PDF)
  24. Diana Inkpen, Muath Alzghool, Gareth J.F. Jones and Douglas W. Oard, "Investigating Cross-Language Speech Retrieval for a Spontaneous Conversational Speech Collection," Conference on Human Language Technologies and the North American Chapter of the Association for Computational Linguistics, 4 pages, New York, 2006. (PDF)
  25. Jianqiang Wang and Douglas W. Oard, "CLEF 2006 CL-SR at Maryland: English and Czech," in Evaluation of Multilingual and Multi-modal Information Retrieval, Revised Selected Papers, CLEF-2006, Springer-Verlag, LNCS (4730), Alicante, Spain, 7 pages, 2006. (PDF)
  26. Jianqiang Wang and Douglas W. Oard, "CLEF 2005 CL-SR at Maryland: Document and Query Expansion using Side Collections and Thesauri," in Multilingual Information Repositories, Revised Selected Papers, CLEF-2005, Springer-Verlag, LNCS (4022), Vienna, Austria, pp. 800-809, 2005. (PDF)
  27. William Byrne, David Doermann, Martin Franz, Samuel Gustman, Jan Hajic, Douglas Oard, Michael Picheny, Josef Psutka, Bhuvana Ramabhadran, Dagobert Soergel, Todd Ward and Wei-Jing Zhu, "Automated Recognition of Spontaneous Speech for Access to Multilingual Oral History Archives," IEEE Transactions on Speech and Audio Processing, 12(4)420-435, 2004. (PDF) (Publisher)
  28. Douglas W. Oard, Dagobert Soergel, Craig Murray, David Doermann, Jianqiang Wang, Bhuvana Ramabhadran, Martin Franz, James Mayfield, Samuel Gustman, and Stephanie Strassel, "Building an Information Retrieval Test Collection for Spontaneous Conversational Speech," Twenty-Seventh ACM-SIGIR Conference on Research and Development in Information Retrieval, pp. 41-48, Sheffield, UK, 2004. (PDF)
  29. Helen Meng, Berlin Chen, Erika Grams, Sanjeev Khudanpur, Gina-Anne Levow, Wai-Kit Lo, Douglas W. Oard, Karen Tang, Hsin-Min Wang and Jianqiang Wang, "Mandarin-English Information (MEI): Investigating Translingual Speech Retrieval," Computer Speech and Language, 18(2)163-179, 2004. (PDF) (Publisher)
  30. Sudeep Gandhe, Andrew Gordon, Anton Leuski, David R. Traum and Douglas W. Oard, "First Steps Towards Linking Dialogues: Mediating Between Free-Text Questions and Pre-recorded Video Answers," in 24th Army Science Conference, Orlando, FL, 8 pages, 2004. (PDF)
  31. Jinmook Kim Douglas W. Oard and Dagobert Soergel, "Searching Large Collections of Recorded Speech: A Preliminary Study," in Annual Conference of the American Society for Information Science and Technology, Long Beach, CA, pp. 330-339, 2003. (PDF)
  32. Douglas W. Oard and Anton Leuski, "Searching Recorded Speech Based on the Temporal Extent of Topic Labels," in AAAI Spring Symposium on Intelligent Multimedia Knowledge Management, Palo Alto, CA, 5 pages, 2003. (PDF)
  33. Jinmook Kim, Dagobert Soergel and Douglas W. Oard, "MALACH Workshop 2: Final Report," 62 pp., 2003.
  34. Samuel Gustman, Dagobert Soergel, Douglas Oard, William Byrne, Michael Picheny, Bhuvana Ramabhadran and Douglas Greenberg, "Supporting Access to Large Digital Oral History Archives," in Second Joint Conference on Digital Libraries, pp. 18-27, Portland, OR, 2002. (PDF)
  35. Douglas W. Oard, Dina Demner-Fushman, Jan Hajic, Bhuvana Ramabhadran, Samuel Gustman, William J. Byrne, Dagobert Soergel, Bonnie Dorr, Philip Resnik and Michael Picheny, "Cross-Language Access to Recorded Speech in the MALACH Project," in Fifth International Conference on Text,S peech and Dialog, pp. 57-64, Brno, Czech Republic, 2002. (PDF)
  36. Jinmook Kim and Douglas W. Oard, "The Use of Speech Retrieval Systems: A Study Design," in ACM SIGIR Workshop on IR Techniques for Speech Applications, New Orleans, pp. 86-93, 2001. (PDF)
  37. Helen Meng, Berlin Chen, Erika Grams, Sanjeev Khudanpur, Gina-Anne Levow, Wai-Kit Lo, Douglas W. Oard, Karen Tang, Hsin-Min Wang and Jianqiang Wang, "Mandarin-English Information (MEI): Investigating Translingual Speech Retrieval," in Proceedings of the First International Conference on Human Language Technology, pp. 239-245, San Diego, 2001. (PDF)
  38. Helen Meng, Sanjeev Khudanpur, Gina-Anne Levow, Douglas W. Oard, and Hsin-Min Wang, "Mandarin-English Information (MEI): Investigating Translingual Speech Retrieval," in NAACL Workshop on Embedded Machine Translation, pp. 23-30, Seattle, WA, 2000. (PDF)
  39. Douglas W. Oard, "User Interface Design for Speech-Based Retrieval," Bulletin of the American Society for Information Science, vol. 26, no. 5, pp. 20-22, June/July, 2000. (Publisher)
  40. Helen Meng, Sanjeev Khudanpur, Douglas W. Oard, and Hsin-Min Wang, "Mandarin-English Information (MEI)," in Working notes of the Topic Detection and Tracking Workshop, pp. 117-121, Tysons Corner, VA, 2000. (PDF)
  41. Laura Slaughter, Douglas W. Oard, Vernon Warnick, Galen Wilkerson and Julie Harding, "A Graphical Interface for Speech-Based Retrieval," in Proceedings of the Third ACM Conference on Digital Libraries, pp. 305-306, Pittsburgh, PA, 1998. (PDF)

Search and Sense-making in Email Collections

These papers address techniques for helping people find things in large collections of electronic mail that are not their own. I do not work on the counterpart problem of Personal Information Management, in which tools are built to help people better manage their own email collections. Papers reporting on evaluation design for email search in the TREC Legal Track can also be found in the TREC Legal Track Overview section.
  1. Tan Xu and Douglas W. Oard, "Exploring Example-Based Person Search in Email," in 35th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, 2 pages, Portland, OR, 2012. (PDF)
  2. Hyunmo Kang, Catherine Plaisant, Tamer Elsayed, and Douglas W. Oard, "Making Sense of Archived Email: Exploring the Enron Collection with NetLens," Journal of the American Society for Information Science and Technology, 61(4)723-744, 2010. (PDF) (Publisher)
  3. Tamer Elsayed, Douglas W. Oard, and Galileo Namata, "Resolving Personal Names in Email Using Context Expansion," accepted for presentation at Association for Computational Linguistics-Human Language Technology Conference, pp. 941-949, Columbus, OH, 2008. (PDF)
  4. Adam Perer, Ben Shneiderman, and Douglas W. Oard, "Using Rhythms of Relationships to Understand Email Archives," Journal of the American Society for Information Science and Technology, 57(14)1936-1948, 2006. (PDF) (Publisher)
  5. Yejun Wu, Douglas W. Oard and Ian Soboroff, "An Exploratory Study of the W3C Mailing List Test Collection for Retrieval of Emails with Pro and/or Con arguments," in Third Conference on Email and Anti-Spam, 10 pages, Mountain View, CA, 2006. (PDF)
  6. Tamer Elsayed and Douglas W. Oard, "Modeling Identity in Archival Collections of Email: A Preliminary Study," in Conference on Email and Anti-Spam, 9 pages, Mountain View, CA, 2006. (PDF)
  7. Yejun Wu and Douglas W. Oard, "Indexing Emails and Email Threads for Retrieval," in Proceedings of the 28th Annual ACM SIGIR Conference on Research and Development in Information Retrieval, poster paper, pp. 665-666, 2005. (PDF)
  8. Jimmy Lin, Eileen Abels, Dina Demner-Fushman, Douglas W. Oard, Philip Wu, and Yejun Wu, "A Menagerie of Tracks at Maryland: HARD, Enterprise, QA, and Genomics, Oh My!," in The Fourteenth Text Retrieval Conference, Gaithersburg, MD, 16 pages, 2005. (PDF)
  9. Anton Leuski, Douglas W. Oard and Rahul Bhagat, "eArchivarius: Accessing Collections of Electronic Mail," in Twenty-Sixth International ACM-SIGIR Conference on Research and Development in Information Retrieval, description of system demonstration, pp. 468, Toronto, Canada, 2003. (PDF)

Search and Sense-making in Text Chat

This is a research area on which I may publish more in the future.
  1. Rashmi Sankepally and Douglas W. Oard, An Initial Test Collection for Ranked Retrieval of SMS Conversations, in Eleventh Language Resources and Evaluation Conference, Miyazaki, Japan, 2018. (PDF)
  2. Lidan Wang and Douglas W. Oard, "Context-based Message Expansion for Disentanglement of Interleaved Text Conversations" Proceedings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics Human Language Technology Conference, Boulder, 2009. (PDF)

Search Among Sensitive Content

The focus of this work is on balancing relevance with protection for sensitive content.
  1. Jason R. Baron, Nathaniel W. Rollings and Douglas W. Oard, Using ChatGPT for the FOIA Exemption 5 Deliberative Process Privilege, Proceedings of the Third International Workshop on Artificial Intelligence and Intelligent Assistance for Legal Professionals in the Digital Workplace (LegalAIIA), Braga, Portugal, 2023. (PDF)
  2. Mahmoud F. Sayed, Nishanth Mallekav and Douglas W. Oard, Comparing Intrinsic and Extrinsic Evaluation of Sensitivity Classification, 8 pages, ECIR, 2022. (PDF)
  3. Jason R. Baron, Mahmoud F. Sayed and Douglas W. Oard, Providing More Efficient Access To Government Records: A Use Case Involving Application of Machine Learning to Improve FOIA Review for the Deliberative Process Privilege, ACM Journal on Computing and Cultural Heritage, 19pp., to appear, 2021. (PDF)
  4. Modassir Iqbal, Katie Shilton, Mahmoud F. Sayed, Douglas W. Oard, Jonah Lynn Rivera and William Cox, Search with Discretion: Value Sensitive Design of Training Data for Information Retrieval, Proceedings of the ACM on Human Computer Interaction (also presented at CSCW 2021), 20 pages, 2021. (PDF)
  5. Graham McDonald and Douglas W. Oard, Search Among Sensitive Content, ECIR 2021 Tutorial Abstract, in Proceedings of the 43rd European Conference on IR Re search, 1 page, 2021. (PDF)
  6. Mahmoud Sayed, William Cox, Jonah Lynn Rivera, Caitlin Christian-Lamb, Modassir Iqbal, Douglas W. Oard and Katie Shilton, A Test Collection for Relevance and Sensitivity, 4 pages, SIGIR, 2020. (PDF)
  7. Jimmy Lin, Ian Milligan, Douglas Oard, Nick Ruest and Katie Shilton, We Could, But Should We? Ethical Considerations for Providing Access to GeoCities and Other Historical Digital Collections, CHIIR, 10 pages, Vancouver, BC, Canada, 2020. (PDF)
  8. Alexandra Olteanu, Jean Garcia-Gathright, Maarten de Rijke, and Michael D. Ekstrand (eds.) and Adam Roegiest, Aldo Lipani, Alex Beutel, Alexandra Olteanu, Ana Lucic, Ana-Andreea Stoica, Anubrata Das, Asia Biega, Bart Voorn, Claudia Hauff, Damiano Spina, David Lewis, Douglas W. Oard, Emine Yilmaz, Faegheh Hasibi, Gabriella Kazai, Graham McDonald, Hinda Haned, Iadh Ounis, Ilse van der Linden, Jean Garcia-Gathright, Joris Baan, Kamuela N. Lau, Krisztian Balog, Maarten de Rijke, Mahmoud Sayed, Maria Panteli, Mark Sanderson, Matthew Lease, Michael D. Ekstrand, Preethi Lahoti, and Toshihiro Kamishima (authors), FACTS-IR: Fairness, Accountability, Confidentiality, Transparency, and Safety in Information Retrieval, SIGIR Forum, (53)2, 2019. (PDF)
  9. Mahmoud F. Sayed, Douglas W. Oard: Jointly Modeling Relevance and Sensitivity for Search Among Sensitive Content. SIGIR, pp. 615-624, Paris, France, 2019. (PDF)
  10. Katie Shilton, Amy Wickner, Douglas W. Oard and Jimmy Lin, Protecting Sensitive Content in Email: Archival Views on Challenges and Opportunities, The First International Workshop on Privacy-Sensitive Collections for Digital Scholarship, 4 pages, Montreal, Canada, 2017. (PDF)
  11. Douglas W. Oard, Katie Shilton and Jimmy Lin, Evaluating Search Among Secrets, in The Seventh International Workshop on Evaluating Information Access, Tokyo, Japan, 2016. (PDF)

Math Search

The focus of this work is on searching mathematical content, or mixed math and text content.
  1. Behrooz Mansouri, Douglas W. Oard and Richard Zanibbi, DPRL Systems in the CLEF 2022 ARQMath Lab: Introducing MathAMR for Math-Aware Search, Working Notes of CLEF, 18 pages, 2022.
  2. Behrooz Mansouri, Douglas W. Oard, Anurag Agrawal and Richard Zanibbi, Effects of Context, Complexity, and Clustering on Evaluation for Math Formula Retrieval, arXiv preprint arXiv:2111.10504, 10 pages, 2021. (PDF)
  3. Behrooz Mansouri, Douglas W. Oard and Richard Zanibbi, DPRL Systems in the CLEF 2021 ARQMath Lab: Sentence-BERT for Answer Retrieval, Learning-to-Rank for Formula Retrieval, Working Notes of CLEF, pp. 47-62, 2021. (PDF)
  4. Behrooz Mansouri, Richard Zanibbi and Douglas W. Oard, Learning to Rank for Mathematical Formula Retrieval. SIGIR, pp. 952-961, 2021. (PDF)
  5. Behrooz Mansouri, Douglas W. Oard and Richard Zanibbi, DPRL Systems in the CLEF 2020 ARQMath Lab. CLEF Working Notes, 12 pages, 2020. (PDF)
  6. Behrooz Mansouri, Shaurya Rohatgi, Douglas W. Oard, Jian Wu, C. Lee Giles, Richard Zanibbi, Tangent-CFT: An Embedding Model for Mathematical Formulas, ICTIR, pp. 11-18, Santa Clara, CA, 2019. (PDF)
  7. Behrooz Mansouri, Richard Zanibbi and Douglas Oard, Toward Math-Enabled Digital Libraries: Characterizing Searches for Mathematical Concepts, Joint Conference on Digital Libraries, Urbana, IL, 2019. (PDF)

Microblog Search

The focus of this work is on searching short text posted to Twitter and similar services.
  1. Mossaab Bagdouri and Douglas W. Oard, CLIP at TREC 2016: LiveQA and RTS, The Twenty-Fifth Text Retrieval Conference, 6 pages, Gaithersburg, MD, 2016. (PDF)
  2. Mossaab Bagdouri and Douglas W. Oard, CLIP at TREC 2015: Microblog and Live QA," in The Twenty-Fourth Text Retrieval Conference, 8 pages, Gaithersburg, MD, 2015. (PDF)
  3. Mossaab Bagdouri and Douglas W. Oard, "Profession-Based Person Search in Microblocs: Using Seed Sets to Find Journalists," in Proceedings of the 24rd Annual International ACM CIKM Conference on Information and Knowledge Management, 10 pages, Melbourne, Australia, 2015. (PDF)
  4. Mossaab Bagdouri and Douglas W. Oard, "On Prediccoting Deletions of Microblog Posts," in Proceedings of the 24rd Annual International ACM CIKM Conference on Information and Knowledge Management, 4 pages, Melbourne, Australia, 2015. (PDF)
  5. Tan Xu, Paul McNamee, and Douglas W. Oard, "HLTCOE at TREC 2014: Microblog and Clinical Decision Support", in The Twenty-Third Text Retrieval Conference, 8 pages, Gaithersberg, MD, 2014. (PDF)
  6. Tan Xu and Douglas W. oard, "Wikipedia-Based Topic Clustering for Microblogs," 10 pages, Annual Meeting of the American Society for Information Science and Technology, New Orleans, LA, 2011. (PDF)

E-Discovery

The focus of this work is on the design and evaluation of systems that can support the exchange of documentary evidence among litigants incident to civil litigation. The word "discovery" is also used with other meanings by information retrieval researchers, but here it is used in the legal sense.
  1. Douglas W. Oard, Fabrizio Sebastiani and Jyothi K. Vinjumur, Jointly Minimizing the Expected Costs of Review for Responsiveness and Privilege in E-Discovery, ACM Transactions on Information Systems, 37(1)11:1-11:35, 2018. (PDF, Publisher, SIGIR 2020 Talk (MP4), SIGIR slides (PPTX)). Figure 4 in the paper has incorrect colors in the legend; a corrected figure is available.
  2. Douglas W. Oard, Jyothi Vinjumur and Fabrizio Sebastiani, When is it Rational to Review for Privilege? ICAIL DESI VII Workshop on Using Advanced Data Analysis in eDiscovery and Related Disciplines to Identify and Protect Sensitive Information in Large Collections, 10 pages, London, UK, 2017. (PDF)
  3. William Webber and Douglas W. Oard, "Metrics in Predictive Coding," Perspectives on Predictive Coding and Other Advanced Search and Review Technologies for the Legal Practitioner, American Bar Association, 2016.
  4. Jyothi K. Vinjumur, Douglas W. Oard and Amittai Axelrod, "An AID for Avoiding Inadvertent Disclosure: Supporting Interactive Review for Privilege in E-Discovery," in ACM SIGIR Conference on Human Information Interaction and Retrieval, 10 pages, Chapel Hill, NC, 2016. (PDF)
  5. Jyothi K. Vinjumur and Douglas W. Oard, "Finding the privileged Few: Supporting Privilege Review for E-Discovery," in Annual Meeting of the Association for Information Science and Technology, 4 pages, St. Louis, MO, 2015. (PDF)
  6. Jyothi K. Vinjumur, Douglas W. Oard and Jiaul H. Paik, "Assessing the Reliability and Reusability of an E-Discovery Privilege Test Collection," in 37th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, 4 pages, Gold Coast, Australia, 2014. (PDF)
  7. Mossaab Bagdouri, William Webber, David D. Lewis and Douglas W. Oard, "Towards Minimizing the Annotation Cost of Certified Text Classification," in ACM Conference on Information and Knowledge Management, 10 pages, San Francisco, CA, 2013. (PDF)
  8. William Webber, Mossaab Bagdouri, David D. Lewis and Douglas W. Oard, "Sequential Testing in Classifier Evaluation Yields Biased Estimates of Effectiveness," in 36th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, 4 pages, Dublin, Ireland, 2013. (PDF)
  9. Feng Charlie Zhao, Douglas W. Oard and Jason R Baron, "Improving Search Effectiveness in the Legal E-Discovery Process Using Relevance Feedback," in Third International Workshop on Discovery of Electronically Stored Information (DESI III), 10 pages, Barcelona, Spain, 2009. (PDF)

Document Image Retrieval

These papers address techniques for searching scanned documents. Papers reporting on the evaluation design for the TREC Legal Track, which included scanned documents, can be found in the evaluation design section above.
  1. Rajiv Jain, Douglas W. Oard and David Doermann, Scalable Ranked Retrieval Using Document Images, in 21st SPIE Document Recognition and Retrieval Conference, 15 pages, San Francisco, CA, 2014. (PDF)
  2. Utpal Garain, Arjun Das, David Doermann and Douglas Oard, Leveraging Statistical Transliteration for Dictionary-Based English-Bengali CLIR of OCR'd Text, in 24th International Conference on Computational Linguistics, 9 pages, Mumbai, India, 2012. (PDF)
  3. Lidan Wang and Douglas W. Oard, "Query Expansion for Noisy Legal Documents," in The Sixteenth Text Retrieval Conference, 9 pages, Gaithersburg, MD, 2008.
  4. Douglas Oard, Tamer Elsayed, Jianqiang Wang, Yejun Wu, Pengyi Zhang, Eileen Abels, Jimmy Lin and Dagobert Soergel, TREC-2006 at Maryland: Blog, Enterprise, Legal and QA Tracks," in The Fifteenth Text Retrieval Conference, 16 pages, Gaithersburg, MD, 2006. (PDF)
  5. Kareem Darwish and Douglas W. Oard, "Balanced Query Methods for OCR-Based Retrieval," 2003 Symposium on Document Image Understanding Technology, Greenbelt, MD, 2003. (PDF)
  6. Kareem Darwish and Douglas W. Oard, "Term Selection for Searching Printed Arabic," in Twenty-Fifth International ACM-SIGIR Conference on Research and Development in Information Retrieval, Tampere, Finland, pp. 261-268, 2002. (PDF)
  7. Yuen-Hsien Tseng and Douglas W. Oard, "Document Image Retrieval Techniques for Chinese," 2001 Symposium on Document Image Understanding Technology, pp. 151-158, Columbia, MD, 2001. (PDF)
  8. Douglas W. Oard, "Issues in Cross-Language Retrieval from Document Image Collections," 1999 Symposium on Document Image Understanding Technology, pp. 229-234, Annapolis, 1999. (PDF)

Archival Access

Some of my papers refer to archives simply with the broad meaning "collections of content," but the papers in this section are focused on learning to find content in archival institutions.
  1. Douglas W. Oard, Tokinori Suzuki, Emi Ishita and Noriko Kando, Searching Unseen Sources for Historical Information: Evaluation Design for the NTCIR-18 SUSHI Pilot Task, SIGIR-AP Workshop on Evaluation Methodologies, Testbeds and Community for Information Access Research, 8 pages, 2024. (PDF)
  2. Tokinori Suzuki, Douglas W. Oard, Emi Ishita and Yoichi Tomiura, Searching for Physical Documents in Archival Repositories, Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, Washington DC, 5 pages, 2024. (PDF)
  3. Tokinori Suzuki, Douglas Oard, Emi Ishita and Yoichi Tomiura, Automatically Detecting Referencesfrom the Scholarly Literature to Records in Archives, International Conference on Asian Digital Libraries, Taipei, 2023. (PDF)
  4. Douglas W. Oard, Known by the Company it Keeps: Proximity-Based Indexing for Physical Content in Archival Repositories, TPDL, 14 pages, 2023 (PDF)

Computational Social Science

These papers involve the application of computational techniques to foster social science research. Many of my other papers also address issues that have potential application to social science research; what distinguishes these papers is that supporting social science research was the principal motivation for this work.
  1. Satoshi Fukuda, Emi Ishita, Yoichi Tomiura and Douglas W. Oard, Automating the Choice Between Single or Dual Annotation for Classifier Training, International Conference on Asian Digital Libraries, 15 pp., 2021. (PDF)
  2. Emi Ishita, Satoshi Fukuda, Toru Oga, Yoichi Tomiura, Douglas W. Oard and Kenneth. R. Fleischmann, Cost-Effective Learning for Classifying Human Values, iConference, poster paper, Boras, Sweden, 2020. (PDF)
  3. Emi Ishita, Satoshi Fukuda, Toru Oga, Douglas W. Oard, Kenneth R. Fleischmann, Yoichi Tomiura and An-Shou Cheng, Toward Three-Stage Automation of Annotation for Human Values, iConference, College Park, MD, 2019. (PDF)
  4. Emi Ishita, Toru Oga, Yasuhiro Takayama, An-Shou Cheng, Douglas W. Oard and Kenneth R. Fleischmann, Yoichi Tomiura, Toward Automating Detection of Human Values in the Nuclear Power Debate, 80th Annual Meeting of the Association for Information Science and Technology, 2 pages, Washington, DC, 2017. (PDF)
  5. Yasuhiro Takayama, Yoichi Tomiura, Kenneth R. Fleischmann, Douglas W. Oard, An-Shou Cheng and Emi Ishita, "An Automatic Dictionary Extraction and Annotation Method Using Simulated Annealing for Detecting Human Values," Sixth International Conference on E-Service and Knowledge Management, Okayama, Japan, 2015. (PDF)
  6. Emi Ishita, Douglas W. Oard, Kenneth R. Fleischmann, Yoichi Tomiura, Yasuhiro Takayama and An-Shou Cheng, "Learning curves for automating content analysis: How much human annotation is needed?," Sixth International Conference on E-Service and Knowledge Management, Okayama, Japan, 2015. (PDF)
  7. Kenneth R. Fleischmann, Yasuhiro Takayama, An-Shou Cheng, Yoichi Tomiura, Douglas W. Oard and Emi Ishita, "Thematic Analysis of Words that Invoke Values in the Net Neutrality Debate," iConference, 6 pages, Newport Beach, CA, 2015. (PDF)
  8. Yasuhiro Takayama, Yoichi Tomiura, Emi Ishita, Douglas W. Oard, Kenneth R. Fleischmann, and An-Shou Cheng, "A Word-Scale Probabilistic Latent Variable Model for Detecting Human Values," in ACM International Conference on Information and Knowledge Management, 10 pages, Shanghai, China, 2014. (Corrected PDF, Corrections from published version, PDF)
  9. Yasuhiro Takayama, Yoichi Tomiura, Emi Ishita, Zheng Wang, Douglas Oard, Kenneth Fleischmann and An-Shou Cheng, "Improving Automatic Sentence-Level Annotation of Human Values Using Augmented Feature Vectors," in Conference of the Pacific Association for Computational Linguistics, 6 pages, Tokyo, Japan, 2013. (PDF)
  10. An-Shou Cheng, Kenneth R. Fleischmann, Ping Wang, Emi Ishita, and Douglas W. Oard, The Role of Innovation and Wealth in the Net Neutrality Debate: A Content Analysis of Human Values in Congressional and FCC Hearings, Journal of the American Society for Information Science and Technology (JASIST), 63(7)1360-1373, 2012. (PDF) (Publisher)
  11. Kenneth R. Fleischmann, Douglas W. Oard, An-Shou Cheng, Jordan Boyd-Graber, Thomas Clay Templeton, Emi Ishita, Jes A. Koepfler, and William A. Wallace, "Content Analysis for Values Elicitation," Proceedings of the CHI Workshop on Methods for Accounting for Values in Human-Centered Computing, 4 pages, Austin, TX, 2012. (PDF)
  12. Pranav Anand, Joseph King, Jordan Boyd-Graber, Earl Wagner, Craig Martell, Doug Oard, and Philip Resnik, "Believe Me -- We Can Do This! Annotating Persuasive Acts in Blog Text", AAAI Workshop on Computational Models of Natural Argument, San Francisco, CA, 2011. (PDF)
  13. Emi Ishita, Douglas W. Oard, Kenneth R. Fleischmann, An-Shou Cheng and Thomas Clay Templeton, "Investigating Multi-Label Sentence Classification for Human Values," Annual Conference of the American Society for Information Science and Technology, 4 pages, Pittsburgh, PA, 2010. (PDF)
  14. An-Shou Cheng, Kenneth R. Fleischmann, Ping Wang, Emi Ishita and Douglas W. Oard, "Values of Stakeholders in the Net Neutrality Debate: Applying Content Analysis to Telecommunications Policy," in Hawaii International Conference on System Sciences, 10 pages, Kauai, HI, 2010. (PDF)
  15. Chia-Jung Tsui, Ping Wang, Kenneth R. Fleischmann, Douglas W. Oard and Asad B. Sayeed, Exploring the Relationships among ICTs: A Scalable Computational Approach Using KL Divergence and Hierarchical Clustering," in Hawaii International Conference on System Sciences, 10 pages, Kauai, HI, 2010. (PDF)
  16. Emi Ishita, An-Shou Chen, Douglas W. Oard and Kenneth R. Fleischmann, "Multi-label Classification for Human Values" (in Japanese), in Annual Conference of the Japan Society of Library and Information Science, 4 pages, Tokyo, Japan, 2009. (PDF)
  17. Chia-Jung Tsui, Ping Wang, Kenneth R. Fleischmann, Douglas W. Oard and Asad B. Sayeed, "Understanding IT Innovations through Computational Analysis of Discourse," in International Conference on Information Systems, 9 pages, Phoenix, AZ, 2009. (PDF)
  18. Kenneth R. Fleischmann, Douglas W. Oard, An-Shou Cheng, Ping Wang, and Emi Ishita, "Automatic Classification of Human Values: Applying Computational Thinking to Information Ethics," Annual Conference of the Association for Information Science and Technology, Vancouver, 2009. (Publisher)
  19. Ping Wang, Chia-Jung Tsui, Kenneth R. Fleischmann, Douglas W. Oard and Lidan Wang, "Understanding IT Innovations Through Discourse Analysis," Fourth iSchools Conference, 3 pages, Chapel Hill, 2009. (PDF)
  20. An-Shou Cheng, Kenneth R. Fleischmann, Ping Wang and Douglas W. Oard, "Advancing Social Science Research by Applying Computational Linguistics," in Proceedings of the Annual Conference of the American Society for Information Science and Technology, 12 pages, Columbus, 2008. (PDF)

Information Integration

These papers address issues that involve structured representation of information found in (or that can be inferred from) unstructured documents. This includes my work on the narrower problems of information extraction, co-reference resolution, and text classification. My principal interest is in how these techniques can be employed in integrated systems that are designed to satisfy specific types of information needs.
  1. Joe Barrow, Rajiv Jain, Nedim Lipka, Franck Dernoncourt, Vlad I. Morariu, Varun Manjunatha, Douglas W. Oard, Philip Resnik and Henning Wachsmuth, Syntopical Graphs for Computational Argumentation Tasks, Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, pp. 1583-1595. 2021. (PDF)
  2. Joe Barrow, Rajiv Jain, Vlad I. Morariu, Varun Manjunatha, Douglas W. Oard, Philip Resnik, A Joint Model for Document Segmentation and Segment Labeling, ACL, pp. 313-322, 2020. (PDF)
  3. Rashmi Sankepally; Tongfei Chen; Benjamin Van Durme; Douglas W. Oard, "A Test Collection for Coreferent Mention Retrieval", 4 pages, SIGIR, Ann Arbor, MI, 2018. (PDF)
  4. Ning Gao, Mark Dredze and Douglas W. Oard, Enhancing Scientific Collaboration Through Knowledge Base Population and Linking for Meetings, Hawaii International Conference on System Sciences, Waikoloa, HI, 2018. (PDF)
  5. Ning Gao, Mark Dredze and Douglas W. Oard, Person Entity Linking in Email with NIL Detection, Journal for the Association for Information Science and Technology, 68(10)2412-2424, 2017. (Publisher)
  6. Ning Gao, Mark Dredze and Douglas W. Oard, Knowledge-Based Population for Organization Mention in Email, in 5th Workshop on Automated Knowledge Base Conttruction,, 5 pages, 2016. (PDF)
  7. Tim Finin, Dawn Lawrie, Paul McNamee, James Mayfield, Douglas Oard, Nanyun Peng, Ning Gao, Yiu-Chang Lin, Josh MacLin and Tim Dowd, HLTCOE Participation in TAC KBK 2015: Cold Start and TEDL, in Proceedings of the Text Analysis Conference, 14 pages, Gaithersburg, MD, 2015. (PDF)
  8. Ning Gao, Douglas Oard and Mark Dredze, A Test Collection for Email Entity Linking, 4th NIPS Workshop on Automated Knowledge Base Construction (AKBC), 5 pages, Montreal, Canada, 2013. (PDF)
  9. Dawn Lawrie, James Mayfield, Paul McNamee and Douglas W. Oard, "Cross-Language Person-Entity Linking from 20 Languages," Journal of the Association for Information Science and Technology (JASIST), 66(6)2091-1105, 2015. (preprint: PDF) (Publisher)
  10. Hui Su, Adi Hajj-Ahmad, Min Wu and Douglas W. Oard, "Exploring the Use of ENF for Multimedia Synchronization," IEEE International Conference on Acoustics, Speech, and Signal Processing, 5 pages, Florence, Italy, 2014. (PDF)
  11. Douglas W. Oard, Min Wu, Kari Kraus, Adi Hajj-ahmad, Hui Su and avi Garg, "Its About Time: Projecting Temporal Metadata for Historically Significant Recordings," 7 pages, iConference, Berlin, Germany, 2014. (PDF)
  12. Paul McNamee, James Mayfield, Tim Finin, Tim Oates, Dawn Lawrie, Tan Xu and Douglas Oard, "KELVIN: A Tool for Automated Knowledge Base Construction," in Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 4 page demonstration paper, Atlanta, GA, 2013. (PDF)
  13. Paul McNamee, Veselin Stoyanov, James Mayfield, Tim Finin, Tim Oates, Tan Xu, Douglas W. Oard and Dawn Lawrie, "HLTCOE Participation at TAC 2012: Entity Linking and Cold Start Knowledge Base Construction," in Proceedings of the Text Analysis Conference, 11 pages, Gaithersburg, MD, 2012. (PDF)
  14. Dawn Lawrie, James Mayfield, Paul McNamee and Douglas Oard, "Creating and Curating a Cross-Language Entity Linking Collection," 8th International Conference on Language Resources and Evaluation, 5 pages, Istanbul, Turkey, 2012. (PDF)
  15. Paul McNamee, James Mayfield, Douglas W. Oard, Tan Xu, Wu Ke, Veselin Stoyanov and David Doermann, "Cross-Language Entity Linking in Maryland During a Hurricane," in Proceedings of the Text Analysis Conference, 11 pages, Gaithersburg, MD, 2011. (PDF)
  16. Jun Gong, Lidan Wang and Douglas W. Oard, "Matching Person Names Through Name Transformation, in ACM Conference on Information and Knowledge Management, 4 pages, Hong Kong, China, 2009. (PDF)
  17. Yejun Wu and Douglas W. Oard, "Beyond Topicality, Finding Opinionated Documents," Annual Conference of the Association for Information Science and Technology, Vancouver, 2009. (PDF)
  18. Jun Gong and Douglas W. Oard, "Selecting Hierarchical Clustering Cut Points for Web Person-Name Disambiguation," Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, Boston, 2009. (PDF)
  19. Asad Sayeed, Tamer Elsayed, Nikesh Garera, David Alexander, Tan Xu, Douglas W. Oard, David Yarowsky and Christine Piatko, Arabic Cross-Document Coreference Resolution, Annual Conference of the Association for Computational Linguistics / International Joint Conference on Natural Language Processing, pp. 357-360, Singapore, 2009. (PDF)
  20. James Mayfield, David Alexander, Bonnie Dorr, Jason Eisner, Tamer Elsayed, Tim Finin, Clay Fink, Marjorie Freedman, Nikesh Garera, Paul McNamee, Saif Mohammad, Douglas W. Oard, Christine Piatko, Asad Sayeed, Zarem Syed, Ralph Weischedel, Tan Xu and David Yarowsky, "Cross-Document Coreference Resolution: A Key Technology for Learning by Reading," AAAI Spring Symposium on Learning by Reading and Learning to Read, 6 pages, Stanford, 2009. (PDF)
  21. James Mayfield, Bonnie J. Dorr, Tim Finin, Douglas W. Oard and Christine Piatko, "Knowledge Base Evaluation for Semantic Knowledge Discovery," in Symposium on Syntactic Knowledge Discovery, Organization and Use, New York, 2 pages, 2008. (PDF)
  22. Tan Xu, Douglas W. Oard, Tamer Elsayed and Asad Sayeed, "Knowledge Representation from Information Extraction," Joint Conference on Digital Libraries, Pittsburgh, p. 475, 2008. (PDF)
  23. Yejun Wu and Douglas W. Oard, "NTCIR-6 at Maryland: Chinese Opinion Analysis Pilot Task," in Proceedings of the Sixth NTCIR Workshop, Tokyo, 6 pages, 2007. (PDF)
  24. J. Scott Olsson and Douglas W. Oard, "Evaluating Feature Selection Combination Methods for Automatic Text Classification," in Conference on Information and Knowledge Management, Arlington, VA, pp. 798-799, 2006. (PDF)
  25. Douglas W. Oard, "Integration of Natural Language with Structured Data: Three Test Collections," Information Integration Workshop, Philadelphia, 2 pages, 2006. (PDF)
  26. Dina Demner-Fushman, Philip Resnik and Douglas W. Oard. "Genomic Entity Recognition at TREC," JCDL TREC Genomics Pre-Track Workshop, Portland, 2002. (PDF)
  27. Paul Losiewicz, Douglas W. Oard and Ronald N. Kostoff, "Textual Data Mining to Support Science and Technology Management," Journal of Intelligent Information Systems, 15(2)99-119, 2000. (PDF)

Recommender Systems

These papers address techniques for recommending new content to users based on learned representations of the stable interests of those users. The term "recommender systems" is used expansively here to include both content-based and behavior-based systems, and systems that rely on either explicit or implicit feedback from the user.
  1. Melanie Gnasa, Armin B. Cremers and Douglas W. Oard, "ISKADOR: Unified User Modeling for Integrated Searching," in 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Amsterdam, p. 898, 2007. (PDF)
  2. Penelope Brooks, Khoo Yit Phang, Douglas W. Oard, Ryen W. White, Rachael Bradley, and Francois Guimbretiere, "Measuring the Utility of Gaze Detection for Task Modeling: A Preliminary Study," in IUI-2006 Workshop on Intelligent User Interfaces for Intelligence Analysis, Sydney, Australia, 4 pages, 2006. (PDF)
  3. Tamer Elsayed and Douglas W. Oard, "On Evaluation of Adaptive Topic Tracking Systems," in Proceedings of the 28th Annual ACM SIGIR Conference on Research and Development in Information Retrieval, poster paper, pp. 597-598, 2005. (PDF)
  4. Douglas W. Oard, Anton Leuski and Stuart Stubblebine, "Protecting the Privacy of Observable Behavior in Distributed Recommender Systems," ACM SIGIR Workshop on Implicit Methods, Toronto, Canada, 4 pages, 2003. (PDF)
  5. Jinmook Kim and Douglas W. Oard, "Observable Behavior for Implicit User Modeling: A Framework for User Studies," in Journal of the Korean Society for Library and Information Science, volume 35, pp. 173-189, 2001. (PDF)
  6. Douglas W. Oard and Jinmook Kim, "Modeling Information Content Using Observable Behavior," in Proceedings of the 64th Annual Conference of the American Society for Information Science and Technology, pp. 481-488, Washington, 2001. (PDF)
  7. Jinmook Kim, Douglas W. Oard and Kathleen Romanik, "User Modeling for Information Access Based on Implicit Feedback," in Third ISKO Workshop on Information Filtering, pp. 25-37, Paris, 2001. (PDF)
  8. Jinmook Kim, Douglas W. Oard and Kathleen Romanik. Using implicit feedback for user modeling in internet and intranet searching. University of Maryland CLIS Technical Report 00-01, 2000. (PDF)
  9. Douglas W. Oard and Jinmook Kim, "Implicit Feedback for Recommender Systems," in AAAI Workshop on Recommender Systems, pp. 81-83, Madison, WI, 1998. (PDF)
  10. Douglas W. Oard, Nicholas DeClaris, Bonnie J. Dorr, and Christos Faloutsos, "High Performance Cognitive and Interactive Text Filtering," Proceedings of IEEE International Conference on Systems, Man, and Cybernetics, Volume V, pp. 4398-4403, Vancouver, Canada, 1995. (PDF)

Other Topics

Papers on topics that are new to me will initially show up in this category, and then ultimately perhaps become the anchor of a category of their own.
  1. Dawn Larwrie, Efsun Kayi, Eugene Yang, James Mayfield and Douglas W. Oard, PLAID SHIRTTT for Large-Scale Streaming Dense Retrieval, Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, Washington DC, 5 pages, 2024. (PDF)
  2. Nathaniel W. Rollings, Peter A. Rankel and Douglas W. Oard, Multi-Faceted Question Fusion in the TREC 2022 CrisisFACTS Track, TREC, 2022. (PDF)
  3. Xin Qian, Douglas W. Oard and Joel Chan, Conversational Interaction with Historical Figures: What’s it good for?, iConference, 17 pages, 2022. (PDF)
  4. Xin Qian and Douglas W. Oard, Full-Collection Search with Passage and Document Evidence: Maryland at the TREC 2021 Conversational Assistance Track, TREC, 9 pages, 2021. (PDF)
  5. Han-Chin Shing, Chaitanya Shivade, Nima Pourdamghani, Feng Nan, Philip Resnik, Douglas Oard and Parminder Bhatia, Towards Clinical Encounter Summarization: Learning to Compose Discharge Summaries from Prior Notes, Preprint, CoRR abs/2104.13498, 12 pp., 2021. (PDF Preprint)
  6. Mahmoud F. Sayed and Douglas W. Oard, The University of Maryland at the TREC 2020 Fair Ranking Track, TREC, 4 pages, 2020. (PDF)
  7. Han-Chin Shing, Philip Resnik, Douglas W. Oard, A Prioritization Model for Suicidality Risk Assessment, ACL, pp. 8124-8137, 2020. (PDF)
  8. Tokinori Suzuki, Daisuke Ikeda, Petra Galuščáková, Douglas W. Oard, Towards Automatic Cataloging of Image and Textual Collections with Wikipedia, ICADL, pp. 167-180, Kuala Lumpur, Malaysia, 2019. (PDF)
  9. Kristine Rogers and Douglas W. Oard, UMD_CLIP: Using Relevance Feedback for Find Diverse Documents for TREC Dynamic Domain 2017, Working Notes of the Twenty-Sixth Text Retrieval Conference, Gaithersburg, MD, 2017. (PDF)
  10. Mossaab Bagdouri and Douglas W. Oard, Building Bridges Across Social Platforms: Answering Twitter Questions with Yahoo! Answers, 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, short paper, 4 pages, Tokyo, Japan, 2017. (PDF)
  11. Jiaul H. Paik and Douglas W. Oard, A Fixed-Point Method for Weighting Terms in Verbose Informational Queries, in ACM International Conference on Information and Knowledge Management, 10 pages, Shanghai, China, 2014. (PDF)
  12. Tanya Clement, Kari Kraus, Jentery Sayers, Whitney Trettien, David Tcheng, Loretta Auvil, Tony Borries, Min Wu, Doug Oard, Adi Hajj-Ahmad, Hui Su, Mary Caton Lingold, Daren Mueller, William J. Turkel, and Devon Elliott, "Digital Humanities: The Intersections of Sound and Method," panel abstract in Digital Humanities Conference, Lausanne, Switzerland, 2014. (PDF)
  13. Katie Shilton, Michael Kurtz, Bruce Ambacher, Erik Mitchell, Douglas Oard and Ann Weeks, "Bridging By Design: The Curation and Management of Digital Assets Specialization at the University of Maryland," in Proceedings of the Framing the Digital Curation Curriculum Conference (DigCurV), 5 pages, Florence, Italy, 2013. (PDF)
  14. Tan Xu, Paul McNamee and Douglas W. Oard, "HLTCOE at TREC 2013: Temporal Submission," in The Twenty-Second Text Retrieval Conference, 8 pages, Gaithersberg, MD, 2013. (PDF)
  15. Douglas W. Oard and Noriko Kando, "Extrinsic Evaluation of Patent MT, in Fifth International Workshop on Evaluating Information Access, 5 pages, Tokyo, Japan, 2013. (PDF)
  16. Keith C. Walker and Douglas W. Oard, "Extending Argument Maps to Provide Decision Support for Rulemaking," in Hawaii International Conference on System Sciences, 10 pages, Maui, HI, 2013. (PDF)
  17. Amalia S. Levi and Douglas W. Oard, "From Personal Narratives to Collective Memory: Spinning a Web from Oral History," in XVII International Oral History Association Conference, 31 pages, Buenos Aires, Argentina, 2012. (PDF)
  18. Pengyi Zhang, Dagobert Soergel, Judith L. Klavans and Douglas W. Oard, "Extending Sense-Making Models with Ideas from Cognition and Learning Theories," in Proceedings of the Annual Conference of the American Society for Information Science and Technology, 12 pages, Columbus, 2008. (PDF)
  19. Tamer Elsayed, Jimmy Lin and Douglas W. Oard, "Pairwise Document Similarity for Large Collections with MapReduce," Annual Conference of the Association for Computational Linguistics-Human Language Technology Conference, Columbus, OH, companion volume, pp. 265-268, 2008. (PDF)
  20. Ashwin Swaminathan, Yinian Mao, Guan-Ming Su, Hongmei Gou, Avinash L Varna, Shan He, Min Wu and Douglas W. Oard, "Confidentiality-Preserving Rank-Ordered Search," ACM Workshop on Storage, Security and Survivability, Alexandria, VA, 6 pages, 2007. (PDF)
  21. Kareem Darwish and Douglas W. Oard, "Adapting Morphology for Arabic Information Retrieval," in Abdelhadi Soudi, Gunter Neumann and Antal Van den Bosch (eds.), Arabic Computational Morphology: Knowledge-based and Empirical Methods, Kluwer/Springer Series on Text, Speech, and Language Technology, 2006. (PDF) (Publisher)
  22. Wilma Bainbridge, Douglas W. Oard and Ryen White, "An Interface to Search Human Movements Based on Geographic and Chronological Metadata," in Proceedings of the 28th Annual ACM SIGIR Conference on Research and Development in Information Retrieval, poster paper, pp. 579-580, 2005. (PDF)
  23. Daqing He, Dina Demner-Fushman, Douglas W. Oard, Damianos Karakos, and Sanjeev Khudanpur, "Improving Passage Retrieval Using Interactive Elicitation and Statistical Modeling," in The Thirteenth Text Retrieval Conference, Gaithersburg, MD, 8 pages, 2004. (PDF)
  24. Douglas W. Oard, Sheldon Wolk and Anthony Ephremides, "On The Integrated Scheduling of Hardkill and Softkill Assets Using Dynamic Programming," Naval Research Laboratory, 1994. (PDF)

Project Pages

When research projects create a project specific page, I will generally include a link here. Some very old projects are not included.
  1. Safely Searching Among Sensitive Content
  2. ArQAT: Arabic Question Answering in Twitter
  3. Text Classification for Human Values
  4. E-Discovery
  5. Oral History in the Digital Age
  6. PopIT
  7. JIKD
  8. MALACH
  9. US/EU Digital Library Spoken Word Archive Group

Edited Works

    ACM TALIP Special Issue on the TIDES Surprise Language
    A pair of special issues (June and September 2003) of the ACM Transactions on Asian Language Information Processing that I edited. Membership in the ACM Digital Library is needed to access the articles.
    Team TIDES Newsletter
    The newsletter for the DARPA Translingual Information Detection Extraction and Summarization (TIDES) program. I edited the first two (December 2002 and April 2003) and helped out with the third (October 2003). The April 2003 and October 2003 issues contain articles that I wrote about the surprise language exercises.

Talk Videos

It is becoming more common to post recorded talks. Here are a few from around the Web that I know of.
  1. Speaking with the Past: Novel forms of access to spoken word collections, Center for Archival Futures Speaker Series, University of Maryland, College Park, 2022.
  2. Search the World: Cross-Language Informaton Retrieval, Search Mastery Speaker Series, University of Maryland, College Park, 2021 (correction: 2lingual is not owned by Google!)
  3. Search Among Sensitive Content, European Conference on Information Retrieval Tutorial, 2021. This was a jointly presented tutorial with Graham McDonald, who spoke first.
  4. IR4All, Building Search Engines for Everyone, AFIRM 2020 ACM SIGIR/SIGKDD Africa Summer School on Machine Learning for Data Mining and Search.
  5. Search Among Secrets: Separating the Wheat from the Buzzsaw, Intelligent Systems Dotoral Program Seminar, UNED, Madrid, Spain, 2014.
  6. Thinking Big, 2012. This is a short video on using serch technology for access to oral history made for the Oral History in the Digital Age project, 2012.
  7. Who 'Dat: Identity Resolution in Large Email Collections, Microsoft Research, 2009.
  8. Nobody Writes Letters Anymore, MAVIR Seminar, UNED, Madrid, Spain, 2009.
  9. Oral History in the Digital Age, Library of Congress, 2012. This is a joint seminar series talk with Mark Kornbluh, who spoke first.
  10. Mandarin-English Information, Johns Hopkins University, 2000. This is a team talk led by Helen Meng, who spoke first.

Workshop Pages

These pages provide access to resources (e.g., papers) that were assembled for workshops and evaluation campaigns that I helped to organize.
  1. LREC 2020 Workshop on Cross-Language Search and Summarization of Text and Speech
  2. ICAIL 2017 Workshop on Discovery of Electronically Stored Information
  3. ICAIL 2015 Workshop on Discovery of Electronically Stored Information
  4. FIRE 2013 Question Answering for the Spoken Web (QASW) track.
  5. ICAIL 2013 Workshop on Discovery of Electronically Stored Information
  6. AAAI-2011 Workshop on Analyzing Microtext
  7. SIGIR 2011 Information Retrieval for E-Discovery Workshop
  8. ICAIL 2011 Workshop on Discovery of Electronically Stored Information
  9. First DC-area IR Experts (DIRE) Meeting
  10. TREC Legal Track
  11. Second Iternational Workshop on Supporting Search and Sense-making for Electronically Stored Information in Discovery Proceedings (DESI II)
  12. SIGIR 2007 Workshop on Searching Spontaneous Conversational Speech
  13. ICAIL 2007 Workshop on Discovery of Electronically Stored Information (DESI I)
  14. HLT 2004 Workshop on Interdisciplinary Approaches to Speech Indexing and Retrieval
  15. CLEF Interactive Track (iCLEF)
  16. TREC-2002 Arabic/English CLIR Track (TREC-2001 also available)
  17. 2001 Workshop on Evaluation of Interactive Cross-Language Retrieval
  18. Summer 2000 Johns Hopkins Workshop on Mandarin-English Information (MEI)
  19. 2000 Workshop on Interactive Searching of Foreign Language Collections
  20. 1999 Joint ACM Digital Library/SIGIR Workshop on Multilingual Information Discovery and AccesS
  21. AAAI Spring 1997 Symposium on Cross-Language Text and Speech Retrieval

Research Software

Some software that I have developed for my research projects can be downloaded from a page that describes the available files. All of this is now quite old.

Research Directories

Community-wide resources on subjects that have been on interest me. These pages are not actively maintained, so they are best thought of as a snapshot of what a field looked like long ago near the time I first built them.
Last modified: Wed Jun 8 08:18:28 2022
Doug Oard oard@umd.edu