Doug Oard's Research Page



This page contains a mix of peer-reviewed and unrefereed journal articles, book chapters, and conference and workshop papers, organized by subject and listed most-recent-first within a subject. Many of the papers are available here as PDF. Sometimes the PDF here is the initially submitted version rather than the version finally published (this should be clear from formatting); links to the publisher's Web site are provided for journal articles when possible. This page is sometimes updated less frequently than I would like, so if you are looking for something specific that is not yet here, let me know and I'll do my best to get it posted.

Papers Written for a Broad Audience

This section mixes topic overviews that were prepared for various venues with position papers that each describe a specific interest, some of which were prepared as a basis for discussion at a workshop.
  1. Douglas W. Oard and Joseph Malionek, "The Apollo Archive Explorer," in Joint Conference on Digital Libraries, 2 page demonstration description, Indianapolis, IN, 2013. (PDF)
  2. Douglas W. Oard and William Webber, "Information Retrieval for E-Discovery," Foundations and Trends in Information Retrieval, 7(2-3)100-237, 2013. (PDF) (Publisher)
  3. Douglas W. Oard, "Can Automatic Speech Recognition Replace Manual Transcription?," Oral History in the Digital Age (Web resource), 2012. (HTML)
  4. Douglas W. Oard, "A Whirlwind Tour of Automated Language Processing for the Humanities and Social Sciences," in Working Together or Apart: Promoting Digital Scholarship, Council on Library and Information Resources, 2009. (PDF)
  5. Douglas W. Oard, "Multilingual Information Access," in Encyclopedia of Library and Information Sciences, 3rd Ed., edited by Marcia J. Bates, Editor, and Mary Niles Maack, Associate Editor, Taylor & Francis, 2009. (PDF)
  6. Douglas W. Oard, "Unlocking the Potential of the Spoken Word," Science, 321(5897)1787-1788, 2008. (Publisher)
  7. Franciska de Jong, Douglas W. Oard, Willemijn Heeren and Roeland Ordelman, "Access to Recorded Interviews: A Research Agenda," ACM Journal on Computing and Cultural Heritage, 1(1)1-27, 2008. (PDF), (Publisher)
  8. Franciska de Jong, Douglas W. Oard, Roeland Ordelman and Stephan Raaijmakers, "Searching Spontaneous Conversational Speech," workshop report in SIGIR Forum, 41(2)104-108, 2007. (PDF)
  9. Douglas W. Oard, "Transcending the Tower of Babel: Supporting Access to Multilingual Information with Cross-Language Information Retrieval," in Robert Popp and John Yen, ed., Emergent Information Technologies and Enabling Policies for Counter-Terrorism, Prentice Hall, Chapter 15, pp. 299-314, 2006. (PDF)
  10. Douglas W. Oard, "Towards Analysis Tools for a Multilingual Blogsphere," in AAAI Spring Symposium on Computational Approaches to Analyzing Weblogs, Stanford, CA, 3 pages, 2006. (PDF)
  11. Jerry Goldman, Steve Renals, Steven Bird, Franciska de Jong, Marcello Federico, Carl Fleischhauer, Mark Kornbluh, Lori Lamel, Douglas W. Oard, Fabrizio Sebastiani, Claire Stewart and Richard Wright, "Transforming Access to the Spoken Word," International Journal on Digital Libraries, 5(4)287-298, 2005. (PDF), (Publisher)
  12. Douglas W. Oard, "The SIGIR Workshop Program," SIGIR Forum, 39(2)15-16, 2005. (PDF)
  13. Douglas W. Oard, "The Surprise Language Exercises," ACM Transactions on Asian Language Information Processing, 2(2)79-84, 2003. (PDF) (Publisher)
  14. Douglas W. Oard, "Coping with Surprise: Responsive Language Technology," Team TIDES, p. 2, October 2003. (PDF)
  15. Douglas W. Oard, "Surprise: It's Cebuano!," Team TIDES, pp. 2-3, April 2003. (PDF)
  16. Douglas W. Oard, "Interactive Cross-Language Information Retrieval," workshop report in SIGIR Forum, 35(1)1-3, 2001. (PDF)
  17. Judith Klavans, Eduard Hovy, Christian Fluhr, Robert Frederking, Douglas Oard, Akitoshi Okumura, Kai Ishikawa, and Kenji Satoh, "Multilingual (or Cross-Lingual) Information Retrieval," in Multilingual Information Management: Current Levels and Future Abilities, Eduard Hovy, Nancy Ide, Robert Frederking, Joseph Mariani, Antonio Zampolli (eds.), Chapter 2, pp. 35-56, 2001. (HTML)
  18. Douglas W. Oard and Anne R. Diekema, "Cross-Language Information Retrieval," in Martha Williams (ed.), Annual Review of Information Science and Technology, Volume 33, Chapter 6, pp. 223-256, 1998. (ASCII)
  19. Douglas Oard, Carol Peters, Miguel Ruiz, Robert Frederking, Judith Klavans, and Paraic Sheridan, "Multilingual Information Discovery and Access (MIDAS): A Joint ACM DL '99 / ACM SIGIR '99 Workshop," D-Lib Magazine, October, 1999. (HTML)
  20. Douglas W. Oard, "Extending Cross-Language Information Retrieval to a Global Scale," NSF Workshop on Multilingual Information Management, pp. 24-25, Granada, Spain, 1998. (PDF)
  21. Douglas W. Oard, "The State of the Art in Text Filtering." User Modeling and User Adapted Interaction, 7(3)141-178, 1997. (PDF)
  22. Douglas W. Oard, "Serving Users In Many Languages: Cross-Language Information Retrieval for Digital Libraries," D-Lib Magazine, December, 1997. (HTML)
  23. Douglas W. Oard, "Alternative Approaches for Cross-Language Text Retrieval," in AAAI Symposium on Cross-Language Text and Speech Retrieval, pp. 131-139, Palo Alto CA, 1997. (PDF)
  24. Douglas W. Oard, "Speech-Based Information Retrieval for Digital Libraries," AAAI Symposium on Cross-Language Text and Speech Retrieval, Palo Alto, CA, 1997. (PDF)
  25. Douglas W. Oard, "Cross-Language Text Retrieval Research in the USA," Third DELOS Workshop: Cross-Language Information retrieval, pp. 7-16, Zurich, 1997. (PDF)
  26. Douglas W. Oard and Bonnie J. Dorr, "A Survey of Multilingual Text Retrieval," University of Maryland Computer Science Department, 31 pp., CS-TR-3615, 1996. (PDF)
  27. Christos Faloutsos and Douglas Oard, "A Survey of Information Retrieval and Filtering Methods," University of Maryland Computer Science Department, 23 pp., CS-TR-3514, 1995. (PDF)

FIRE Track Overviews (2011-2013)

These are track overview papers that resulted from my work as a track coordinator in the Forum for Information Retrieval Evaluation (FIRE). These papers describe evaluation design issues for information retrieval systems that are designed to support a search for digital evidence in a litigation context. My own research on information retrieval techniques using these evaluation designs can be found below in the Document Image Retrieval and Speech sections.
  1. Douglas W. Oard, Jerome White, Jiaul Paik, Rashmi Sankepally and Aren Jansen, "The FIRE 2013 Question Answering for the Spoken Web Task," Fifth Forum for Information Retrieval Evaluation, 8 pages, New Delhi, India, 2013. (PDF)
  2. Utpal Garain, Jiaul Paik, Tamaltaru Pal, Prasenjit Majumder, David Doermann and Douglas W. Oard, "Overview of the FIRE 2011 RISOT Task," Third Forum for Information Retrieval Evaluation, pp. 159-163, Mumbai, India, 2011. (PDF)

TREC Legal Track Overviews (2006-2011)

These are track overview papers that resulted from my work as a track coordinator in the Text Retrieval Conference (TREC). These papers describe evaluation design issues for information retrieval systems that are designed to support a search for digital evidence in a litigation context. My own research on information retrieval techniques using these evaluation designs can be found below in the Document Image Retrieval and Email sections.
  1. Maura R. Grossman, Gordon V. Cormack, Bruce Hedin and Douglas W. Oard, "Overview of the TREC 2011 Legal Track," in Proceedings of the Twentieth Text Retrieval Conference, 20 pages, Gaithersburg, MD, 2011. (PDF)
  2. Douglas W. Oard, Jason R. Baron, Bruce Hedin, David D. Lewis and Stephen Tomlinson, "Evaluation of Information Retrieval for E-Discovery," Artificial Intelligence and Law, 18(4)347-386, 2010. (PDF) (Publisher)
  3. Gordon V. Cormack, Maura R. Grossman, Bruce Hedin, and Douglas W. Oard, "Overview of the TREC-2010 Legal Track," in Working Notes of the Nineteenth Text Retrieval Conference, pp. 30-38, Gaithersburg, MD, 2010. (PDF)
  4. William Webber, Douglas W. Oard, Falk Scholer and Bruce Hedin, "Assessor Error in Stratified Evaluation," in The 18th ACM International Conference on Information and Knowledge Management, 10 pages, Toronto, Canada, 2010. (PDF)
  5. Bruce Hedin, Stephen Tomlinson, Jason R. Baron and Douglas W. Oard, "Overview of the TREC 2009 Legal Track," in Proceedings of the Eighteenth Text Retrieval Conference, 40 pages, Gaithersburg, MD, 2009. (PDF)
  6. Bruce Hedin and Douglas W. Oard, "Replication and Automation of Expert Judgments: Information Engineering in Legal E-Discovery," in IEEE Conference on Systems, Man and Cybernetics, 6 pages, San Antonio, TX, 2009. (PDF)
  7. Douglas W. Oard, Bruce Hedin, Stephen Tomlinson and Jason R. Baron, "Overview of the TREC 2008 Legal Track," in The Seventeenth Text Retrieval Conference, Gaithersburg, MD, 45 pages, 2008. (PDF)
  8. Stephen Tomlinson, Douglas W. Oard, Jason R. Baron and Paul Thompson, "Overview of the TREC 2007 Legal Track," in The Sixteenth Text Retrieval Conference, Gaithersburg, MD, 34 pages, 2007. (PDF)
  9. Jason R. Baron, David D. Lewis and Douglas W. Oard, "The TREC-2006 Legal Track," in The Fifteenth Text Retrieval Conference, Gaithersburg, MD, 20 pages, 2006. (PDF)

CLEF Cross-Language Speech Retrieval Track Overviews (2005-2007)

These are track overview papers that resulted from my work as a track coordinator in the Cross-Language Evaluation Forum (CLEF). These papers describe evaluation design issues for information retrieval from spontaneous speech, regardless of the query language. My own research on information retrieval techniques using these evaluation designs can be found below in the Speech Retrieval section.
  1. Pavel Pecina, Petra Hoffmannova, Gareth J.F. Jones, Ying Zhang and Douglas W. Oard, "Overview of the CLEF-2007 Cross-Language Speech Retrieval Track," in Advances in Multilingual and Multimodal Information Retrieval, Revised Selected Papers, CLEF 2007, Springer-Verlag, LNCS (5152), Budapest, pp. 674-686, 2007. (PDF)
  2. Douglas W. Oard, Jianqiang Wang, Gareth J.F. Jones, Ryen White, Pavel Pecina, Dagobert Soergel, Xiaoli Huang and Izhak Shafran, "Overview of the CLEF-2006 Cross-Language Speech Retrieval Track," in Evaluation of Multilingual and Multi-modal Information Retrieval, Revised Selected Papers, CLEF-2006, Springer-Verlag, LNCS (4730), Alicante, Spain, 12 pages, 2006. (PDF)
  3. Ryen W. White, Douglas W. Oard, Gareth J.F. Jones, Dagobert Soergel and Xiaoli Huang, "Overview of the CLEF-2005 Cross-Language Speech Retrieval Track," in Multilingual Information Repositories, Revised Selected Papers, CLEF-2005, Springer-Verlag, LNCS (4022), Vienna, Austria, pp. 744-759, 2005. (PDF)

CLEF Interactive Track Overviews (2002-2004)

These are track overview papers that resulted from my work as a track coordinator in the Cross-Language Evaluation Forum (CLEF). These papers describe evaluation design issues for user-in-the-loop systems that are designed to support Multilingual Information Access (MLIA). My own research on information retrieval techniques using these evaluation designs can be found below in the MLIA section.
  1. Julio Gonzalo and Douglas W. Oard, "iCLEF 2004 Track Overview: Pilot Experiments in Interactive Cross-Language Question Answering," in Multilingual Information Access for Text, Speech and Images, Fifth Workshop of the Cross-Language Evaluation Forum, CLEF 2004, Revised Selected Papers Series, Springer-Verlag, LNCS (3491), Bath, UK, pp. 310-322, 2004. (PDF)
  2. Julio Gonzalo and Douglas W. Oard, "The CLEF-2003 Interactive Track," in Comparative Evaluation of Multilingual Information Access Systems, Fourth Workshop of the Cross-Language Evaluation Forum, Revised papers, Springer-Verlag LNCS (3237), Trondheim, Norway, 2003. (PDF)
  3. Douglas Oard and Julio Gonzalo, "The CLEF-2002 Interactive Track," in Advances in Cross-Language Information Retrieval Third Workshop of the Cross-Language Evaluation Forum, CLEF 2002, Revised papers, Springer-Verlag LNCS (2785), pp. 245-254, Rome, Italy, 2002. (PDF)
  4. Douglas W. Oard and Julio Gonzalo, "The CLEF 2001 Interactive Track," in Evaluation of Cross-Language Information Retrieval Systems, Second Workshop of the Cross-Language Evaluation Forum, CLEF 2001 Revised Papers, Springer-Verlag LNCS (2406), Darmstadt, Germany, pp. 308-319, 2001. (PDF)

TREC Arabic CLIR Track Overviews (2001-2002)

These are track overview papers and other papers that resulted from my work as a track coordinator in the Text Retrieval Conference (TREC). These papers describe evaluation design issues for information retrieval from Arabic, regardless of the query language. My own research on information retrieval techniques using these evaluation designs can be found below in the Multilingual Information Access section.
  1. Douglas W. Oard and Fredric C. Gey, "The TREC-2002 Arabic-English CLIR Track," in The Eleventh Text Retrieval Conference, Gaithersburg, MD, pp. 17-26, 2002. (PDF)
  2. Douglas W. Oard, Fredric C. Gey and Bonnie J. Dorr, "Evaluating Arabic Retrieval from English or French Queries," in LREC Workshop on Arabic Language Resources and Evaluation, Las Palmas, Spain, pp. 5-10, 2002. (PDF)
  3. Fredric C. Gey and Douglas W. Oard, "The TREC-2001 Cross-Language Information Retrieval Track: Searching Arabic using English, French or Arabic Queries," in The Tenth Text Retrieval Conference, pp. 114-121, Gaithersburg, MD, 2001. (PDF)
  4. Douglas W. Oard and Fredric C. Gey, "The TREC-2001 Arabic Information Retrieval Evaluation," in ACL Workshop on Arabic Language Processing, pp. 95-96, Toulouse, France, 2001. (PDF)

Other Evaluation Design

These papers report on evaluation design research conducted outside the scope of a shared-task evaluation that I helped to coordinate.
  1. Ning Gao, William Webber and Douglas W. Oard, "Reducing Reliance on Relevance Judgments for System Comparison by Using Expectation-Maximization," in Proceedings of the 36th European Conference on Information Retrieval, 12 pages, Amsterdam, The Netherlands, 2014. (PDF)
  2. Dina Demner-Fushman, Daqing He and Douglas W. Oard. "Exploring Interactive Relevance Feedback With a Two-Pass Study Design," Technical Report CS-TR-4621, University of Maryland Computer Science Department, 2004. (PDF)
  3. Bonnie Dorr, Christof Monz, Douglas Oard, David Zajic and Richard Schwartz. "Extrinsic Evaluation of Automatic Metrics for Summarization," Technical Report CS-TR-4610, University of Maryland Computer Science Department, 2004. (PDF)

Multilingual Information Access

These papers address the problem of finding documents that are written in one language (e.g., Chinese) using requests that are written in a different language (e.g., English). This problem is often referred to as "Cross-Language Information Retrieval" (CLIR), but Multilingual Information Access (MLIA) is a more inclusive term that better describes the scope of the work described here. Papers that address MLIA for spoken or scanned content can be found in those sections, interspersed with my other papers that address those topics. TREC and CLEF track overview papers that address evaluation design for some specific MLIA problems that have been the focus of international evaluation venues can be found in the evaluation design sections above.
  1. Sungho Kim, Youngjoong Ko and Douglas W. Oard, "Combining Lexical and Statistical Translation Evidence for Cross-Language Information Retrieval," to appear in Journal of the Association for Information Science and Technology (JASIST), 2014. (preprint: PDF)
  2. Yejun Wu and Douglas W. Oard, "English and Chinese Bilingual Topic Aspect Classification: Examining Similarity Measures, Optimal LSA Dimensions, and Centroid Correction of Translated Training Examples," in 76th Annual Conference of the American Society for Information Science and Technology, contributed paper, 12 pages, Montreal, Canada, 2013. (PDF)
  3. Ferhan Ture, Jimmy Lin and Douglas W. Oard, "Combining Statistical Translation Techniques for Cross-Language Information Retrieval," in 24th International Conference on Computational Linguistics, 17 pages, Mumbai, India, 2012. (PDF)
  4. Ferhan Ture, Douglas W. Oard and Philip Resnik, "Encouraging Consistent Translation Choices," in Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics, pp. 417-426, Montreal, Canada, 2012. (PDF)
  5. Ferhan Ture, Jimmy Lin and Douglas W. Oard, "Looking Inside the Box: Context-Sensitive Translation for Cross-Language Information Retrieval," in 35th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, 2 pages, Portland, OR, 2012. (PDF)
  6. Jianqiang Wang and Douglas W. Oard, "Matching Meaning for Cross-Language Information Retrieval." Information Processing and Management, 48(4)631-653, 2012. (PDF) (Publisher)
  7. Douglas W. Oard, Carl Madson, Joseph Olive, John McCary and Caitlin Christianson (eds.), "Operational Engines," in Joseph Olive, Caitlin Christianson and John McCary (eds.), Handbook of Natural Language Processing and Machine Translation: DARPA Global Autonomous Language Exploitation, pp. 845-932, Springer, 2011. (Publisher)
  8. Tan Xu and Douglas W. Oard, "FIRE-2008 at Maryland," in Working Notes of the Forum for Information Retrieval Evaluation, 12 pages, Kolkata, India, 2008. (PDF)
  9. Yejun Wu and Douglas W. Oard, "Bilingual Aspect Classification Based on Cross-language Text Classification," 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 203-210, Singapore, 2008. (PDF)
  10. Douglas W. Oard, Daqing He and Jianqiang Wang, "User-Assisted Query Translation for Cross-Language Information Retrieval," Information Processing and Management, 44(1)181-211, 2008. (PDF) (Publisher)
  11. Pengyi Zhang, Lynne Plettenberg, Judith Klavans, Douglas W. Oard and Dagobert Soergel, "Task-Based Interaction with an Integrated Multilingual Multimedia Information System: A Formative Evaluation," Joint Conference on Digital Libraries, pp. 117-126, Vancouver, BC, Canada, 2007. (PDF)
  12. Jianqiang Wang and Douglas W. Oard, "Combining Bidirectional Translation and Synonymy for Cross-Language Information Retrieval," in 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 202-209, Seattle, 2006. (PDF)
  13. Daqing He, Douglas W. Oard, and Lynne Plettenberg, "Studying the Use of Interactive Multilingual Information Retrieval," in ACM SIGIR Workshop on New Directions in Multilingual Information Access, Amsterdam, 5 pages, 2006. (PDF)
  14. Gina-Anne Levow, Douglas W. Oard and Philip Resnik, "Dictionary-Based Cross-Language Retrieval," Information Processing and Management, 41(3)523-547, 2005. (PDF)
  15. J. Scott Olsson, Douglas W. Oard and Jan Hajic, "Cross-Language Text Classification," in Proceedings of the 28th Annual ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 645-646, 2005. (PDF)
  16. Daqing He, Jianqiang Wang, Jun Luo and Douglas W. Oard, "iCLEF 2004 at Maryland: Summarization Design for Interactive Cross-Language Question Answering," in Multilingual Information Access for Text, Speech and Images, Fifth Workshop of the Cross-Language Evaluation Forum, CLEF 2004, Revised Selected Papers Series, Springer-Verlag, LNCS (3491), Bath, UK, pp. 340-362, 2004. (PDF)
  17. Douglas W. Oard, Julio Gonzalo, Mark Sanderson, Fernando Lopez-Ostenero and Jianqiang Wang, "Interactive Cross-Language Document Selection," Information Retrieval, 7(1-2)205-228, 2004. (PDF)
  18. H. Ma, D. Doermann, B. Karagol-Ayan, D. Oard, and J. Wang, "Parsing and Tagging of Bilingual Dictionaries," Traitement Automatique des Langues, 44(2)125-150, 2004. (PDF)
  19. Tamer Elsayed, Douglas W. Oard, David Doermann and Gary Kuhn, "TDT-2004: Adaptive Topic Tracking at Maryland," in Working Notes of the TDT-2004 Workshop, Gaithersburg, MD, 5 pages, 2004. (PDF)
  20. Kareem Darwish and Douglas W. Oard, "Probabilistic Structured Query Methods," in Twenty-Sixth International ACM-SIGIR Conference on Research and Development in Information Retrieval, pp. 338-344, Toronto, Canada, 2003. (PDF)
  21. Daqing He, Douglas W. Oard, Jianqiang Wang, Jun Luo, Dina Demner-Fushman, Kareem Darwish, Philip Resnik, Sanjeev Khudanpur, Michael Nossal, Michael Subotin and Anton Leuski, "Making MIRACLEs: Interactive Translingual Search for Cebuano and Hindi," ACM Transactions on Asian Language Information Processing, 2(3)219-244, 2003. (PDF), (Publisher)
  22. Dina Demner-Fushman and Douglas W. Oard, "The Effect of Bilingual Term List Size on Dictionary-Based Cross-Language Information Retrieval," in Hawaii International Conference on System Sciences, 10 pages, Kona, HI, 2003. (PDF)
  23. Douglas W. Oard and Franz Josef Och, "Rapid-Response Machine Translation for Unexpected Languages," 7 pages, Machine Translation Summit IX, New Orleans, 2003. (PDF)
  24. Douglas W. Oard, David Doermann, Bonnie Dorr, Daqing He, Philip Resnik, Amy Weinberg, William Byrne, Sanjeev Khudanpur, David Yarowsky, Anton Leuski, Philipp Koehn and Kevin Knight, "Desperately Seeking Cebuano," in Third Conference on Human Language Technologies, short paper (3 pages), Edmonton, Canada, 2003. (PDF)
  25. Abdessamad Echihabi, Douglas W. Oard, Daniel Marcu and Ulf Hermjakob, "Answering Spanish Questions from English Documents," in Comparative Evaluation of Multilingual Information Access Systems, Fourth Workshop of the Cross-Language Evaluation Forum, Revised papers, Springer-Verlag LNCS (3237), Trondheim, Norway, pp. 514-522, 2003. (PDF)
  26. Bonnie J. Dorr, Daqing He, Jun Luo, Douglas W. Oard, Richard Schwartz, Jianqiang Wang and David Zajic, "iCLEF-2003 at Maryland: Headline Generation and Interactive Query Formulation," in Comparative Evaluation of Multilingual Information Access Systems, Fourth Workshop of the Cross-Language Evaluation Forum, Revised papers, Springer-Verlag LNCS (3237), Trondheim, Norway, pp. 435-449, 2003. (PDF)
  27. Daqing He, Jianqiang Wang, Douglas W. Oard and Michael Nossal, "Comparing User-Assisted and Automatic Query Translation," in Advances in Cross-Language Information Retrieval Third Workshop of the Cross-Language Evaluation Forum, CLEF 2002, Revised papers, Springer-Verlag LNCS (2785), pp. 267-278, Rome, Italy, 2002. (PDF)
  28. Gina-Anne Levow and Douglas W. Oard, "Signal Boosting for Translingual Topic Tracking," in James Allen (ed.), Topic Detection and Tracking: Event-Based Information Organization, Chapter 9, pp. 175-195, Kluwer Academic, 2002. (PDF)
  29. Douglas W. Oard and Funda Ertunc, "Translation-Based Indexing for Cross-Language Information Retrieval," in 24th BCS-IRSG European Colloquium on IR Research, pp. 324-333, Glasgow, UK, 2002. (PDF)
  30. David Doermann, Huanfeng Ma, Burcu Karagol-Ayan and Douglas W. Oard, "Translation Lexicon Acquisition from Bilingual Dictionaries," in Proceedings of the Ninth SPIE Symposium on Document Recognition and Retrieval, pp. 37-48, San Jose, CA, 2002.
  31. Kareem Darwish and Douglas W. Oard, "CLIR Experiments at Maryland for TREC-2002: Evidence Combination for Arabic-English Retrieval," in The Eleventh Text Retrieval Conference, Gaithersburg, MD, pp. 703-711, 2002. (PDF)
  32. Daqing He, Hyuk Ro Park, G. Craig Murray, Michael Subotin and Douglas W. Oard, "TDT-2002: Topic Tracking at Maryland: First Experiments with the Lemur Toolkit," in Working Notes of the Topic Detection and Tracking Workshop, 7 pages (online proceedings), Gaithersburg, MD, 2002. (PDF)
  33. Kareem Darwish, David Doermann, Ryan Jones, Douglas Oard, and Mika Rautiainen, "TREC-10 Experiments at Maryland: CLIR and Video," in The Tenth Text Retrieval Conference, pp. 552-564, Gaithersburg, MD, 2001. (PDF)
  34. Gina Levow, Douglas Oard, Philip Resnik, and Clara Cabezas, "Rapidly Retargetable Interactive Translingual Retrieval," in Proceedings of the First International Conference on Human Language Technology, pp. 294-298, San Diego, 2001. (PDF)
  35. Philip Resnik, Douglas Oard and Gina Levow, "Improved Cross-Language Retrieval using Backoff Translation," in Proceedings of the First International Conference on Human Language Technology, pp. 153-155, San Diego, 2001. (PDF)
  36. Jianqiang Wang and Douglas W. Oard, "iCLEF 2001 at Maryland: Comparing Word-for-Word Gloss and MT," in Evaluation of Cross-Language Information Retrieval Systems, Second Workshop of the Cross-Language Evaluation Forum, CLEF 2001 Revised Papers, Springer-Verlag LNCS (2406), Darmstadt, Germany, pp. 336-354, 2001. (PDF)
  37. Douglas W. Oard and Jianqiang Wang, "NTCIR-2 ECIR Experiments at Maryland: Comparing Structured Queries and Balanced Translation," in Proceedings of the Second NTCIR Workshop on Evaluation of Japanese and Chinese Text Retrieval and Text Summarization, pp. 97-104, Tokyo, 2001. (PDF)
  38. Douglas W. Oard, "Evaluating Interactive Cross-Language Document Retrieval: Document selection," Proceedings of the First Cross-Language Evaluation Forum, pp. 57-71, Lisbon, 2000. (PDF)
  39. Douglas W. Oard, Gina-Anne Levow and Clara Cabezas, "CLEF Experiments at Maryland: Statistical stemming and backoff Translation," in Cross-Language Information Retrieval and Evaluation, Workshop of Cross-Language Evaluation Forum, CLEF 2000, Revised Papers Springer-Verlag, LNCS (2069), pp. 176-187, Lisbon, 2000. (PDF)
  40. Ruth Sperer and Douglas W. Oard, "Structured Translation for Cross-Language Information Retrieval," in Proceedings of the 23rd Annual ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 120-127, Athens, Greece, 2000. (PDF)
  41. Paul G. Hackett and Douglas W. Oard, "Comparison of Word-Based and Syllable-Based Retrieval for Tibetan," Poster paper, in Fifth International Workshop on Information retrieval with Asian Languages, pp. 197-198, Hong Kong, 2000. (PDF)
  42. Douglas W. Oard, Gina-Anne Levow and Clara Cabezas, "TREC-9 Experiments at Maryland: Interactive CLIR," in The Ninth Text Retrieval Conference, pp. 543-550, Gaithersburg, MD, 2000. (PDF)
  43. Gina-Anne Levow and Douglas W. Oard, "Translingual Topic Tracking: Applying Lessons from the MEI Project," in Working notes of the Topic Detection and Tracking Workshop, (5 pages), Gaithersburg, MD, 2000. (PDF)
  44. Gina-Anne Levow and Douglas W. Oard, "Translingual Topic Tracking With PRISE," in Working notes of the Topic Detection and Tracking Workshop, pp. 175-180, Tysons Corner, VA, 2000. (PDF)
  45. Douglas W. Oard and Jianqiang Wang, "NTCIR CLIR Experiments at the University of Maryland," in Proceedings of the First NTCIR Workshop on Research in Japanese Text Retrieval and Term Recognition, pp. 157-161, Tokyo, 1999. (PDF)
  46. Douglas W. Oard, Jianqiang Wang, Dekang Lin, and Ian Soboroff, "TREC-8 Experiments at Maryland: CLIR, QA and Routing," in The Eighth Text Retrieval Conference, pp. 623-636, Gaithersburg, MD, 1999. (PDF)
  47. Douglas W. Oard and Philip Resnik, "Support for Interactive Searching in Cross-Language Information Retrieval," Information Processing and Management, 35(3)363-379, 1999. (PDF) (Publisher)
  48. Gina-Anne Levow and Douglas W. Oard, "Evaluating Lexicon Coverage for Cross-Language Information Retrieval," in Proceedings of the Workshop on Multilingual Information Processing and Asian Language Processing, pp. 69-74, Beijing, 1999. (PDF)
  49. Douglas W. Oard and Jianqiang Wang, "Effects of Term Segmentation in Chinese/English Cross-Language Information Retrieval," in Proceedings of the Symposium on String Processing and Information Retrieval, pp. 149-157, Cancun, Mexico, 1999. (PDF)
  50. Douglas W. Oard, "Topic Tracking with the PRISE Information Retrieval System," in Proceedings of the DARPA Broadcast News Workshop, pp. 209-211, Reston, VA, 1999. (PDF)
  51. Douglas W. Oard, "Resources for Chinese/English Cross-Language IR," 25 pp., University of Maryland, 1999. (PDF) [the greek letters are misrendered versions of unfilled, half-filled, and completely filled circles that broke when Microsoft updated the character set for Word]
  52. Douglas W. Oard, "A Comparative Study of Query and Document Translation for Cross-Language Information Retrieval," in Proceedings of the Third Conference of the Association for Machine Translation in the Americas, pp. 472-483, Philadelphia, PA, 1998. (PDF)
  53. Bonnie J. Dorr and Douglas W. Oard, "Evaluating Resources for Query Translation in Cross-Language Information Retrieval," in Proceedings of the First International Conference on Language Resource Evaluation, Volume II, pp. 759-764, Granada, Spain, 1998. (PDF)
  54. Douglas W. Oard and Bonnie J. Dorr, "Evaluating Cross-Language Text Filtering Effectiveness," in Gregory Grefenstette (ed.), Cross-Language Information Retrieval, Chapter 12, pp. 151-161, Kluwer Academic, 1998. [This is essentially the same as the SIGIR 96 workshop paper below.]
  55. Douglas W. Oard, "TREC-7 Experiments at the University of Maryland," in The Seventh Text Retrieval Conference, pp. 541-545, Gaithersburg, MD, 1998. (PDF)
  56. Douglas W. Oard and Paul Hackett, "Document Translation for Cross-Language Text Retrieval at the University of Maryland," in The Sixth Text Retrieval Conference, pp. 687-696, Gaithersburg MD, 1997. (PDF)
  57. Douglas W. Oard, "Adaptive Filtering of Multilingual Document Streams," in Fifth RIAO Conference on Computer Assisted Information Searching on the Internet, Volume 1, pp. 233-254, Montreal, Canada, 1997. (PDF)
  58. Douglas W. Oard, "Alignment of Spanish and English TREC Topic Descriptions," in The Fifth Text Retrieval Conference, pp. 547-553, Gaithersburg MD, 1996. (PDF)
  59. Douglas W. Oard, "Adaptive Vector Space Text Filtering for Monolingual and Cross-Language Applications," Ph.D. Dissertation, University of Maryland, College Park, 1996. (PDF)
  60. Douglas W. Oard and Bonnie J. Dorr, "Evaluating Cross-Language Text Filtering Effectiveness," in Proceedings of Cross-Linguistic Multilingual Information Retrieval Workshop, ACM SIGIR Conference, pp. 8-14, Zurich, 1996. (PDF)
  61. Douglas W. Oard, Nicholas DeClaris, Bonnie J. Dorr and Christos Faloutsos, "On Automatic Filtering of Multilingual Texts," Proceedings of IEEE International Conference on Systems, Man and Cybernetics, pp. 1645-1650, San Antonio, TX, 1994. (PDF)

Speech Retrieval

These papers address techniques for searching spoken content based on written or spoken queries.
  1. Jerome White, Douglas W. Oard, Nitendra Rajput and Marion Zalk, "Simulating Early-Termination Search for Verbose Spoken Queries," Empirical Methods in Natural Language Processing, 11 pages, Seattle, WA, 2013. (PDF)
  2. Abhijeet Sangwan, Lakshmish Kaushik, Chengzhu Yu, John H.L. Hansen and Douglas W. Oard, "Houston, We Have a Solution: Using NASA Apollo Program to Advance Speech and Language Processing Technology," INTERSPEECH, pp. 1135-1139, Lyon, France, 2013. (PDF)
  3. Douglas W. Oard, Abhijeet Sangwan and John H.L. Hansen, "Reconstruction of Apollo Mission Control Center Activity," in SIGIR Workshop on Exploration, Navigation and Retrieval of Information in Cultural Heritage (ENRICH), 4 pages, Dublin, Ireland, 2013. (PDF)
  4. Joseph Malionek, Douglas W. Oard, John Hansen and Abhijeet Sangwan, "Linking Transcribed Conversational Speech," in 36th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, 4 pages, Dublin, Ireland, 2013. (PDF)
  5. Douglas W. Oard, "Query By Babbling: A Research Agenda," in CIKM Workshop on Information and Knowledge Management for Developing Regions, 5 pages, Maui, HI, 2012. (PDF)
  6. J. Scott Olsson and Douglas W. Oard, "Combining Evidence from LVCSR and Ranked Utterance Retrieval for Robust Domain-Specific Ranked Retrieval," Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, Boston, 2009. (PDF)
  7. J. Scott Olsson and Douglas W. Oard, "Phrase-Based Query Degradation Modeling for Vocabulary-Independent Ranked Utterance Retrieval," Proceedings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics Human Language Technology Conference, Boulder, 2009. (PDF)
  8. J. Scott Olsson and Douglas W. Oard, "Combining Speech Retrieval Results with Generalized Additive Models," Association for Computational Linguistics-Human Language Technology Conference, pp. 461-469, Columbus, OH, 2008. (PDF)
  9. J. Scott Olsson and Douglas W. Oard, "Improving Text Classification for Oral History Archives with Temporal Domain Knowledge," 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 623-630, Amsterdam, 2007. (PDF)
  10. Pavel Ircing, Douglas W. Oard and Jan Hoideker, "First Experiments Searching Spontaneous Czech Speech," in 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Amsterdam, pp. 835-836, 2007. (PDF)
  11. Baolong Liu and Douglas W. Oard, "One-Sided Measures for Evaluating Ranked Retrieval Effectiveness with Spontaneous Conversational Speech," in 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Seattle, pp. 673-674, 2006. (PDF)
  12. Diana Inkpen, Muath Alzghool, Gareth J.F. Jones and Douglas W. Oard, "Investigating Cross-Language Speech Retrieval for a Spontaneous Conversational Speech Collection," Conference on Human Language Technologies and the North American Chapter of the Association for Computational Linguistics, 4 pages, New York, 2006. (PDF)
  13. Jianqiang Wang and Douglas W. Oard, "CLEF 2006 CL-SR at Maryland: English and Czech," in Evaluation of Multilingual and Multi-modal Information Retrieval, Revised Selected Papers, CLEF-2006, Springer-Verlag, LNCS (4730), Alicante, Spain, 7 pages, 2006. (PDF)
  14. Jianqiang Wang and Douglas W. Oard, "CLEF 2005 CL-SR at Maryland: Document and Query Expansion using Side Collections and Thesauri," in Multilingual Information Repositories, Revised Selected Papers, CLEF-2005, Springer-Verlag, LNCS (4022), Vienna, Austria, pp. 800-809, 2005. (PDF)
  15. William Byrne, David Doermann, Martin Franz, Samuel Gustman, Jan Hajic, Douglas Oard, Michael Picheny, Josef Psutka, Bhuvana Ramabhadran, Dagobert Soergel, Todd Ward and Wei-Jing Zhu, "Automated Recognition of Spontaneous Speech for Access to Multilingual Oral History Archives," IEEE Transactions on Speech and Audio Processing, 12(4)420-435, 2004. (PDF) (Publisher)
  16. Douglas W. Oard, Dagobert Soergel, Craig Murray, David Doermann, Jianqiang Wang, Bhuvana Ramabhadran, Martin Franz, James Mayfield, Samuel Gustman, and Stephanie Strassel, "Building an Information Retrieval Test Collection for Spontaneous Conversational Speech," Twenty-Seventh ACM-SIGIR Conference on Research and Development in Information Retrieval, pp. 41-48, Sheffield, UK, 2004. (PDF)
  17. Helen Meng, Berlin Chen, Erika Grams, Sanjeev Khudanpur, Gina-Anne Levow, Wai-Kit Lo, Douglas W. Oard, Karen Tang, Hsin-Min Wang and Jianqiang Wang, "Mandarin-English Information (MEI): Investigating Translingual Speech Retrieval," Computer Speech and Language, 18(2)163-179, 2004. (PDF) (Publisher)
  18. Sudeep Gandhe, Andrew Gordon, Anton Leuski, David R. Traum and Douglas W. Oard, "First Steps Towards Linking Dialogues: Mediating Between Free-Text Questions and Pre-recorded Video Answers," in 24th Army Science Conference, Orlando, FL, 8 pages, 2004. (PDF)
  19. Jinmook Kim, Douglas W. Oard and Dagobert Soergel, "Searching Large Collections of Recorded Speech: A Preliminary Study," in Annual Conference of the American Society for Information Science and Technology, Long Beach, CA, pp. 330-339, 2003. (PDF)
  20. Douglas W. Oard and Anton Leuski, "Searching Recorded Speech Based on the Temporal Extent of Topic Labels," in AAAI Spring Symposium on Intelligent Multimedia Knowledge Management, Palo Alto, CA, 5 pages, 2003. (PDF)
  21. Jinmook Kim, Dagobert Soergel and Douglas W. Oard, "MALACH Workshop 2: Final Report," 62 pp., 2003.
  22. Samuel Gustman, Dagobert Soergel, Douglas Oard, William Byrne, Michael Picheny, Bhuvana Ramabhadran and Douglas Greenberg, "Supporting Access to Large Digital Oral History Archives," in Second Joint Conference on Digital Libraries, pp. 18-27, Portland, OR, 2002. (PDF)
  23. Douglas W. Oard, Dina Demner-Fushman, Jan Hajic, Bhuvana Ramabhadran, Samuel Gustman, William J. Byrne, Dagobert Soergel, Bonnie Dorr, Philip Resnik and Michael Picheny, "Cross-Language Access to Recorded Speech in the MALACH Project," in Fifth International Conference on Text, Speech and Dialog, pp. 57-64, Brno, Czech Republic, 2002. (PDF)
  24. Jinmook Kim and Douglas W. Oard, "The Use of Speech Retrieval Systems: A Study Design," in ACM SIGIR Workshop on IR Techniques for Speech Applications, New Orleans, pp. 86-93, 2001. (PDF)
  25. Helen Meng, Berlin Chen, Erika Grams, Sanjeev Khudanpur, Gina-Anne Levow, Wai-Kit Lo, Douglas W. Oard, Karen Tang, Hsin-Min Wang and Jianqiang Wang, "Mandarin-English Information (MEI): Investigating Translingual Speech Retrieval," in Proceedings of the First International Conference on Human Language Technology, pp. 239-245, San Diego, 2001. (PDF)
  26. Helen Meng, Sanjeev Khudanpur, Gina-Anne Levow, Douglas W. Oard, and Hsin-Min Wang, "Mandarin-English Information (MEI): Investigating Translingual Speech Retrieval," in NAACL Workshop on Embedded Machine Translation, pp. 23-30, Seattle, WA, 2000. (PDF)
  27. Douglas W. Oard, "User Interface Design for Speech-Based Retrieval," Bulletin of the American Society for Information Science, vol. 26, no. 5, pp. 20-22, June/July, 2000. (Publisher)
  28. Helen Meng, Sanjeev Khudanpur, Douglas W. Oard, and Hsin-Min Wang, "Mandarin-English Information (MEI)," in Working notes of the Topic Detection and Tracking Workshop, pp. 117-121, Tysons Corner, VA, 2000. (PDF)
  29. Laura Slaughter, Douglas W. Oard, Vernon Warnick, Galen Wilkerson and Julie Harding, "A Graphical Interface for Speech-Based Retrieval," in Proceedings of the Third ACM Conference on Digital Libraries, pp. 305-306, Pittsburgh, PA, 1998. (PDF)

Search and Sense-making in Email Collections

These papers address techniques for helping people find things in large collections of electronic mail that are not their own. I do not work on the counterpart problem of Personal Information Management, in which tools are built to help people better manage their own email collections. Papers reporting on evaluation design for email search in the TREC Legal Track can also be found in the TREC Legal Track Overview section.
  1. Tan Xu and Douglas W. Oard, "Exploring Example-Based Person Search in Email," in 35th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, 2 pages, Portland, OR, 2012. (PDF)
  2. Hyunmo Kang, Catherine Plaisant, Tamer Elsayed, and Douglas W. Oard, "Making Sense of Archived Email: Exploring the Enron Collection with NetLens," Journal of the American Society for Information Science and Technology, 61(4)723-744, 2010. (PDF) (Publisher)
  3. Tamer Elsayed, Douglas W. Oard, and Galileo Namata, "Resolving Personal Names in Email Using Context Expansion," Association for Computational Linguistics-Human Language Technology Conference, pp. 941-949, Columbus, OH, 2008. (PDF)
  4. Adam Perer, Ben Shneiderman, and Douglas W. Oard, "Using Rhythms of Relationships to Understand Email Archives," Journal of the American Society for Information Science and Technology, 57(14)1936-1948, 2006. (PDF) (Publisher)
  5. Yejun Wu, Douglas W. Oard and Ian Soboroff, "An Exploratory Study of the W3C Mailing List Test Collection for Retrieval of Emails with Pro and/or Con arguments," in Third Conference on Email and Anti-Spam, 10 pages, Mountain View, CA, 2006. (PDF)
  6. Tamer Elsayed and Douglas W. Oard, "Modeling Identity in Archival Collections of Email: A Preliminary Study," in Conference on Email and Anti-Spam, 9 pages, Mountain View, CA, 2006. (PDF)
  7. Yejun Wu and Douglas W. Oard, "Indexing Emails and Email Threads for Retrieval," in Proceedings of the 28th Annual ACM SIGIR Conference on Research and Development in Information Retrieval, poster paper, pp. 665-666, 2005. (PDF)
  8. Jimmy Lin, Eileen Abels, Dina Demner-Fushman, Douglas W. Oard, Philip Wu, and Yejun Wu, "A Menagerie of Tracks at Maryland: HARD, Enterprise, QA, and Genomics, Oh My!," in The Fourteenth Text Retrieval Conference, Gaithersburg, MD, 16 pages, 2005. (PDF)
  9. Anton Leuski, Douglas W. Oard and Rahul Bhagat, "eArchivarius: Accessing Collections of Electronic Mail," in Twenty-Sixth International ACM-SIGIR Conference on Research and Development in Information Retrieval, description of system demonstration, p. 468, Toronto, Canada, 2003. (PDF)

Search and Sense-making in Text Chat

This is a research area on which I will likely publish more in the future.
  1. Lidan Wang and Douglas W. Oard, "Context-based Message Expansion for Disentanglement of Interleaved Text Conversations," Proceedings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics Human Language Technology Conference, Boulder, 2009. (PDF)

E-Discovery

  1. Mossaab Bagdouri, William Webber, David D. Lewis and Douglas W. Oard, "Towards Minimizing the Annotation Cost of Certified Text Classification," in ACM Conference on Information and Knowledge Management, 10 pages, San Francisco, CA, 2013. (PDF)
  2. William Webber, Mossaab Bagdouri, David D. Lewis and Douglas W. Oard, "Sequential Testing in Classifier Evaluation Yields Biased Estimates of Effectiveness," in 36th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, 4 pages, Dublin, Ireland, 2013. (PDF)
  3. Feng Charlie Zhao, Douglas W. Oard and Jason R Baron, "Improving Search Effectiveness in the Legal E-Discovery Process Using Relevance Feedback," in Third International Workshop on Discovery of Electronically Stored Information (DESI III), 10 pages, Barcelona, Spain, 2009. (PDF)

Document Image Retrieval

These papers address techniques for searching scanned documents. Papers reporting on the evaluation design for the TREC Legal Track, which included scanned documents, can be found in the evaluation design section above.
  1. Rajiv Jain, Douglas W. Oard and David Doermann, "Scalable Ranked Retrieval Using Document Images," in 21st SPIE Document Recognition and Retrieval Conference, 15 pages, San Francisco, CA, 2014. (PDF)
  2. Utpal Garain, Arjun Das, David Doermann and Douglas Oard, "Leveraging Statistical Transliteration for Dictionary-Based English-Bengali CLIR of OCR'd Text," in 24th International Conference on Computational Linguistics, 9 pages, Mumbai, India, 2012. (PDF)
  3. Lidan Wang and Douglas W. Oard, "Query Expansion for Noisy Legal Documents," in The Sixteenth Text Retrieval Conference, 9 pages, Gaithersburg, MD, 2008.
  4. Douglas Oard, Tamer Elsayed, Jianqiang Wang, Yejun Wu, Pengyi Zhang, Eileen Abels, Jimmy Lin and Dagobert Soergel, "TREC-2006 at Maryland: Blog, Enterprise, Legal and QA Tracks," in The Fifteenth Text Retrieval Conference, 16 pages, Gaithersburg, MD, 2006. (PDF)
  5. Kareem Darwish and Douglas W. Oard, "Balanced Query Methods for OCR-Based Retrieval," 2003 Symposium on Document Image Understanding Technology, Greenbelt, MD, 2003. (PDF)
  6. Kareem Darwish and Douglas W. Oard, "Term Selection for Searching Printed Arabic," in Twenty-Fifth International ACM-SIGIR Conference on Research and Development in Information Retrieval, Tampere, Finland, pp. 261-268, 2002. (PDF)
  7. Yuen-Hsien Tseng and Douglas W. Oard, "Document Image Retrieval Techniques for Chinese," 2001 Symposium on Document Image Understanding Technology, pp. 151-158, Columbia, MD, 2001. (PDF)
  8. Douglas W. Oard, "Issues in Cross-Language Retrieval from Document Image Collections," 1999 Symposium on Document Image Understanding Technology, pp. 229-234, Annapolis, 1999. (PDF)

Computational Social Science

These papers involve the application of computational techniques to foster social science research. Many of my other papers also address issues that have potential application to social science research; what distinguishes these papers is that supporting social science research was the principal motivation for this work.
  1. Yasuhiro Takayama, Yoichi Tomiura, Emi Ishita, Zheng Wang, Douglas Oard, Kenneth Fleischmann and An-Shou Cheng, "Improving Automatic Sentence-Level Annotation of Human Values Using Augmented Feature Vectors," in Conference of the Pacific Association for Computational Linguistics, 6 pages, Tokyo, Japan, 2013. (PDF)
  2. An-Shou Cheng, Kenneth R. Fleischmann, Ping Wang, Emi Ishita, and Douglas W. Oard, "The Role of Innovation and Wealth in the Net Neutrality Debate: A Content Analysis of Human Values in Congressional and FCC Hearings," Journal of the American Society for Information Science and Technology (JASIST), 63(7)1360-1373, 2012. (PDF) (Publisher)
  3. Emi Ishita, Douglas W. Oard, Kenneth R. Fleischmann, An-Shou Cheng and Thomas Clay Templeton, "Investigating Multi-Label Sentence Classification for Human Values," Annual Conference of the American Society for Information Science and Technology, 4 pages, Pittsburgh, PA, 2010. (PDF)
  4. An-Shou Cheng, Kenneth R. Fleischmann, Ping Wang, Emi Ishita and Douglas W. Oard, "Values of Stakeholders in the Net Neutrality Debate: Applying Content Analysis to Telecommunications Policy," in Hawaii International Conference on System Sciences, 10 pages, Kauai, HI, 2010. (PDF)
  5. Chia-Jung Tsui, Ping Wang, Kenneth R. Fleischmann, Douglas W. Oard and Asad B. Sayeed, "Exploring the Relationships among ICTs: A Scalable Computational Approach Using KL Divergence and Hierarchical Clustering," in Hawaii International Conference on System Sciences, 10 pages, Kauai, HI, 2010. (PDF)
  6. Emi Ishita, An-Shou Cheng, Douglas W. Oard and Kenneth R. Fleischmann, "Multi-label Classification for Human Values" (in Japanese), in Annual Conference of the Japan Society of Library and Information Science, 4 pages, Tokyo, Japan, 2009. (PDF)
  7. Chia-Jung Tsui, Ping Wang, Kenneth R. Fleischmann, Douglas W. Oard and Asad B. Sayeed, "Understanding IT Innovations through Computational Analysis of Discourse," in International Conference on Information Systems, 9 pages, Phoenix, AZ, 2009. (PDF)
  8. Kenneth R. Fleischmann, Douglas W. Oard, An-Shou Cheng, Ping Wang, and Emi Ishita, "Automatic Classification of Human Values: Applying Computational Thinking to Information Ethics," Annual Conference of the Association for Information Science and Technology, Vancouver, 2009. (Publisher)
  9. Ping Wang, Chia-Jung Tsui, Kenneth R. Fleischmann, Douglas W. Oard and Lidan Wang, "Understanding IT Innovations Through Discourse Analysis," Fourth iSchools Conference, 3 pages, Chapel Hill, 2009. (PDF)
  10. An-Shou Cheng, Kenneth R. Fleischmann, Ping Wang and Douglas W. Oard, "Advancing Social Science Research by Applying Computational Linguistics," in Proceedings of the Annual Conference of the American Society for Information Science and Technology, 12 pages, Columbus, 2008. (PDF)

Information Integration

These papers address issues that involve structured representation of information found in (or that can be inferred from) unstructured documents. This includes my work on the narrower problems of information extraction, co-reference resolution, and text classification. My principal interest is in how these techniques can be employed in integrated systems that are designed to satisfy specific types of information needs.
  1. Dawn Lawrie, James Mayfield, Paul McNamee and Douglas W. Oard, "Cross-Language Person-Entity Linking from Twenty Languages," Journal of the Association for Information Science and Technology (JASIST), to appear in 2014. (preprint: PDF)
  2. Douglas W. Oard, Min Wu, Kari Kraus, Adi Haij-ahmad, Hui Su and Ravi Garg, "It's About Time: Projecting Temporal Metadata for Historically Significant Recordings," 7 pages, iConference, Berlin, Germany, 2014. (PDF)
  3. Paul McNamee, James Mayfield, Tim Finin, Tim Oates, Dawn Lawrie, Tan Xu and Douglas Oard, "KELVIN: A Tool for Automated Knowledge Base Construction," in Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 4 page demonstration paper, Atlanta, GA, 2013. (PDF)
  4. Paul McNamee, Veselin Stoyanov, James Mayfield, Tim Finin, Tim Oates, Tan Xu, Douglas W. Oard and Dawn Lawrie, "HLTCOE Participation at TAC 2012: Entity Linking and Cold Start Knowledge Base Construction," in Proceedings of the Text Analysis Conference, 11 pages, Gaithersburg, MD, 2012. (PDF)
  5. Dawn Lawrie, James Mayfield, Paul McNamee and Douglas Oard, "Creating and Curating a Cross-Language Entity Linking Collection," 8th International Conference on Language Resources and Evaluation, 5 pages, Istanbul, Turkey, 2012. (PDF)
  6. Paul McNamee, James Mayfield, Douglas W. Oard, Tan Xu, Wu Ke, Veselin Stoyanov and David Doermann, "Cross-Language Entity Linking in Maryland During a Hurricane," in Proceedings of the Text Analysis Conference, 11 pages, Gaithersburg, MD, 2011. (PDF)
  7. Jun Gong, Lidan Wang and Douglas W. Oard, "Matching Person Names Through Name Transformation," in ACM Conference on Information and Knowledge Management, 4 pages, Hong Kong, China, 2009. (PDF)
  8. Yejun Wu and Douglas W. Oard, "Beyond Topicality, Finding Opinionated Documents," Annual Conference of the Association for Information Science and Technology, Vancouver, 2009. (PDF)
  9. Jun Gong and Douglas W. Oard, "Selecting Hierarchical Clustering Cut Points for Web Person-Name Disambiguation," Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, Boston, 2009. (PDF)
  10. Asad Sayeed, Tamer Elsayed, Nikesh Garera, David Alexander, Tan Xu, Douglas W. Oard, David Yarowsky and Christine Piatko, "Arabic Cross-Document Coreference Resolution," Annual Conference of the Association for Computational Linguistics / International Joint Conference on Natural Language Processing, pp. 357-360, Singapore, 2009. (PDF)
  11. James Mayfield, David Alexander, Bonnie Dorr, Jason Eisner, Tamer Elsayed, Tim Finin, Clay Fink, Marjorie Freedman, Nikesh Garera, Paul McNamee, Saif Mohammad, Douglas W. Oard, Christine Piatko, Asad Sayeed, Zarem Syed, Ralph Weischedel, Tan Xu and David Yarowsky, "Cross-Document Coreference Resolution: A Key Technology for Learning by Reading," AAAI Spring Symposium on Learning by Reading and Learning to Read, 6 pages, Stanford, 2009. (PDF)
  12. James Mayfield, Bonnie J. Dorr, Tim Finin, Douglas W. Oard and Christine Piatko, "Knowledge Base Evaluation for Semantic Knowledge Discovery," in Symposium on Syntactic Knowledge Discovery, Organization and Use, New York, 2 pages, 2008. (PDF)
  13. Tan Xu, Douglas W. Oard, Tamer Elsayed and Asad Sayeed, "Knowledge Representation from Information Extraction," Joint Conference on Digital Libraries, Pittsburgh, p. 475, 2008. (PDF)
  14. Yejun Wu and Douglas W. Oard, "NTCIR-6 at Maryland: Chinese Opinion Analysis Pilot Task," in Proceedings of the Sixth NTCIR Workshop, Tokyo, 6 pages, 2007. (PDF)
  15. J. Scott Olsson and Douglas W. Oard, "Evaluating Feature Selection Combination Methods for Automatic Text Classification," in Conference on Information and Knowledge Management, Arlington, VA, pp. 798-799, 2006. (PDF)
  16. Douglas W. Oard, "Integration of Natural Language with Structured Data: Three Test Collections," Information Integration Workshop, Philadelphia, 2 pages, 2006. (PDF)
  17. Dina Demner-Fushman, Philip Resnik and Douglas W. Oard, "Genomic Entity Recognition at TREC," JCDL TREC Genomics Pre-Track Workshop, Portland, 2002. (PDF)
  18. Paul Losiewicz, Douglas W. Oard and Ronald N. Kostoff, "Textual Data Mining to Support Science and Technology Management," Journal of Intelligent Information Systems, 15(2)99-119, 2000. (PDF)

Recommender Systems

These papers address techniques for recommending new content to users based on learned representations of the stable interests of those users. The term "recommender systems" is used expansively here to include both content-based and behavior-based systems, and systems that rely on either explicit or implicit feedback from the user.
  1. Melanie Gnasa, Armin B. Cremers and Douglas W. Oard, "ISKADOR: Unified User Modeling for Integrated Searching," in 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Amsterdam, p. 898, 2007. (PDF)
  2. Penelope Brooks, Khoo Yit Phang, Douglas W. Oard, Ryen W. White, Rachael Bradley, and Francois Guimbretiere, "Measuring the Utility of Gaze Detection for Task Modeling: A Preliminary Study," in IUI-2006 Workshop on Intelligent User Interfaces for Intelligence Analysis, Sydney, Australia, 4 pages, 2006. (PDF)
  3. Tamer Elsayed and Douglas W. Oard, "On Evaluation of Adaptive Topic Tracking Systems," in Proceedings of the 28th Annual ACM SIGIR Conference on Research and Development in Information Retrieval, poster paper, pp. 597-598, 2005. (PDF)
  4. Douglas W. Oard, Anton Leuski and Stuart Stubblebine, "Protecting the Privacy of Observable Behavior in Distributed Recommender Systems," ACM SIGIR Workshop on Implicit Methods, Toronto, Canada, 4 pages, 2003. (PDF)
  5. Jinmook Kim and Douglas W. Oard, "Observable Behavior for Implicit User Modeling: A Framework for User Studies," in Journal of the Korean Society for Library and Information Science, volume 35, pp. 173-189, 2001. (PDF)
  6. Douglas W. Oard and Jinmook Kim, "Modeling Information Content Using Observable Behavior," in Proceedings of the 64th Annual Conference of the American Society for Information Science and Technology, pp. 481-488, Washington, 2001. (PDF)
  7. Jinmook Kim, Douglas W. Oard and Kathleen Romanik, "User Modeling for Information Access Based on Implicit Feedback," in Third ISKO Workshop on Information Filtering, pp. 25-37, Paris, 2001. (PDF)
  8. Jinmook Kim, Douglas W. Oard and Kathleen Romanik, "Using Implicit Feedback for User Modeling in Internet and Intranet Searching," University of Maryland CLIS Technical Report 00-01, 2000. (PDF)
  9. Douglas W. Oard and Jinmook Kim, "Implicit Feedback for Recommender Systems," in AAAI Workshop on Recommender Systems, pp. 81-83, Madison, WI, 1998. (PDF)
  10. Douglas W. Oard, Nicholas DeClaris, Bonnie J. Dorr, and Christos Faloutsos, "High Performance Cognitive and Interactive Text Filtering," Proceedings of IEEE International Conference on Systems, Man, and Cybernetics, Volume V, pp. 4398-4403, Vancouver, Canada, 1995. (PDF)

Other Topics

Papers on topics that are new to me will initially show up in this category, and then ultimately perhaps become the anchor of a category of their own.
  1. Katie Shilton, Michael Kurtz, Bruce Ambacher, Erik Mitchell, Douglas Oard and Ann Weeks, "Bridging By Design: The Curation and Management of Digital Assets Specialization at the University of Maryland," in Proceedings of the Framing the Digital Curation Curriculum Conference (DigCurV), 5 pages, Florence, Italy, 2013. (PDF)
  2. Tan Xu, Paul McNamee and Douglas W. Oard, "HLTCOE at TREC 2013: Temporal Submission," in The Twenty-Second Text Retrieval Conference, 8 pages, Gaithersburg, MD, 2013. (PDF)
  3. Douglas W. Oard and Noriko Kando, "Extrinsic Evaluation of Patent MT," in Fifth International Workshop on Evaluating Information Access, 5 pages, Tokyo, Japan, 2013. (PDF)
  4. Keith C. Walker and Douglas W. Oard, "Extending Argument Maps to Provide Decision Support for Rulemaking," in Hawaii International Conference on System Sciences, 10 pages, Maui, HI, 2013. (PDF)
  5. Amalia S. Levi and Douglas W. Oard, "From Personal Narratives to Collective Memory: Spinning a Web from Oral History," in XVII International Oral History Association Conference, 31 pages, Buenos Aires, Argentina, 2012. (PDF)
  6. Pengyi Zhang, Dagobert Soergel, Judith L. Klavans and Douglas W. Oard, "Extending Sense-Making Models with Ideas from Cognition and Learning Theories," in Proceedings of the Annual Conference of the American Society for Information Science and Technology, 12 pages, Columbus, 2008. (PDF)
  7. Tamer Elsayed, Jimmy Lin and Douglas W. Oard, "Pairwise Document Similarity for Large Collections with MapReduce," Annual Conference of the Association for Computational Linguistics-Human Language Technology Conference, Columbus, OH, companion volume, pp. 265-268, 2008. (PDF)
  8. Ashwin Swaminathan, Yinian Mao, Guan-Ming Su, Hongmei Gou, Avinash L Varna, Shan He, Min Wu and Douglas W. Oard, "Confidentiality-Preserving Rank-Ordered Search," ACM Workshop on Storage, Security and Survivability, Alexandria, VA, 6 pages, 2007. (PDF)
  9. Kareem Darwish and Douglas W. Oard, "Adapting Morphology for Arabic Information Retrieval," in Abdelhadi Soudi, Gunter Neumann and Antal Van den Bosch (eds.), Arabic Computational Morphology: Knowledge-based and Empirical Methods, Kluwer/Springer Series on Text, Speech, and Language Technology, 2006. (PDF) (Publisher)
  10. Wilma Bainbridge, Douglas W. Oard and Ryen White, "An Interface to Search Human Movements Based on Geographic and Chronological Metadata," in Proceedings of the 28th Annual ACM SIGIR Conference on Research and Development in Information Retrieval, poster paper, pp. 579-580, 2005. (PDF)
  11. Daqing He, Dina Demner-Fushman, Douglas W. Oard, Damianos Karakos, and Sanjeev Khudanpur, "Improving Passage Retrieval Using Interactive Elicitation and Statistical Modeling," in The Thirteenth Text Retrieval Conference, Gaithersburg, MD, 8 pages, 2004. (PDF)
  12. Douglas W. Oard, Sheldon Wolk and Anthony Ephremides, "On The Integrated Scheduling of Hardkill and Softkill Assets Using Dynamic Programming," Naval Research Laboratory, 1994. (PDF)

Project Pages

When research projects create a project specific page, I will generally include a link here. Some very old projects are not included.
  1. E-Discovery
  2. Oral History in the Digital Age
  3. PopIT
  4. JIKD
  5. MALACH
  6. US/EU Digital Library Spoken Word Archive Group

Edited Works

    ACM TALIP Special Issue on the TIDES Surprise Language
    A pair of special issues (June and September 2003) of the ACM Transactions on Asian Language Information Processing that I edited. Membership in the ACM Digital Library is needed to access the articles.
    Team TIDES Newsletter
    The newsletter for the DARPA Translingual Information Detection Extraction and Summarization (TIDES) program. I edited the first two (December 2002 and April 2003) and helped out with the third (October 2003). The April 2003 and October 2003 issues contain articles that I wrote about the surprise language exercises.

Workshop Pages

These pages provide access to resources (e.g., papers) that were assembled for workshops and evaluation campaigns that I helped to organize.
  1. FIRE 2013 Question Answering for the Spoken Web (QASW) track
  2. ICAIL 2013 Workshop on Discovery of Electronically Stored Information
  3. AAAI-2011 Workshop on Analyzing Microtext
  4. SIGIR 2011 Information Retrieval for E-Discovery Workshop
  5. ICAIL 2011 Workshop on Discovery of Electronically Stored Information
  6. First DC-area IR Experts (DIRE) Meeting
  7. TREC Legal Track
  8. Second International Workshop on Supporting Search and Sense-making for Electronically Stored Information in Discovery Proceedings (DESI II)
  9. SIGIR 2007 Workshop on Searching Spontaneous Conversational Speech
  10. ICAIL 2007 Workshop on Discovery of Electronically Stored Information (DESI I)
  11. HLT 2004 Workshop on Interdisciplinary Approaches to Speech Indexing and Retrieval
  12. CLEF Interactive Track (iCLEF)
  13. TREC-2002 Arabic/English CLIR Track (TREC-2001 also available)
  14. 2001 Workshop on Evaluation of Interactive Cross-Language Retrieval
  15. Summer 2000 Johns Hopkins Workshop on Mandarin-English Information (MEI)
  16. 2000 Workshop on Interactive Searching of Foreign Language Collections
  17. 1999 Joint ACM Digital Library/SIGIR Workshop on Multilingual Information Discovery and AccesS
  18. AAAI Spring 1997 Symposium on Cross-Language Text and Speech Retrieval

Research Software

Some software that I have developed for my research projects can be downloaded from a page that describes the available files. All of this is now quite old.

Research Directories

Community-wide resources on subjects that have been of interest to me. These pages are not actively maintained, so they are best thought of as a snapshot of what a field looked like near the time I first built them.
Last modified: Tue Feb 25 13:55:59 2014
Doug Oard oard@umd.edu