Doug Oard's Research Page
This page contains a mix of peer reviewed and unrefereed journal
articles, book chapters, and conference and workshop papers, and an
edited book, organized by subject and listed most-recent-first within
a subject. Many of the papers are available here as PDF. Sometimes
the PDF here is the initially submitted version rather than the
version finally published (this should be clear from formatting);
links to the publisher's Web site are provided for journal articles
when possible. This page is sometimes updated less frequently than I
would like, so if there's something specific that you are looking for
that is not yet here let me know and I'll do my best to get it posted.
Papers Written for a Broad Audience
This is a mix of overviews of a topic that were prepared for various
venues and position papers that describe a specific interest that are
sometimes prepared as a basis for discussion at a workshop.
- James Mayfield, Eugene Yang, Dawn Lawrie, Sean MacAvaney, Paul
McNamee, Douglas W. Oard, Luca Soldaini, Ian Soboroff, Orion
Weller, Efsun Kayi, Kate Sanders, Marc Mason and #Noah Hibbler,
On the Evaluation of Machine-Generated Reports, Proceedings of
the 47th International ACM SIGIR Conference on Research and
Development in Information Retrieval, Washington DC, Perspectives
paper, 12 pages, 2024. PDF
- Petra Galuščáková and Douglas W. Oard and Suraj Nair,
Cross-Language Information Retrieval, CoRR abs/2111.05988, 49
pages, 2022. (PDF preprint)
- Tetsuya Sakai, Douglas W. Oard, and Noriko Kando, Evaluating
Information Retrieval and Access Tasks: NTCIR's Legacy of
Research Impact, Springer, 2020. (Publisher
Open Access)
- Douglas W. Oard, The Future of Information Retrieval Evaluation,
in Evaluating Information Retrieval and Access Tasks: NTCIR's
Legacy of Research Impact, Springer, 2020. (PDF preprint)
- Ben Carterette, Hussein Suleman and Douglas W. Oard, Report on the
1st ACM SIGIR/SIGKDD Africa School on Machine Learning for Data
Mining and Search. SIGIR Forum 53(1): 3-13 (2019). (PDF)
- Mihai Lupu, Atsushi Fujii, Douglas W. Oard, Makoto Iwayama, and
Noriko Kando, Patent-Related Tasks at NTCIR, in Mihai Lupu et al,
Current Challenges in Patent Information Retrieval (Second
Edition), pp.77-111, Springer-Verlag, 2017. (PDF preprint) (Publisher)
- Douglas W. Oard. The Moonwalkers Who Could Have Been, Quest: The
History of Spaceflight Quarterly, 23(3), 51-53, 2016. (PDF)
- Douglas W. Oard, Amalia S. Levi, Ricardo L. Punzalan and Robert
Warren, "Bridging Communities of Practice: Emerging Technologies
for Content-Centered Linking," in 18th Annual Museums and the Web
Conference, 10 pages, Baltimore, MD, 2014. (PDF)
- Douglas W. Oard and Joseph Malionek, "The Apollo Archive
Explorer," in Joint Conference on Digital Libraries, 2 page
demonstration description, Indianapolis, IN, 2013. (PDF)
- Douglas W. Oard and William Webber, "Information Retrieval for
E-Discovery," Foundations and Trends in Information Retrieval,
7(2-3)100-237, 2013. (PDF) (Publisher)
- Douglas W. Oard, ``Can Automatic Speech Recognition Replace
Manual Transcription?,'' Oral History in the Digital Age (Web
resource), 2012. (HTML)
- Douglas W. Oard, "A Whirlwind Tour of Automated Language
Processing for the Humanities and Social Sciences," in Working
Together or Apart: Promoting Digital Scholarship, Council on Library
and Information Resources, 2009. (PDF)
- Douglas W. Oard, "Multilingual Information Access," in
Encyclopedia of Library and Information Sciences, 3rd Ed., edited by
Marcia J. Bates, Editor, and Mary Niles Maack, Associate Editor,
Taylor & Francis, 2009. (PDF)
- Douglas W. Oard, "Unlocking the Potential of the Spoken Word,"
Science, 321(5897)1787-1788, 2008. (Publisher)
- Franciska de Jong, Douglas W. Oard, Willemijn Heeren and Roeland
Ordelman, "Access to Recorded Interviews: A Research Agenda," ACM
Journal on Computing and Cultural Heritage, 1(1)1-27, 2008. (PDF), (Publisher)
- Franciska de Jong, Douglas W. Oard, Roeland Ordelman and Stephan
Raaijmakers, "Searching Spontaneous Conversational Speech," workshop
report in SIGIR Forum, 41(2)104-108, 2007. (PDF)
- Douglas W. Oard, "Transcending the Tower of Babel: Supporting
Access to Multilingual Information with Cross-Language Information
Retrieval," in Robert Popp and John Yen, ed., Emergent Information
Technologies and Enabling Policies for Counter-Terrorism, Prentice
Hall, Chapter 15, pp. 299-314, 2006. (PDF)
- Douglas W. Oard, "Towards Analysis Tools for a Multilingual
Blogsphere," in AAAI Spring Symposium on Computational Approaches to
Analyzing Weblogs, Stanford, CA, 3 pages, 2006. (PDF)
- Jerry Goldman, Steve Renals, Steven Bird, Franciska de Jong,
Marcello Federico, Carl Fleischhauer, Mark Kornbluh, Lori Lamel,
Douglas W. Oard, Fabrizio Sebastiani, Claire Stewart and Richard
Wright, "Transforming Access to the Spoken Word," International
Journal on Digital Libraries, 5(4)287-298, 2005. (PDF), (Publisher)
- Douglas W. Oard, "The SIGIR Workshop Program," SIGIR Forum,
39(2)15-16, 2005. (PDF)
- Douglas W. Oard, "The Surprise Language Exercises," ACM
Transactions on Asian Language Information Processing, 2(2)79-84,
2003. (PDF) (Publisher)
- Douglas W. Oard, "Coping with Surprise: Responsive Language
Technology", Team TIDES, p. 2, October 2003. (PDF)
- Douglas W. Oard, "Surprise: It's Cebuano!", Team TIDES, pp. 2-3,
April 2003. (PDF)
- Douglas W. Oard, "Interactive Cross-Language Information
Retrieval," workshop report in SIGIR Forum, 35(1)1-3, 2001. (PDF)
- Judith Klavans, Eduard Hovy, Christian Fluhr, Robert Frederking,
Douglas Oard, Akitoshi Okumura, Kai Ishikawa, and Kenji Satoh,
"Multilingual (or Cross-Lingual) Information Retrieval" in
Multilingual Information Management: Current Levels and Future
Abilities, Eduard Hovy, Nancy Ide, Robert Frederking, Joseph Mariani,
Antonio Zampolli (eds.), Chapter 2, pp. 35-56, 2001. (HTML)
- Douglas W. Oard and Anne R. Diekema, "Cross-Language Information
Retrieval," in Martha Williams (ed.), in Annual Review of Information
Science and Technology, Volume 33, Chapter 6, pp. 223-256, 1998. (ASCII)
- Douglas Oard, Carol Peters, Miguel Ruiz, Robert Frederking,
Judith Klavans, and Paraic Sheridan, "Multilingual Information
Discovery and Access (MIDAS): A Joint ACM DL '99 / ACM SIGIR '99
Workshop," D-Lib Magazine, October, 1999. (HTML)
- Douglas W. Oard, "Extending Cross-Language Information Retrieval to
a Global Scale," NSF Workshop on Multilingual Information Management,
pp. 24-25, Granada, Spain, 1998. (PDF)
- Douglas W. Oard, "The State of the Art in Text Filtering." User
Modeling and User Adapted Interaction, 7(3)141-178, 1997. (PDF)
- Douglas W. Oard, "Serving Users In Many Languages: Cross-Language
Information Retrieval for Digital Libraries" D-Lib Magazine, December,
1997. (HTML)
- Douglas W. Oard, "Alternative Approaches for Cross-Language Text
Retrieval," in AAAI Symposium on Cross-Language Text and Speech
Retrieval, pp. 131-139, Palo Alto CA, 1997. (PDF)
- Douglas W. Oard, "Speech-Based Information Retrieval for Digital
Libraries," AAAI Symposium on Cross-Language Text and Speech
Retrieval, Palo Alto, CA, 1997. (PDF)
- Douglas W. Oard, "Cross-Language Text Retrieval Research in the
USA," Third DELOS Workshop: Cross-Language Information retrieval,
pp. 7-16, Zurich, 1997. (PDF)
- Douglas W. Oard and Bonnie J. Dorr, "A Survey of Multilingual
Text Retrieval," University of Maryland Computer Science Department,
31 pp., CS-TR-3615, 1996. (PDF)
- Christos Faloutsos and Douglas Oard, "A Survey of Information
Retrieval and Filtering Methods," University of Maryland Computer
Science Department, 23 pp., CS-TR-3514, 1995. (PDF)
These are track overview and track description papers that resulted
from my work as a track coordinator in the Text Retroeval Conference
(TREC) Neural Cross-Language Infromation Retrieval (NeuCLIR) Track.
These papers describe evaluation design issues for
information retrieval systems that are designed to support a search
using math. My own research on information retrieval techniques using
these evaluation designs can be found below in the Muntlingual Information Access section.
- Dawn Lawrie, Sean MacAvaney, James Mayfield, Paul McNamee,
Douglas W. Oard, Luca Soldaini and Eugene Yang, Overview of the
TREC 2023 NeuCLIR Track, TREC, 2023. (PDF)
- Dawn Lawrie, Sean MacAvaney, James Mayfield, Paul McNamee,
Douglas W. Oard, Luca Soldaini and Eugene Yang, Overview of the
TREC 2022 NeuCLIR Track, TREC, 2022. (PDF)
These are track overview and track description papers that resulted
from my work as a track coordinator in the Copnferences and Labs of
the Evaluation Forum (CLEF) Answer Retrieval for Questions on Math
(ARQMATH) lab. These papers describe evaluation design issues for
information retrieval systems that are designed to support a search
using math. My own research on information retrieval techniques using
these evaluation designs can be found below in the Math Search section.
- Behrooz Mansouri, Vit Novotny, Anurag Agrawal, Douglas W. Oard
and Richard Zanibbi, Overview of ARQMath-3 (2022): Third CLEF
Lab on Answer Retrieval for Questions on Math (Working Notes
Version). Working Notes of CLEF, pp. 1-25, 2022. (PDF)
- Behrooz Mansouri, Vit Novotny, Anurag Agrawal, Dougl0as W. Oard
and Richard Zanibbi, Overview of ARQMath-3 (2022): Third CLEF Lab
on Answer Retrieval for Questions on Math, CLEF, Springer LNCS,
2022. (PDF)
- Behrooz Mansouri, Anurag Agarwal, Douglas W. Oard and Richard
Zanibbi, Advancing Math-Aware Search: The ARQMath-3 Lab at CLEF
2022, 8 pages, ECIR, 2022. (PDF)
- Behrooz Mansouri, Richard Zanibbi, Douglas W. Oard and Anurag
Agarwal, Overview of ARQMath-2 (2021): Second CLEF Lab on Answer
Retrieval for Questions on Math (Working Notes Version). Working
Notes of CLEF, pp. 1-24, 2021. (PDF)
- Behrooz Mansouri, Richard Zanibbi, Douglas W. Oard, Anurag
Agarwal, Overview of ARQMath-2 (2021): Second CLEF Lab on Answer
Retrieval for Questions on Math, CLEF, Springer LNCS,
pp. 215-238, 2021. (PDF)
- Behrooz Mansouri, Anurag Agarwal, Douglas W. Oard and Richard
Zanibbi, Advancing Math-Aware Search: The ARQMath-2 Lab at CLEF
2021, ECIR, 7 pages, 2021. (PDF)
- Richard Zanibbi, Behrooz Mansouri, Anurag Agarwal and Douglas
W. Oard, ARQMath: A new benchmark for math-aware CQA and math
formula retrieval. SIGIR Forum 54(2): 4:1-4:9, 2020. (PDF)
- Richard Zanibbi, Douglas W. Oard, Anurag Agarwal and Behrooz
Mansouri, Overview of ARQMath 2020: CLEF Lab on Answer Retrieval
for Questions on Math (Updated Working Notes Version with
Eratta 2 incorporated), Working Notes of CLEF, 27 pages, 2020,
corrected in 2021. (PDF)
- Richard Zanibbi, Douglas W. Oard, Anurag Agarwal, and Behrooz
Mansouri, Overview of ARQMath 2020: CLEF Lab on Answer Retrieval
for Questions on Math, CLEF, Springer LNCS, 2020. (PDF)
- Behrooz Mansouri, Anurag Agarwal, Douglas W. Oard, Richard
Zanibbi, Finding Old Answers to New Math Questions: The ARQMath
Lab at CLEF 2020, ECIR, pp. 564-571, 2020. (PDF)
FIRE Track Overviews (2011-2013)
These are track overview papers that resulted from my work as a track
coordinator in the Forum for Information Retrieval Evaluation (FIRE).
These papers describe evaluation design issues for information
retrieval systems that are designed to support a search for digital
evidence in a litigation context. My own research on information
retrieval techniques using these evaluation designs can be found below
in the Document Image Retrieval and Speech sections.
- Douglas W. Oard, Jerome White, Jaiul Paik, Rashmi Sankepally and
Aren Jansen, "The FIRE 2013 Question Answering for the Spoken Web
Task," Fifth Forum for Information Retrieval Evaluation, 8 pages,
New Delhi, India, 2013. (PDF)
- Utpal Garain, Jiaul Paik, Tamaltaru Pal, Prasenjit Majumder,
David Doermann and Douglas W. Oard, "Overview of the FIRE 2011
RISOT Task," Third Forum for Information Retrieval Evaluation,
pp.~159--163, Mumbai, India, 2011. (PDF)
TREC Legal Track Overviews
(2006-2011)
These are track overview papers that resulted from my work as a track
coordinator in the Text Retrieval Conference (TREC). These papers
describe evaluation design issues for information retrieval systems
that are designed to support a search for digital evidence in a
litigation context. My own research on information retrieval
techniques using these evaluation designs can be found below in the Document Image Retrieval and Email sections.
- Maura R. Grossman, Gordon V. Cormack, Bruce Hedin and Douglas W. Oard,
"Overview of the TREC 2011 Legal Track," in Proceedings of the
Twentieth Text Retrieval Conference, 20 pages, Gaithersburg, MD,
2011. (PDF)
- Douglas W. Oard, Jason R. Baron, Bruce Hedin, David D. Lewis and
Stephen Tomlinson, "Evaluation of Information Retrieval for
E-Discovery," Artificial Intelligence and Law, 18(4)347-386, 2010.
(PDF) (Publisher)
- Gordon V. Cormack, Maura R. Grossman, Bruce Hedin, and Douglas
W. Oard, "Overview of the TREC-2010 Legal Track," in Working Notes of
the Nineteenth Text Retrieval Conference, pp. 30-38, Gaithersburg, MD,
2010. (PDF)
- William Webber, Douglas W. Oard, Falk Scholer and Bruce Hedin,
"Assessor Error in Stratified Evaluation," in The 18th ACM
International Conference on Information and Knowledge Management, 10
pages, Toronto, Canada, 2010. (PDF)
- Bruce Hedin, Stephen Tomlinson, Jason R. Baron and Douglas
W. Oard, "Overview of the TREC 2009 Legal Track,'' in Proceedings of
the Eighteenth Text Retrieval Conference," 40 pages, Gaithersburg, MD,
2009. (PDF)
- Bruce Hedin and Douglas W. Oard, "Replication and Automation of
Expert Judgments: Information Engineering in Legal E-Discovery," in
IEEE Conference on Systems, Man and Cybernetics, 6 pages, San Antonio,
TX, 2009. (PDF)
- Douglas W. Oard, Bruce Hedin, Stephen Tomlinson and Jason
R. Baron, "Overview of the TREC 2008 Legal Track," in The Seventeenth
Text Retrieval Conference, Gaithersburg, MD, 45 pages, 2008. (PDF)
- Stephen Tomlinson, Douglas W. Oard, Jason R. Baron and Paul
Thompson, "Overview of the TREC 2007 Legal Track," in The Sixteenth
Text Retrieval Conference, Gaithersburg, MD, 34 pages, 2007. (PDF)
- Jason R. Baron, David D. Lewis and Douglas W. Oard, "The TREC-2006
Legal Track" in The Fifteenth Text Retrieval Conference, Gaithersburg,
MD, 20 pages, 2006. (PDF)
CLEF Cross-Language Speech Retrieval Track
Overviews (2005-2007)
These are track overview papers that resulted from my work as a track
coordinator in the Cross-Language Evaluation Forum (CLEF). These
papers describe evaluation design issues for information retrieval
from spontaneous speech, regardless of the query language. My own
research on information retrieval techniques using these evaluation
designs can be found below in the Speech
Retrieval section.
- Pavel Pecina, Petra Hoffmannova, Gareth J.F. Jones, Ying
Zhang and Douglas W. Oard, "Overview of the CLEF-2007 Cross-Language
Speech Retrieval Track," in Advances in Multilingual and Multimodal
Information Retrieval, Revised Selected Papers, CLEF 2007,
Springer-Verlag, LNCS (5152), Budapest, pp. 674-686, 2007. (PDF)
- Douglas W. Oard, Jianqiang Wang, Gareth G.F. Jones, Ryen White,
Pavel Pecina, Dagobert Soergel, Xiaoli Huang, Izhak Shafran, "Overview
of the CLEF-2006 Cross-Language Speech Retrieval Track," in Evaluation
of Multilingual and Multi-modal Information Retrieval, Revised
Selected Papers, CLEF-2006, Springer-Verlag, LNCS (4730), Alicante,
Spain, 12 pages, 2006. (PDF)
- Ryen W. White, Douglas W. Oard, Gareth J.F. Jones, Dagobert
Soergel and Xiaoli Huang, "Overview of the CLEF-2005 Cross-Language
Speech Retrieval Track," in Multilingual Information Repositories,
Revised Selected Papers, CLEF-2005, Springer-Verlag, LNCS (4022),
Vienna, Austria, pp. 744-759, 2005. (PDF)
CLEF Interactive Track Overviews (2002-2004)
These are track overview papers that resulted from my work as a track
coordinator in the Cross-Language Evaluation Forum (CLEF). These
papers describe evaluation design issues for user-in-the-loop systems
that are designed to support Multilingual Information Access (MLIA).
My own research on information retrieval techniques using these
evaluation designs can be found below in the MLIA
section.
- Julio Gonzalo and Douglas W. Oard, "iCLEF 2004 Track Overview:
Pilot Experiments in Interactive Cross-Language Question Answering,"
in Multilingual Information Access for Text, Speech and Images, Fifth
Workshop of the Cross-Language Evaluation Forum, CLEF 2004, Revised
Selected Papers Series, Springer-Verlag, LNCS (3491), Bath, UK,
pp. 310-322, 2004. (PDF)
- Julio Gonzalo and Douglas W. Oard, "The CLEF-2003 Interactive
Track," in Comparative Evaluation of Multilingual Information Access
Systems, Fourth Workshop of the Cross-Language Evaluation Forum,
Revised papers, Springer-Verlag LNCS (3237), Trondheim, Norway, 2003.
(PDF)
- Douglas Oard and Julio Gonzalo, "The CLEF-2002 Interactive
Track," in Advances in Cross-Language Information Retrieval Third
Workshop of the Cross-Language Evaluation Forum, CLEF 2002, Revised
papers, Springer-Verlag LNCS (2785), pp. 245-254, Rome, Italy,
2002. (PDF)
- Douglas W. Oard and Julio Gonzalo, "The CLEF 2001 Interactive
Track," in Evaluation of Cross-Language Information Retrieval Systems,
Second Workshop of the Cross-Language Evaluation Forum, CLEF 2001
Revised Papers, Springer-Verlag LNCS (2406), Darmstadt, Germany,
pp. 308-319, 2001. (PDF)
TREC Arabic CLIR Track Overviews (2001-2002)
These are track overview papers and other papers that resulted from my
work as a track coordinator in the Text Retrieval Conference (TREC).
These papers describe evaluation design issues for information
retrieval from Arabic, regardless of the query language. My own
research on information retrieval techniques using these evaluation
designs can be found below in the Multilingual
Information Access section.
- Douglas W. Oard and Frederic C. Gey, "The TREC-2002
Arabic-English CLIR Track," in The Eleventh Text Retrieval Conference,
Gaithersburg, MD, pp. 17-26, 2002. (PDF)
- Douglas W. Oard, Fredric C. Gey and Bonnie J. Dorr, "Evaluating
Arabic Retrieval from English or French Queries," in LREC Workshop on
Arabic Language Resources and Evaluation, Las Palmas, Spain, pp. 5-10,
2002. (PDF)
- Fredric C. Gey and Douglas W. Oard, "The TREC-2001 Cross-Language
Information Retrieval Track: Searchi
Arabic Queries," in The Tenth Text Retrieval Conference, pp. 114-121,
Gaithersburg, MD, 2001. (PDF)
- Douglas W. Oard and Fredric C. Gey, "The TREC-2001 Arabic
Information Retrieval Evaluation," in ACL Workshop on Arabic Language
Processing, pp. 95-96, Toulouse, France, 2001. (PDF)
Other Evaluation Design
These papers report on evaluation design research conducted outside
the scope of a shared-task evaluation that I helped to coordinate.
- Elizabeth Salesky, Matthew Weisner, Jacob Bremerman, Roldano
Cattoni, Matteo Negri, Marco Turchi, Douglas W. Oard, Matt Post,
Multilingual TEDx Corpus for Speech Recognition and Translation.
Interspeech, 5 pp., 2021. (PDF)
- Jacob Bremerman, Huda Khayrallah, Douglas W Oard and Matt Post,
On the Evaluation of Machine Translation n-best Lists, EMNLP
Workshop on Evaluation and Comparison of NLP Systems, 9 pages,
2020. (PDF)
- Jacob Bremerman, Dawn J. Lawrie, James Mayfield and Douglas
W. Oard, Two Test Collections for Retrieval Using Named Entity
Markup. CIKM, pp. 3265-3268, 2020. (PDF)
- Douglas W. Oard, Tetsuya Sakai and Noriko Kando, Celebrating 20
Years of NTCIR: The Book, 1 page, EVIA, Tokyo, Japan, 2019. (PDF)
- Ning Gao, Mossaab Bagdouri and Douglas W. Oard, Pearson Rank: A
Head-Weighted Gap-Sensitive Score-Based Correlation Coefficient,"
in 39th International ACM SIGIR Conference on Research and
Development in Information Retrieval, 4 pages, Pisa, Italy, 2016. (PDF)
- Ning Gao and Douglas W. Oard, "A Head-Weighted Gap-Sensitive
Correlation Coefficient," in Proceedings of the 28th Annual ACM
SIGIR Conference on Research and Development in Information
Retrieval, Santiago, Chile, 2015. (PDF)
- Ning Gao, William Webber and Douglas W. Oard, "Reducing Reliance
on Relevance Judgments for System Comparison by Using
Expectation-Maximization," in Proceedings of the of the 36th
European Conference on Information Retrieval, 12 pages,
Amsterdam, The Netherlands, 2014. (PDF)
- Dina Demner-Fushman, Daqing He and Douglas W. Oard, "Exploring
Interactive Relevance Feedback With a Two-Pass Study Design,"
Technical Report CS-TR-4621, University of Maryland Computer
Science Department, 2004. (PDF)
- Bonnie Dorr, Christof Monz, Douglas Oard, David Zajic and Richard
Schwartz, "Extrinsic Evaluation of Automatic Metrics for
Summarization," Technical Report CS-TR-4610, University of
Maryland Computer Science Department, 2004. (PDF)
Multilingual Information Access
These papers address the problem of finding documents that are written
in one language (e.g., Chinese) using requests that are written in a
different language (e.g., English). This problem is often referred to
as "Cross-Language Information Retrieval" (CLIR), but Multilingual
Information Access (MLIA) is a more inclusive term that better
describes the scope of the work described here. Papers that address
MLIA for spoken or scanned content can be found in those sections,
interspersed with my other papers that address those topics. TREC and
CLEF track overview papers that address evaluation design for some
specific MLIA problems that have been the focus of international
evaluation venues can be found in the evaluation design sections above.
- Eugene Yang, Suraj Nair, Dawn Lawrie, James Mayfield, Douglas
W. Oard and Kevin Duh, Effectiveness=Efficiency Tradeoff of
Probabilistic Structured Queries for Cross-Language Information
Retrieval, arXiv preprint, arXiv:2404.18797, 11 pages, 2024. (PDF Preprint)
- Eugene Yang, Dawn Lawrie, James Mayfield, Douglas Oard and Scott
Miller, Translate-Distill: Learning Cross-Language Dense
Retrieval by Translation and Distillation, European Conference on
Information Retrieval, Glasgow, UK, 17 pages, 2024. (PDF)
- Suraj Nair and Douglas W. Oard, BLADE: The University of Maryland
at the TREC 2023 NeuCLIR Track, TREC, 2023. (PDF)
- Suraj Nair, Eugene Yang, Dawn Lawrie, James Mayfield and Douglas
Oard, BLADE: Combining Vocabulary Pruning and Intermediate
Pretraining for Scaleable Neural CLIR, Proceedings of the 46th
International ACM SIGIR Conference on Research and Development in
Information Retrieval, Taipei, 11 pages, 2023. (PDF)
- Dawn Lawrie, James Mayfield, Douglas Oard, Eugene Yang, #Suraj
Nair and Petra Galuščáková, HC3: A Suite of Test Collections for
CLIR Evaluation over Informal Text, Proceedings of the 46th
International ACM SIGIR Conference on Research and Development in
Information Retrieval, Taipei, 10 pages, 2023. (PDF)
- Dawn Lawrie, James Mayfield, Suraj Nair, Douglas W. Oard and
Eugene Yang, Neural Methods for Cross-Language Information
Retrieval, SIGIR 2023 Tutorial Abstract, Proceedings of the 46th
International ACM SIGIR Conference on Research and Development in
Information Retrieval, Taipei, 2 pages, 2023. (PDF)
- Dawn Lawrie, Eugene Yang, James Mayfield and Douglas W. Oard,
Neural Approaches to Multilingual Information Retroeval, European
Conference on Infromation Retrieval, 2023. (PDF)
- Eugene Yang, Suraj Nair, Dawn Lawrie, James Mayfield and Douglas
W. Oard, Parameter-Efficient Zero-Shot Transfer for
Cross-Language Dense Retrieval with Adapters, 15 pages, ArXiv
preprint arXiv:2212,10448. (PDF)
- Suraj Nair and Douglas W. Oard, Probabilistic Structured Queries:
The University of Maryland at the TREC 2022 NeuCLIR Track, TREC,
2022. (PDF)
- Inkyung Choi, Wan-Chen Lee, Ying-Hsang Liu, Hsinlinag Chen,
Douglas W. Oard and Chi Young Oh, Cross-Cultural Information
Access (Panel Summary), 4 pages, ASIS&T, 2022. (PDF)
- Suraj Nair, Eugene Yang, Dawn Lawrie, James Mayfield and Douglas
W. Oard, Learning a Sparse Representation Model for Neural CLIR,
12 pages, DESIRES, 2022. (PDF)
- Eugene Yang, Suraj Nair, Ramraj Chandradevan, Rebecca
Iglesias-Flores and Douglas W. Oard, C3: Continued Pretraining
with Contrastive Weak Supervision for Cross-Language Ad-Hoc
Retrieval, 6 pages, SIGIR, 2022. (PDF)
- Suraj Nair, Eugene Yang, Dawn Lawrie, Kevin Duh, Paul McNamee,
Kenton Murray, James Mayfield and Douglas W. Oard, Transfer
Learning Approaches for Building Cross-Language Dense Retrieval
Models, 15 pages, ECIR, 2022. (PDF)
- Dawn Lawrie, James Mayfield, Douglas W. Oard and Eugene Yang,
HC4: A New Suite of Tst Collections for Ad Hoc CLIR, 16 pages,
ECIR, 2022. (PDF)
- Yanda Chen, Chris Kedzie, Suraj Nair, Petra Galuščáková, Rui
Zhang, Douglas W. Oard and Kathleen McKeown, Cross-language
Sentence Selection via Data Augmentation and Rationale Training,
Joint Conference of the 59th Annual Meeting of the Association
for Computational Linguistics and the 11th International Joint
Conference on Natural Language Processing, pp. 3881-3895,
2021. (PDF)
- Petra Galuščáková and Douglas W. Oard, Supporting Global
Knowledge Sharing using Cross-Language Information Retrieval,
NASA AI and Data Science Workshop for Earth and Space Science, 2
page poster paper, 2021. (PDF)
- Suraj Nair, Petra Galuščáková and Douglas W. Oard, Combining
contextualized and non-contextualized query translations to
improve CLIR, 4 pages, SIGIR, 2020. (PDF)
- Petra Galuščáková, Douglas W. Oard, Joe Barrow, Suraj Nair,
Han-Chin Shing, Elena Zotkina, Ramy Eskander, Rui Zhang,
MATERIALizing Cross-Language Information Retrieval: A Snapshot,
LREC Workshop on Cross-Language Search and Summarization of Text
and Speech, pp. 14-21, 2020. (PDF)
- Han-Chin Shing, Joe Barrow, Petra Galuščáková, Douglas W. Oard,
Philip Resnik, Unsupervised System Combination for Set-Based
Retrieval with Expectation Maximization, CLEF, pp. 191-197,
Lugano, Switzerland, 2019. (PDF)
- Douglas Oard, Marine Carpuat, Petra Galuščáková, Joseph Barrow,
Suraj Nair, Xing Niu, Han-Chin Shing, Weijia Xu, Elena Zotkina,
Kathleen McKeown, Smaranda Muresan, Efsun Kayi, Ramy Eskander,
Chris Kedzie, Yan Virin, Dragomir Radev, Rui Zhang, Mark Gales,
Anton Ragni and Kenneth Heafield, Surprise Languages:
Rapid-Response Cross-Language IR. Proceedings of the Ninth
International Workshop on Evaluating Information Access (EVIA
2019), 5 pages, Tokyo Japan, 2019. (PDF)
- Sungho Kim, Youngjoong Ko and Douglas W. Oard, "Combining Lexical
and Statistical Translation Evidence for Cross-Language
Information Retrieval," Journal of the Association for
Information Science and Technology (JASIST), 66(1)23-39,
2015. (preprint: PDF) (Publisher)
- Mossaab Bagdouri, Douglas W. Oard, and Vittorio Castelli, "CLIR
for Informal Content in Arabic Forum Posts," in ACM International
Conference on Information and Knowledge Management, 4 pages,
Shanghai, China, 2014. (PDF)
- Yejun Wu and Douglas W. Oard, "English and Chinese Bilingual Topic
Aspect Classification: Examining Similarity Measures, Optimal LSA
Dimensions, and Centroid Correction of Translated Training
Examples," in 76th Annual Conference of the American Society for
Information Science and Technology, contributed paper, 12 pages,
Montreal, Canada, 2013. (PDF)
- Ferhan Ture, Jimmy Lin and Douglas W. Oard, "Combining
Statistical Translation Techniques for Cross-Language Information
Retrieval," in 24th International Conference on Computational
Linguistics, 17 pages, Mumbai, India, 2012. (PDF)
- Ferhan Ture, Douglas W. Oard and Philip Resnik, "Encouraging
Consistent Translation Choices," in Proceedings of the 2012
Conference of the North American Chapter of the Association for
Computational Linguistics, pp. 417-426, Montreal, Canada,
2012. (PDF)
- Ferhan Ture, Jummy Lin and Douglas W. Oard, "Looking Inside the
Box: Context-Sensitive Translation for Cross-Language Information
Retrieval," in 35th Annual International ACM-SIGIR Conference on
Research and Development in Information Retrieval, 2 pages,
Portland, OR, 2012. (PDF)
- Jianqiang Wang and Douglas W. Oard, "Matching Meaning for
Cross-Language Information Retrieval." Information Processing and
Management, 48(4)631-653, 2012. (PDF) (Publisher)
- Douglas W. Oard, Carl Madson, Joseph Olive, John McCary and Caitlin
Christianson (eds.), "Operational Engines," in Joseph Olive, Caitlin
Christianson and John McCary (eds.), Handbook of Natural Language
Processing and Machine Translation: DARPA Global Autonomous Language
Exploitation, pp. 845-932, Springer, 2011. (Publisher)
- Tan Xu and Douglas W. Oard "FIRE-2008 at Maryland," in Working
Notes of the Forum for Information Retrieval Evaluation, 12 pages,
Kolkata, India, 2008. (PDF)
- Yejun Wu and Douglas W. Oard, "Bilingual Aspect Classification Based
on Cross-language Text Classification," 31st Annual International ACM
SIGIR Conference on Research and Development in Information Retrieval,
pp. 203-210, Singapore, 2008. (PDF)
- Douglas W. Oard, Daqing He and Jianqiang Wang, "User-Assisted
Query Translation for Cross-Language Information Retrieval,"
Information Processing and Management, 44(1)181-211, 2008. (PDF) (Publisher)
- Pengyi Zhang, Lynne Plettenberg, Judith Klavans, Douglas W. Oard
and Dagobert Soergel, "Task-Based Interaction with an Integrated
Multilingual Multimedia Information System: A Formative Evaluation,"
Joint Conference on Digital Libraries, pp. 117-126, Vancouver, BC,
Canada, 2007. (PDF)
- Jianqiang Wang and Douglas W. Oard, "Combining Bidirectional
Translation and Synonymy for Cross-Language Information Retrieval," in
29th Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval, pp. 202-209, Seattle, 2006. (PDF)
- Daqing He, Douglas W. Oard, and Lynne Plettenberg, "Studying the
Use of Interactive Multilingual Information Retrieval", in ACM SIGIR
Workshop on New Directions in Multilingual Information Access,
Amsterdam, 5 pages, 2006. (PDF)
- Gina-Anne Levow, Douglas W. Oard and Philip Resnik,
"Dictionary-Based Cross-Language Retrieval," Information Processing
and Management, 41(3)523-547, 2005. (PDF)
- J. Scott Olsson, Douglas W. Oard and Jan Hajic, "Cross-Language
Text Classification," in Proceedings of the 28th Annual ACM SIGIR
Conference on Research and Development in Information Retrieval,
pp. 645-646, 2005. (PDF)
- Daqing He, Jianqiang Wang, Jun Luo and Douglas W. Oard, "iCLEF
2004 at Maryland: Summarization Design for Interactive Cross-Language
Question Answering," in Multilingual Information Access for Text,
Speech and Images, Fifth Workshop of the Cross-Language Evaluation
Forum, CLEF 2004, Revised Selected Papers Series, Springer-Verlag,
LNCS (3491), Bath, UK, pp. 340-362, 2004. (PDF)
- Douglas W. Oard, Julio Gonzalo, Mark Sanderson, Fernando
Lopez-Ostenero and Jianqiang Wang, "Interactive Cross-Language
Document Selection," Information Retrieval, 7(1-2)205-228, 2004. (PDF)
- H. Ma, D. Doermann, B. Karagol-Ayan, D. Oard, and J. Wang, "Parsing
and Tagging of Bilingual Dictionaries," Traitement Automatique des
Langues, 44(2)125-150, 2004. (PDF)
- Tamer Elsayed, Douglas W. Oard, David Doermann and Gary Kuhn,
"TDT- 2004: Adaptive Topic Tracking at Maryland," in Working Notes of
the TDT-2004 Workshop, Gaithersburg, MD, 5 pages, 2004. (PDF)
- Kareem Darwish and Douglas W. Oard, "Probabilistic Structured
Query Methods," in Twenty-Sixth International ACM-SIGIR Conference on
Research and Development in Information Retrieval, pp. 338-344,
Toronto, Canada, 2003. (PDF)
- Daqing He, Douglas W. Oard, Jianqiang Wang, Jun Luo, Dina Demner-
Fushman, Kareem Darwish, Philip Resnik, Sanjeev Khudanpur, Michael
Nossal, Michael Subotin and Anton Leuski, "Making MIRACLEs:
Interactive Translingual Search for Cebuano and Hindi," ACM
Transactions on Asian Language Information Processing, 2(3)219-244,
2003. (PDF), (Publisher)
- Dina Demner-Fushman and Douglas W. Oard, "The Effect of Bilingual
Term List Size on Dictionary-Based Cross-Language Information
Retrieval," in Hawaii International Conference on System Sciences, 10
pages, Kona, HI, 2003. (PDF)
- Douglas W. Oard and Franz Josef Och, "Rapid-Response Machine
Translation for Unexpected Languages," 7 pages, Machine Translation
Summit IX, New Orleans, 2003. (PDF)
- Douglas W. Oard, David Doermann, Bonnie Dorr, Daqing He, Philip
Resnik, Amy Weinberg, William Byrne, Sanjeev Khudanpur, David
Yarowsky, Anton Leuski, Philipp Koehn and Kevin Knight, "Desperately
Seeking Cebuano," in Third Conference on Human Language Technologies,
short paper (3 pages), Edmonton, Canada, 2003. (PDF)
- Abdessamad Echicabi, Douglas W. Oard, Daniel Marcu and Ulf
Hermjakob, "Answering Spanish Questions from English Documents," in
Comparative Evaluation of Multilingual Information Access Systems,
Fourth Workshop of the Cross-Language Evaluation Forum, Revised
papers, Springer-Verlag LNCS (3237), Trondheim, Norway, pp. 514-522,
2003. (PDF)
- Bonnie J. Dorr, Daqing He, Jun Luo, Douglas W. Oard, Richard
Schwartz, Jianqiang Wang and David Zajic, "iCLEF-2003 at Maryland:
Headline Generation and Interactive Query Formulation," in Comparative
Evaluation of Multilingual Information Access Systems, Fourth Workshop
of the Cross-Language Evaluation Forum, Revised papers,
Springer-Verlag LNCS (3237), Trondheim, Norway, pp. 435-449, 2003. (PDF)
- Daqing He, Jianqiang Wang, Douglas W. Oard and Michael Nossal,
"Comparing User-Assisted and Automatic Query Translation," in Advances
in Cross-Language Information Retrieval Third Workshop of the
Cross-Language Evaluation Forum, CLEF 2002, Revised papers,
Springer-Verlag LNCS (2785), pp. 267-278, Rome, Italy, 2002. (PDF)
- Gina-Anne Levow and Douglas W. Oard, "Signal Boosting for
Translingual Topic Tracking" in Allen, James, ed. Topic Detection and
Tracking: Event-Based Information Organization, Chapter 9,
pp. 175-195, Kluwer Academic, 2002. (PDF)
- Douglas W. Oard and Funda Ertunc, "Translation-Based Indexing for
Cross-Language Information Retrieval," in 24th BCS-IRSG European
Colloquium on IR Research, pp. 324-333, Glasgow, UK, 2002. (PDF)
- David Doermann, Huanfeng Ma, Burcu Karagol-Ayan and Douglas
W. Oard, "Translation Lexicon Acquisition from Bilingual
Dictionaries," in Proceedings of the Ninth SPIE Symposium on Document
Recognition and Retrieval, pp. 37-48, San Jose, CA, 2002.
- Kareem Darwish and Douglas W. Oard, "CLIR Experiments at Maryland
for TREC-2002: Evidence Combination for Arabic-English Retrieval," in
The Eleventh Text Retrieval Conference, Gaithersburg, MD, pp. 703-711,
2002. (PDF)
- Daqing He, Hyuk Ro Park, G. Craig Murray, Michael Subotin and
Douglas W. Oard, "TDT-2002: Topic Tracking at Maryland: First
Experiments with the Lemur Toolkit," in Working Notes of the Topic
Detection and Tracking Workshop, 7 pages (online proceedings),
Gaithersburg, MD, 2002. (PDF)
- Kareem Darwish, David Doermann, Ryan Jones, Douglas Oard, and
Mika Rautiainen, "TREC-10 Experiments at Maryland: CLIR and Video," in
The Tenth Text Retrieval Conference, pp. 552-564, Gaithersburg, MD,
2001. (PDF)
- Gina Levow, Douglas Oard, Philip Resnik, and Clara Cabezas,
"Rapidly Retargetable Interactive Translingual Retrieval," in
Proceedings of the First International Conference on Human Language
Technology, pp. 294-298, San Diego, 2001. (PDF)
- Philip Resnik, Douglas Oard and Gina Levow, "Improved
Cross-Language Retrieval using Backoff Translation," in Proceedings of
the First International Conference on Human Language Technology,
pp. 153-155, San Diego, 2001. (PDF)
- Jianqiang Wang and Douglas W. Oard, "iCLEF 2001 at Maryland:
Comparing Word-for-Word Gloss and MT," in Evaluation of Cross-Language
Information Retrieval Systems, Second Workshop of the Cross-Language
Evaluation Forum, CLEF 2001 Revised Papers, Springer-Verlag LNCS
(2406), Darmstadt, Germany, pp. 336-354, 2001. (PDF)
- Douglas W. Oard and Jianqiang Wang, "NTCIR-2 ECIR Experiments at
Maryland: Comparing Structured Queries and Balanced Translation," in
Proceedings of the Second NTCIR Workshop on Evaluation of Japanese and
Chinese Text Retrieval and Text Summarization, pp. 97-104, Tokyo,
2001. (PDF)
- Douglas W. Oard, "Evaluating Interactive Cross-Language Document
Retrieval: Document selection," Proceedings of the First
Cross-Language Evaluation Forum, pp. 57-71, Lisbon, 2000. (PDF)
- Douglas W. Oard, Gina-Anne Levow and Clara Cabezas, "CLEF
Experiments at Maryland: Statistical stemming and backoff
Translation," in Cross-Language Information Retrieval and Evaluation,
Workshop of Cross-Language Evaluation Forum, CLEF 2000, Revised Papers
Springer-Verlag, LNCS (2069), pp. 176-187, Lisbon, 2000. (PDF)
- Ruth Sperer and Douglas W. Oard, "Structured Translation for
Cross-Language Information Retrieval," in Proceedings of the 23rd
Annual ACM SIGIR Conference on Research and Development in Information
Retrieval, pp. 120-127, Athens, Greece, 2000. (PDF)
- Paul G. Hackett and Douglas W. Oard, "Comparison of Word-Based and
Syllable-Based Retrieval for Tibetan," Poster paper, in Fifth
International Workshop on Information retrieval with Asian Languages,
pp. 197-198, Hong Kong, 2000. (PDF)
- Douglas W. Oard, Gina-Anne Levow and Clara Cabezas, "TREC-9
Experiments at Maryland: Interactive CLIR," in The Ninth Text
Retrieval Conference, pp. 543-550, Gaithersburg, MD, 2000. (PDF)
- Gina-Anne Levow and Douglas W. Oard, "Translingual Topic
Tracking: Applying Lessons from the MEI Project," in Working notes of
the Topic Detection and Tracking Workshop, (5 pages), Gaithersburg,
MD, 2000. (PDF)
- Gina-Anne Levow and Douglas W. Oard, "Translingual Topic Tracking
With PRISE," in Working notes of the Topic Detection and Tracking
Workshop, pp. 175-180, Tysons Corner, VA, 2000. (PDF)
- Douglas W. Oard and Jianqiang Wang, "NTCIR CLIR Experiments at the
University of Maryland," in Proceedings of the First NTCIR Workshop on
Research in Japanese Text Retrieval and Term Recognition, pp. 157-161,
Tokyo, 1999. (PDF)
- Douglas W. Oard, Jianqiang Wang, Dekang Lin, and Ian Soboroff,
"TREC-8 Experiments at Maryland: CLIR, QA and Routing," in The Eighth
Text Retrieval Conference, pp. 623-636, Gaithersburg, MD, 1999. (PDF)
- Douglas W. Oard and Philip Resnik, "Support for Interactive
Searching in Cross-Language Information Retrieval," Information
Processing and Management, 35(3)363-379, 1999. (PDF) (Publisher)
- Gina-Anne Levow and Douglas W. Oard, "Evaluating Lexicon Coverage
for Cross-Language Information Retrieval" in Proceedings of the
Workshop on Multilingual Information Processing and Asian Language
Processing, pp. 69-74, Beijing, 1999. (PDF)
- Douglas W. Oard and Jianqiang Wang, "Effects of Term Segmentation
in Chinese/English Cross-Language Information Retrieval," in
Proceedings of the Symposium on String Processing and Information
Retrieval, pp. 149-157, Cancun, Mexico, 1999. (PDF)
- Douglas W. Oard, "Topic Tracking with the PRISE Information
Retrieval System," in Proceedings of the DARPA Broadcast News
Workshop, pp. 209-211, Reston, VA, 1999. (PDF)
- Douglas W. Oard, "Resources for Chinese/English Cross-Language
IR," 25 pp., University of Maryland, 1999. (PDF) [the greek letters are
misrendered versions of unfilled, half-filled, and completely filled
circles that broke when Microsoft updated the character set for Word]
- Douglas W. Oard, "A Comparative Study of Query and Document
Translation for Cross-Language Information Retrieval," in Proceedings
of the Third Conference of the Association for Machine Translation in
the Americas, pp. 472-483, Philadelphia, PA, 1998. (PDF)
- Bonnie J. Dorr and Douglas W. Oard, "Evaluating Resources for
Query Translation in Cross-Language Information Retrieval," in
Proceedings of the First International Conference on Language Resource
Evaluation, Volume II, pp. 759-764, Granada, Spain, 1998. (PDF)
- Douglas W. Oard and Bonnie J. Dorr, "Evaluating Cross-Language
Text Filtering Effectiveness," in Gregory Grefenstette (ed.),
Cross-Language Information Retrieval, Chapter 12, pp. 151-161, Kluwer
Academic, 1998. [This is essentially the same as the SIGIR 96 workshop
paper below.]
- Douglas W. Oard, "TREC-7 Experiments at the University of
Maryland," in The Seventh Text Retrieval Conference, pp. 541-545,
Gaithersburg, MD, 1998. (PDF)
- Douglas W. Oard and Paul Hackett, "Document Translation for
Cross-Language Text Retrieval at the University of Maryland," in The
Sixth Text Retrieval Conference, pp. 687-696, Gaithersburg MD,
1997. (PDF)
- Douglas W. Oard, "Adaptive Filtering of Multilingual Document
Streams," in Fifth RIAO Conference on Computer Assisted Information
Searching on the Internet, Volume 1, pp. 233-254, Montreal, Canada,
1997. (PDF)
- Douglas W. Oard, "Alignment of Spanish and English TREC Topic
Descriptions," in The Fifth Text Retrieval Conference, pp. 547-553,
Gaithersburg MD, 1996. (PDF)
- Douglas W. Oard, "Adaptive Vector Space Text Filtering for
Monolingual and Cross-Language Applications," Ph.D. Dissertation,
University of Maryland, College Park, 1996. (PDF)
- Douglas W. Oard and Bonnie J. Dorr, "Evaluating Cross-Language
Text Filtering Effectiveness," in Proceedings of Cross-Linguistic
Multilingual Information Retrieval Workshop, ACM SIGIR Conference,
pp. 8-14, Zurich, 1996. (PDF)
- Douglas W. Oard, Nicholas DeClaris, Bonnie J. Dorr and Christos
Faloutsos, "On Automatic Filtering of Multilingual Texts," Proceedings of
IEEE International Conference on Systems, Man and Cybernetics,
pp. 1645-1650, San Antonio, TX, 1994. (PDF)
Speech Retrieval
These papers address techniques for searching spoken content based on
written or spoken queries.
- Douglas W. Oard, Christopher Bearman, David Baker, Susannah
Paletz and Johanne Trippas, Operational Disconnect Detection in
Mission Control, ICASSP Fearless Steps Apollo Workshop, Seoul,
South Korea, 2 pages, 2024. (PDF)
- Petra Galuščáková, Suraj Nair and Douglas W. Oard: Combine and
Re-Rank: The University of Maryland at the TREC 2020 Podcasts
Track, TREC (notebook paper), 9 pages, 2020. (PDF)
- Suraj Nair, Anton Ragni, Ondrej Klejch, Petra Galuščáková and
Douglas Oard, Experiments with Cross-Language Speech Retrieval
for Lower-Resource Languages, Asia Information Retrieval
Symposium, pp. 145-157, Hong Kong, China, 2019. (PDF)
- Ning Gao, Gregory Sell, Douglas Oard, Mark Dredze, Leveraging
Side Information for Speaker Identification with the Enron
Conversational Telephone Speech Collection, IEEE Automatic Speech
Recognition and Understanding Workshop, 7 pages, 2017. (PDF)
- Ning Gao, Douglas W. Oard and Mark Dredze, Support for
Interactive Identification of Mentioned Entities in
Conversational Speech, 40th International ACM SIGIR Conference on
Research and Development in Information Retrieval 4 pages, Tokyo,
Japan, 2017. (PDF)
- Tiffany Jachja and Douglas W. Oard, "Goal-Directed Information
Seeking in Time-Synchronized and Topic-Linked Records of the
Apollo Lunar Missions," The ACM Conference on Human Information
Interaction and Retrieval, 4 pages, Oslo, Norway, 2017.
(PDF)
- Douglas W. Oard, John H.L. Hansen, Abhijeet Sangawan, Bryan Toth,
Lakshmish Kaushik and Chengzhu Yu, "Toward Access to
Multi-Perspective Archival Spoken Word Content," International
Conference on Asian Digital Libraries, 6 pages, Tsukuba, Japan,
2016. (PDF)
- Douglas W. Oard, Rashmi Sankepally, Jerome White and Craig
Harman, "Vapor Engine: Demonstrating an Early Prototype of a
Language-Independent Search Engine for Speech," in ACM SIGIR
Conference on Human Information Interaction and Retrieval, 4
pages, Chapel Hill, NC, 2016. (PDF)
- Douglas W. Oard, Rashmi Sankepally, Jerome White, Aren Jansen and
Craig Harman, "A Test Collection for Spoken Gujarati Queries,"
Proceedings of the 28th Annual ACM SIGIR Conference on Research
and Development in Information Retrieval, Santiago, Chile, 2015. (PDF)
- Jerome White, Douglas Oard, Aren Jansen, Jiaul Paik and Rashmi
Sankepally, Using Zero-Resource Spoken Term Discovery for Ranked
Retrieval, Annual Conference of the North American Chapter of the
Association for Computational Linguistics - Human Language
Technologies, Denver, CO, 2015. (PDF)
- Ali Ziaei, Lakshmish Kaushik, Abhijeet Sangwan, John H.L. Hansen
and Doug Oard, "Speech Activity Detection for NASA Apollo Space
Missions: Challenges and Solutions," in 15th Annual Conference of
the International Speech Communication Association, 5 pages,
Singapore, 2014. (PDF)
- Chengzhu Yu, John Hansen and Douglas W. Oard, "Houston, We have a
Solution: A Case study of the Analysis of Astronaut Speech during
NASA Apollo 11 for Long-term Speaker Modeling," in 15th Annual
Conference of the International Speech Communication Association,
4 pages, Singapore, 2014. (PDF)
- Jerome White, Douglas W. Oard, Nitendra Rajput and Marion Zalk,
"Simulating Early-Termination Search for Verbose Spoken Queries,"
Emperical Methods in Natural Language Processing, 11 pages,
Seattle, WA, 2013. (PDF)
- Abhijeet Sangwan, Lakshmish Kaushik, Chengzhu Yu, John
H.L. Hansen and Douglas W. Oard, "Houston, We Have a Solution:
Using NASA Apollo Program to Advance Speech and Language
Processing Technology," INTERSPEECH, pp. 1135-1139, Lyon, France,
2013. (PDF)
- Douglas W. Oard, Abhijeet Sangwan and John H.L. Hansen,
"Reconstruction of Apollo Mission Control Center Activity," in
SIGIR Workshop on Exploration, Navigation and Retrieval of
Information in Cultural Heritage (ENRICH), 4 pages, Dublin,
Ireland, 2013. (PDF)
- Joseph Malionek, Douglas W. Oard, John Hansen and Abhijeet
Sangwan, "Linking Transcribed Conversational Speech," in 36th
Annual International ACM-SIGIR Conference on Research and
Development in Information Retrieval, 4 pages, Dublin, Ireland,
2013. (PDF)
- Douglas W. Oard, "Query By Babbling: A Research Agenda," in CIKM
Workshop on Information and Knowledge Management for Developing
Regions, 5 pages, Maui, HI, 2012. (PDF)
- J. Scott Olsson and Douglas W. Oard, "Combining Evidence from
LVCSR and Ranked Utterance Retrieval for Robust Domain-Specific
Ranked Retrieval," Annual International ACM-SIGIR Conference on
Research and Development in Information Retrieval, Boston,
2009. (PDF)
- J. Scott Olsson and Douglas W. Oard, "Phrase-Based Query
Degradation Modeling for Vocabulary-Independent Ranked Utterance
Retrieval" Proceedings of the Annual Conference of the North
American Chapter of the Association for Computational Linguistics
Human Language Technology Conference, Boulder, 2009. (PDF)
- J. Scott Olsson and Douglas W. Oard, "Combining Speech Retrieval
Results with Generalized Additive Models," Association for
Computational Linguistics-Human Language Technology Conference,
pp. 461-469, Columbus, OH, 2008. (PDF)
- J. Scott Olsson and Douglas W. Oard, "Improving Text
Classification for Oral History Archives with Temporal Domain
Knowledge," 30th Annual International ACM SIGIR Conference on
Research and Development in Information Retrieval, pp. 623-630,
Amsterdam, 2007. (PDF)
- Pavel Ircing, Douglas W. Oard and Jan Hoideker, "First
Experiments Searching Spontaneous Czech Speech," in 30th Annual
International ACM SIGIR Conference on Research and Development in
Information Retrieval, Amsterdam, pp. 835-836, 2007. (PDF)
- Baolong Liu and Douglas W. Oard, "One-Sided Measures for
Evaluating Ranked Retrieval Effectiveness with Spontaneous
Conversational Speech," in 29th Annual International ACM SIGIR
Conference on Research and Development on Information Retrieval,
Seattle, pp. 673-674, 2006. (PDF)
- Diana Inkpen, Muath Alzghool, Gareth J.F. Jones and Douglas
W. Oard, "Investigating Cross-Language Speech Retrieval for a
Spontaneous Conversational Speech Collection," Conference on
Human Language Technologies and the North American Chapter of the
Association for Computational Linguistics, 4 pages, New York,
2006. (PDF)
- Jianqiang Wang and Douglas W. Oard, "CLEF 2006 CL-SR at Maryland:
English and Czech," in Evaluation of Multilingual and Multi-modal
Information Retrieval, Revised Selected Papers, CLEF-2006,
Springer-Verlag, LNCS (4730), Alicante, Spain, 7 pages, 2006. (PDF)
- Jianqiang Wang and Douglas W. Oard, "CLEF 2005 CL-SR at Maryland:
Document and Query Expansion using Side Collections and
Thesauri," in Multilingual Information Repositories, Revised
Selected Papers, CLEF-2005, Springer-Verlag, LNCS (4022), Vienna,
Austria, pp. 800-809, 2005. (PDF)
- William Byrne, David Doermann, Martin Franz, Samuel Gustman, Jan
Hajic, Douglas Oard, Michael Picheny, Josef Psutka, Bhuvana
Ramabhadran, Dagobert Soergel, Todd Ward and Wei-Jing Zhu,
"Automated Recognition of Spontaneous Speech for Access to
Multilingual Oral History Archives," IEEE Transactions on Speech
and Audio Processing, 12(4)420-435, 2004. (PDF) (Publisher)
- Douglas W. Oard, Dagobert Soergel, Craig Murray, David Doermann,
Jianqiang Wang, Bhuvana Ramabhadran, Martin Franz, James
Mayfield, Samuel Gustman, and Stephanie Strassel, "Building an
Information Retrieval Test Collection for Spontaneous
Conversational Speech," Twenty-Seventh ACM-SIGIR Conference on
Research and Development in Information Retrieval, pp. 41-48,
Sheffield, UK, 2004. (PDF)
- Helen Meng, Berlin Chen, Erika Grams, Sanjeev Khudanpur,
Gina-Anne Levow, Wai-Kit Lo, Douglas W. Oard, Karen Tang,
Hsin-Min Wang and Jianqiang Wang, "Mandarin-English Information
(MEI): Investigating Translingual Speech Retrieval," Computer
Speech and Language, 18(2)163-179, 2004. (PDF) (Publisher)
- Sudeep Gandhe, Andrew Gordon, Anton Leuski, David R. Traum and
Douglas W. Oard, "First Steps Towards Linking Dialogues:
Mediating Between Free-Text Questions and Pre-recorded Video
Answers," in 24th Army Science Conference, Orlando, FL, 8 pages,
2004. (PDF)
- Jinmook Kim Douglas W. Oard and Dagobert Soergel, "Searching
Large Collections of Recorded Speech: A Preliminary Study," in
Annual Conference of the American Society for Information Science
and Technology, Long Beach, CA, pp. 330-339, 2003. (PDF)
- Douglas W. Oard and Anton Leuski, "Searching Recorded Speech
Based on the Temporal Extent of Topic Labels," in AAAI Spring
Symposium on Intelligent Multimedia Knowledge Management, Palo
Alto, CA, 5 pages, 2003. (PDF)
- Jinmook Kim, Dagobert Soergel and Douglas W. Oard, "MALACH
Workshop 2: Final Report," 62 pp., 2003.
- Samuel Gustman, Dagobert Soergel, Douglas Oard, William Byrne,
Michael Picheny, Bhuvana Ramabhadran and Douglas Greenberg,
"Supporting Access to Large Digital Oral History Archives," in
Second Joint Conference on Digital Libraries, pp. 18-27,
Portland, OR, 2002. (PDF)
- Douglas W. Oard, Dina Demner-Fushman, Jan Hajic, Bhuvana
Ramabhadran, Samuel Gustman, William J. Byrne, Dagobert Soergel,
Bonnie Dorr, Philip Resnik and Michael Picheny, "Cross-Language
Access to Recorded Speech in the MALACH Project," in Fifth
International Conference on Text,S peech and Dialog, pp. 57-64,
Brno, Czech Republic, 2002. (PDF)
- Jinmook Kim and Douglas W. Oard, "The Use of Speech Retrieval
Systems: A Study Design," in ACM SIGIR Workshop on IR Techniques
for Speech Applications, New Orleans, pp. 86-93, 2001. (PDF)
- Helen Meng, Berlin Chen, Erika Grams, Sanjeev Khudanpur,
Gina-Anne Levow, Wai-Kit Lo, Douglas W. Oard, Karen Tang,
Hsin-Min Wang and Jianqiang Wang, "Mandarin-English Information
(MEI): Investigating Translingual Speech Retrieval," in
Proceedings of the First International Conference on Human
Language Technology, pp. 239-245, San Diego, 2001. (PDF)
- Helen Meng, Sanjeev Khudanpur, Gina-Anne Levow, Douglas W. Oard,
and Hsin-Min Wang, "Mandarin-English Information (MEI):
Investigating Translingual Speech Retrieval," in NAACL Workshop
on Embedded Machine Translation, pp. 23-30, Seattle, WA,
2000. (PDF)
- Douglas W. Oard, "User Interface Design for Speech-Based
Retrieval," Bulletin of the American Society for Information
Science, vol. 26, no. 5, pp. 20-22, June/July, 2000. (Publisher)
- Helen Meng, Sanjeev Khudanpur, Douglas W. Oard, and Hsin-Min
Wang, "Mandarin-English Information (MEI)," in Working notes of
the Topic Detection and Tracking Workshop, pp. 117-121, Tysons
Corner, VA, 2000. (PDF)
- Laura Slaughter, Douglas W. Oard, Vernon Warnick, Galen Wilkerson
and Julie Harding, "A Graphical Interface for Speech-Based
Retrieval," in Proceedings of the Third ACM Conference on Digital
Libraries, pp. 305-306, Pittsburgh, PA, 1998. (PDF)
Search and Sense-making in Email Collections
These papers address techniques for helping people find things in
large collections of electronic mail that are not their own. I do not
work on the counterpart problem of Personal Information Management, in
which tools are built to help people better manage their own email
collections. Papers reporting on evaluation design for email search
in the TREC Legal Track can also be found in the TREC Legal Track
Overview section.
- Tan Xu and Douglas W. Oard, "Exploring Example-Based Person
Search in Email," in 35th Annual International ACM-SIGIR
Conference on Research and Development in Information Retrieval,
2 pages, Portland, OR, 2012. (PDF)
- Hyunmo Kang, Catherine Plaisant, Tamer Elsayed, and Douglas
W. Oard, "Making Sense of Archived Email: Exploring the Enron
Collection with NetLens," Journal of the American Society for
Information Science and Technology, 61(4)723-744, 2010. (PDF) (Publisher)
- Tamer Elsayed, Douglas W. Oard, and Galileo Namata, "Resolving
Personal Names in Email Using Context Expansion," accepted for
presentation at Association for Computational Linguistics-Human
Language Technology Conference, pp. 941-949, Columbus, OH,
2008. (PDF)
- Adam Perer, Ben Shneiderman, and Douglas W. Oard, "Using Rhythms
of Relationships to Understand Email Archives," Journal of the
American Society for Information Science and Technology,
57(14)1936-1948, 2006. (PDF) (Publisher)
- Yejun Wu, Douglas W. Oard and Ian Soboroff, "An Exploratory Study
of the W3C Mailing List Test Collection for Retrieval of Emails with
Pro and/or Con arguments," in Third Conference on Email and Anti-Spam,
10 pages, Mountain View, CA, 2006. (PDF)
- Tamer Elsayed and Douglas W. Oard, "Modeling Identity in Archival
Collections of Email: A Preliminary Study," in Conference on Email and
Anti-Spam, 9 pages, Mountain View, CA, 2006. (PDF)
- Yejun Wu and Douglas W. Oard, "Indexing Emails and Email Threads
for Retrieval," in Proceedings of the 28th Annual ACM SIGIR Conference
on Research and Development in Information Retrieval, poster paper,
pp. 665-666, 2005. (PDF)
- Jimmy Lin, Eileen Abels, Dina Demner-Fushman, Douglas W. Oard,
Philip Wu, and Yejun Wu, "A Menagerie of Tracks at Maryland: HARD,
Enterprise, QA, and Genomics, Oh My!," in The Fourteenth Text
Retrieval Conference, Gaithersburg, MD, 16 pages, 2005. (PDF)
- Anton Leuski, Douglas W. Oard and Rahul Bhagat, "eArchivarius:
Accessing Collections of Electronic Mail," in Twenty-Sixth
International ACM-SIGIR Conference on Research and Development in
Information Retrieval, description of system demonstration, pp. 468,
Toronto, Canada, 2003. (PDF)
Search and Sense-making in Text Chat
This is a research area on which I may publish more
in the future.
- Rashmi Sankepally and Douglas W. Oard, An Initial Test Collection
for Ranked Retrieval of SMS Conversations, in Eleventh Language
Resources and Evaluation Conference, Miyazaki, Japan, 2018. (PDF)
- Lidan Wang and Douglas W. Oard, "Context-based Message Expansion
for Disentanglement of Interleaved Text Conversations" Proceedings of
the Annual Conference of the North American Chapter of the Association
for Computational Linguistics Human Language Technology Conference,
Boulder, 2009. (PDF)
Search Among Sensitive Content
The focus of this work is on balancing relevance with protection for
sensitive content.
- Jason R. Baron, Nathaniel W. Rollings and Douglas W. Oard, Using
ChatGPT for the FOIA Exemption 5 Deliberative Process Privilege,
Proceedings of the Third International Workshop on Artificial
Intelligence and Intelligent Assistance for Legal Professionals
in the Digital Workplace (LegalAIIA), Braga, Portugal, 2023. (PDF)
- Mahmoud F. Sayed, Nishanth Mallekav and Douglas W. Oard,
Comparing Intrinsic and Extrinsic Evaluation of Sensitivity
Classification, 8 pages, ECIR, 2022. (PDF)
- Jason R. Baron, Mahmoud F. Sayed and Douglas W. Oard, Providing
More Efficient Access To Government Records: A Use Case Involving
Application of Machine Learning to Improve FOIA Review for the
Deliberative Process Privilege, ACM Journal on Computing and
Cultural Heritage, 19pp., to appear, 2021. (PDF)
- Modassir Iqbal, Katie Shilton, Mahmoud F. Sayed, Douglas W. Oard,
Jonah Lynn Rivera and William Cox, Search with Discretion: Value
Sensitive Design of Training Data for Information Retrieval,
Proceedings of the ACM on Human Computer Interaction (also
presented at CSCW 2021), 20 pages, 2021. (PDF)
- Graham McDonald and Douglas W. Oard, Search Among Sensitive
Content, ECIR 2021 Tutorial Abstract, in Proceedings of the 43rd
European Conference on IR Re search, 1 page, 2021. (PDF)
- Mahmoud Sayed, William Cox, Jonah Lynn Rivera, Caitlin
Christian-Lamb, Modassir Iqbal, Douglas W. Oard and Katie
Shilton, A Test Collection for Relevance and Sensitivity, 4
pages, SIGIR, 2020. (PDF)
- Jimmy Lin, Ian Milligan, Douglas Oard, Nick Ruest and Katie
Shilton, We Could, But Should We? Ethical Considerations for
Providing Access to GeoCities and Other Historical Digital
Collections, CHIIR, 10 pages, Vancouver, BC, Canada, 2020. (PDF)
- Alexandra Olteanu, Jean Garcia-Gathright, Maarten de Rijke, and
Michael D. Ekstrand (eds.) and Adam Roegiest, Aldo Lipani, Alex
Beutel, Alexandra Olteanu, Ana Lucic, Ana-Andreea Stoica,
Anubrata Das, Asia Biega, Bart Voorn, Claudia Hauff, Damiano
Spina, David Lewis, Douglas W. Oard, Emine Yilmaz, Faegheh
Hasibi, Gabriella Kazai, Graham McDonald, Hinda Haned, Iadh
Ounis, Ilse van der Linden, Jean Garcia-Gathright, Joris Baan,
Kamuela N. Lau, Krisztian Balog, Maarten de Rijke, Mahmoud Sayed,
Maria Panteli, Mark Sanderson, Matthew Lease, Michael
D. Ekstrand, Preethi Lahoti, and Toshihiro Kamishima (authors),
FACTS-IR: Fairness, Accountability, Confidentiality,
Transparency, and Safety in Information Retrieval, SIGIR Forum,
(53)2, 2019. (PDF)
- Mahmoud F. Sayed, Douglas W. Oard: Jointly Modeling Relevance and
Sensitivity for Search Among Sensitive Content. SIGIR,
pp. 615-624, Paris, France, 2019. (PDF)
- Katie Shilton, Amy Wickner, Douglas W. Oard and Jimmy Lin,
Protecting Sensitive Content in Email: Archival Views on
Challenges and Opportunities, The First International Workshop on
Privacy-Sensitive Collections for Digital Scholarship, 4 pages,
Montreal, Canada, 2017. (PDF)
- Douglas W. Oard, Katie Shilton and Jimmy Lin, Evaluating Search
Among Secrets, in The Seventh International Workshop on
Evaluating Information Access, Tokyo, Japan, 2016. (PDF)
Math Search
The focus of this work is on searching mathematical content, or mixed
math and text content.
- Behrooz Mansouri, Douglas W. Oard and Richard Zanibbi, DPRL
Systems in the CLEF 2022 ARQMath Lab: Introducing MathAMR for
Math-Aware Search, Working Notes of CLEF, 18 pages, 2022.
- Behrooz Mansouri, Douglas W. Oard, Anurag Agrawal and Richard
Zanibbi, Effects of Context, Complexity, and Clustering on
Evaluation for Math Formula Retrieval, arXiv preprint
arXiv:2111.10504, 10 pages, 2021. (PDF)
- Behrooz Mansouri, Douglas W. Oard and Richard Zanibbi, DPRL
Systems in the CLEF 2021 ARQMath Lab: Sentence-BERT for Answer
Retrieval, Learning-to-Rank for Formula Retrieval, Working Notes
of CLEF, pp. 47-62, 2021. (PDF)
- Behrooz Mansouri, Richard Zanibbi and Douglas W. Oard, Learning
to Rank for Mathematical Formula Retrieval. SIGIR, pp. 952-961,
2021. (PDF)
- Behrooz Mansouri, Douglas W. Oard and Richard Zanibbi, DPRL
Systems in the CLEF 2020 ARQMath Lab. CLEF Working Notes, 12
pages, 2020. (PDF)
- Behrooz Mansouri, Shaurya Rohatgi, Douglas W. Oard, Jian Wu,
C. Lee Giles, Richard Zanibbi, Tangent-CFT: An Embedding Model
for Mathematical Formulas, ICTIR, pp. 11-18, Santa Clara, CA,
2019. (PDF)
- Behrooz Mansouri, Richard Zanibbi and Douglas Oard, Toward
Math-Enabled Digital Libraries: Characterizing Searches for
Mathematical Concepts, Joint Conference on Digital Libraries,
Urbana, IL, 2019. (PDF)
Microblog Search
The focus of this work is on searching short text posted to Twitter
and similar services.
- Mossaab Bagdouri and Douglas W. Oard, CLIP at TREC 2016: LiveQA
and RTS, The Twenty-Fifth Text Retrieval Conference, 6 pages,
Gaithersburg, MD, 2016. (PDF)
- Mossaab Bagdouri and Douglas W. Oard, CLIP at TREC 2015:
Microblog and Live QA," in The Twenty-Fourth Text Retrieval
Conference, 8 pages, Gaithersburg, MD, 2015. (PDF)
- Mossaab Bagdouri and Douglas W. Oard, "Profession-Based Person
Search in Microblocs: Using Seed Sets to Find Journalists," in
Proceedings of the 24rd Annual International ACM CIKM Conference
on Information and Knowledge Management, 10 pages, Melbourne,
Australia, 2015. (PDF)
- Mossaab Bagdouri and Douglas W. Oard, "On Prediccoting Deletions of
Microblog Posts," in Proceedings of the 24rd Annual International
ACM CIKM Conference on Information and Knowledge Management, 4
pages, Melbourne, Australia, 2015. (PDF)
- Tan Xu, Paul McNamee, and Douglas W. Oard, "HLTCOE at TREC 2014:
Microblog and Clinical Decision Support", in The Twenty-Third
Text Retrieval Conference, 8 pages, Gaithersberg, MD,
2014. (PDF)
- Tan Xu and Douglas W. oard, "Wikipedia-Based Topic Clustering for
Microblogs," 10 pages, Annual Meeting of the American Society for
Information Science and Technology, New Orleans, LA, 2011. (PDF)
E-Discovery
The focus of this work is on the design and evaluation of systems that
can support the exchange of documentary evidence among litigants
incident to civil litigation. The word "discovery" is also used with
other meanings by information retrieval researchers, but here it is
used in the legal sense.
- Douglas W. Oard, Fabrizio Sebastiani and Jyothi K. Vinjumur,
Jointly Minimizing the Expected Costs of Review for
Responsiveness and Privilege in E-Discovery, ACM Transactions on
Information Systems, 37(1)11:1-11:35, 2018. (PDF, Publisher, SIGIR 2020 Talk (MP4), SIGIR slides (PPTX)). Figure 4
in the paper has incorrect colors in the legend; a corrected figure is available.
- Douglas W. Oard, Jyothi Vinjumur and Fabrizio Sebastiani, When is
it Rational to Review for Privilege? ICAIL DESI VII Workshop on
Using Advanced Data Analysis in eDiscovery and Related
Disciplines to Identify and Protect Sensitive Information in
Large Collections, 10 pages, London, UK, 2017. (PDF)
- William Webber and Douglas W. Oard, "Metrics in Predictive
Coding," Perspectives on Predictive Coding and Other Advanced
Search and Review Technologies for the Legal Practitioner,
American Bar Association, 2016.
- Jyothi K. Vinjumur, Douglas W. Oard and Amittai Axelrod, "An AID
for Avoiding Inadvertent Disclosure: Supporting Interactive
Review for Privilege in E-Discovery," in ACM SIGIR Conference on
Human Information Interaction and Retrieval, 10 pages, Chapel
Hill, NC, 2016. (PDF)
- Jyothi K. Vinjumur and Douglas W. Oard, "Finding the privileged
Few: Supporting Privilege Review for E-Discovery," in Annual
Meeting of the Association for Information Science and
Technology, 4 pages, St. Louis, MO, 2015. (PDF)
- Jyothi K. Vinjumur, Douglas W. Oard and Jiaul H. Paik, "Assessing
the Reliability and Reusability of an E-Discovery Privilege Test
Collection," in 37th Annual International ACM-SIGIR Conference on
Research and Development in Information Retrieval, 4 pages, Gold
Coast, Australia, 2014. (PDF)
- Mossaab Bagdouri, William Webber, David D. Lewis and Douglas
W. Oard, "Towards Minimizing the Annotation Cost of Certified
Text Classification," in ACM Conference on Information and
Knowledge Management, 10 pages, San Francisco, CA, 2013. (PDF)
- William Webber, Mossaab Bagdouri, David D. Lewis and Douglas
W. Oard, "Sequential Testing in Classifier Evaluation Yields
Biased Estimates of Effectiveness," in 36th Annual International
ACM-SIGIR Conference on Research and Development in Information
Retrieval, 4 pages, Dublin, Ireland, 2013. (PDF)
- Feng Charlie Zhao, Douglas W. Oard and Jason R Baron, "Improving
Search Effectiveness in the Legal E-Discovery Process Using Relevance
Feedback," in Third International Workshop on Discovery of
Electronically Stored Information (DESI III), 10 pages, Barcelona,
Spain, 2009. (PDF)
Document Image Retrieval
These papers address techniques for searching scanned documents.
Papers reporting on the evaluation design for the TREC Legal Track,
which included scanned documents, can be found in the evaluation
design section above.
- Rajiv Jain, Douglas W. Oard and David Doermann, Scalable Ranked
Retrieval Using Document Images, in 21st SPIE Document
Recognition and Retrieval Conference, 15 pages, San Francisco,
CA, 2014. (PDF)
- Utpal Garain, Arjun Das, David Doermann and Douglas Oard,
Leveraging Statistical Transliteration for Dictionary-Based
English-Bengali CLIR of OCR'd Text, in 24th International
Conference on Computational Linguistics, 9 pages, Mumbai, India,
2012. (PDF)
- Lidan Wang and Douglas W. Oard, "Query Expansion for Noisy Legal
Documents," in The Sixteenth Text Retrieval Conference, 9 pages,
Gaithersburg, MD, 2008.
- Douglas Oard, Tamer Elsayed, Jianqiang Wang, Yejun Wu, Pengyi
Zhang, Eileen Abels, Jimmy Lin and Dagobert Soergel, TREC-2006 at
Maryland: Blog, Enterprise, Legal and QA Tracks," in The Fifteenth
Text Retrieval Conference, 16 pages, Gaithersburg, MD, 2006. (PDF)
- Kareem Darwish and Douglas W. Oard, "Balanced Query Methods for
OCR-Based Retrieval," 2003 Symposium on Document Image Understanding
Technology, Greenbelt, MD, 2003. (PDF)
- Kareem Darwish and Douglas W. Oard, "Term Selection for Searching
Printed Arabic," in Twenty-Fifth International ACM-SIGIR Conference on
Research and Development in Information Retrieval, Tampere, Finland,
pp. 261-268, 2002. (PDF)
- Yuen-Hsien Tseng and Douglas W. Oard, "Document Image Retrieval
Techniques for Chinese," 2001 Symposium on Document Image
Understanding Technology, pp. 151-158, Columbia, MD, 2001. (PDF)
- Douglas W. Oard, "Issues in Cross-Language Retrieval from
Document Image Collections," 1999 Symposium on Document Image
Understanding Technology, pp. 229-234, Annapolis, 1999. (PDF)
Archival Access
Some of my papers refer to archives simply with the broad meaning
"collections of content," but the papers in this section are focused
on learning to find content in archival institutions.
- Douglas W. Oard, Tokinori Suzuki, Emi Ishita and Noriko Kando,
Searching Unseen Sources for Historical Information: Evaluation
Design for the NTCIR-18 SUSHI Pilot Task, SIGIR-AP Workshop on
Evaluation Methodologies, Testbeds and Community for Information
Access Research, 8 pages, 2024. (PDF)
- Tokinori Suzuki, Douglas W. Oard, Emi Ishita and Yoichi Tomiura,
Searching for Physical Documents in Archival Repositories,
Proceedings of the 47th International ACM SIGIR Conference on
Research and Development in Information Retrieval, Washington DC,
5 pages, 2024. (PDF)
- Tokinori Suzuki, Douglas Oard, Emi Ishita and Yoichi Tomiura,
Automatically Detecting Referencesfrom the Scholarly Literature
to Records in Archives, International Conference on Asian Digital
Libraries, Taipei, 2023. (PDF)
- Douglas W. Oard, Known by the Company it Keeps: Proximity-Based
Indexing for Physical Content in Archival Repositories, TPDL, 14
pages, 2023 (PDF)
Computational Social Science
These papers involve the application of computational techniques to
foster social science research. Many of my other papers also address
issues that have potential application to social science research;
what distinguishes these papers is that supporting social science
research was the principal motivation for this work.
- Satoshi Fukuda, Emi Ishita, Yoichi Tomiura and Douglas W. Oard,
Automating the Choice Between Single or Dual Annotation for
Classifier Training, International Conference on Asian Digital
Libraries, 15 pp., 2021. (PDF)
- Emi Ishita, Satoshi Fukuda, Toru Oga, Yoichi Tomiura, Douglas
W. Oard and Kenneth. R. Fleischmann, Cost-Effective Learning for
Classifying Human Values, iConference, poster paper, Boras,
Sweden, 2020. (PDF)
- Emi Ishita, Satoshi Fukuda, Toru Oga, Douglas W. Oard, Kenneth
R. Fleischmann, Yoichi Tomiura and An-Shou Cheng, Toward
Three-Stage Automation of Annotation for Human Values,
iConference, College Park, MD, 2019. (PDF)
- Emi Ishita, Toru Oga, Yasuhiro Takayama, An-Shou Cheng, Douglas
W. Oard and Kenneth R. Fleischmann, Yoichi Tomiura, Toward
Automating Detection of Human Values in the Nuclear Power Debate,
80th Annual Meeting of the Association for Information Science
and Technology, 2 pages, Washington, DC, 2017. (PDF)
- Yasuhiro Takayama, Yoichi Tomiura, Kenneth R. Fleischmann,
Douglas W. Oard, An-Shou Cheng and Emi Ishita, "An Automatic
Dictionary Extraction and Annotation Method Using Simulated
Annealing for Detecting Human Values," Sixth International
Conference on E-Service and Knowledge Management, Okayama, Japan,
2015. (PDF)
- Emi Ishita, Douglas W. Oard, Kenneth R. Fleischmann, Yoichi
Tomiura, Yasuhiro Takayama and An-Shou Cheng, "Learning curves
for automating content analysis: How much human annotation is
needed?," Sixth International Conference on E-Service and
Knowledge Management, Okayama, Japan, 2015. (PDF)
- Kenneth R. Fleischmann, Yasuhiro Takayama, An-Shou Cheng, Yoichi
Tomiura, Douglas W. Oard and Emi Ishita, "Thematic Analysis of
Words that Invoke Values in the Net Neutrality Debate,"
iConference, 6 pages, Newport Beach, CA, 2015. (PDF)
- Yasuhiro Takayama, Yoichi Tomiura, Emi Ishita, Douglas W. Oard,
Kenneth R. Fleischmann, and An-Shou Cheng, "A Word-Scale
Probabilistic Latent Variable Model for Detecting Human Values,"
in ACM International Conference on Information and Knowledge
Management, 10 pages, Shanghai, China, 2014. (Corrected PDF, Corrections from published version,
PDF)
- Yasuhiro Takayama, Yoichi Tomiura, Emi Ishita, Zheng Wang,
Douglas Oard, Kenneth Fleischmann and An-Shou Cheng, "Improving
Automatic Sentence-Level Annotation of Human Values Using
Augmented Feature Vectors," in Conference of the Pacific
Association for Computational Linguistics, 6 pages, Tokyo, Japan,
2013. (PDF)
- An-Shou Cheng, Kenneth R. Fleischmann, Ping Wang, Emi Ishita, and
Douglas W. Oard, The Role of Innovation and Wealth in the Net
Neutrality Debate: A Content Analysis of Human Values in
Congressional and FCC Hearings, Journal of the American Society
for Information Science and Technology (JASIST), 63(7)1360-1373,
2012. (PDF) (Publisher)
- Kenneth R. Fleischmann, Douglas W. Oard, An-Shou Cheng, Jordan
Boyd-Graber, Thomas Clay Templeton, Emi Ishita, Jes A. Koepfler,
and William A. Wallace, "Content Analysis for Values
Elicitation," Proceedings of the CHI Workshop on Methods for Accounting
for Values in Human-Centered Computing, 4 pages, Austin, TX,
2012. (PDF)
- Pranav Anand, Joseph King, Jordan Boyd-Graber, Earl Wagner, Craig
Martell, Doug Oard, and Philip Resnik, "Believe Me -- We Can Do
This! Annotating Persuasive Acts in Blog Text", AAAI Workshop on
Computational Models of Natural Argument, San Francisco, CA,
2011. (PDF)
- Emi Ishita, Douglas W. Oard, Kenneth R. Fleischmann, An-Shou
Cheng and Thomas Clay Templeton, "Investigating Multi-Label
Sentence Classification for Human Values," Annual Conference of
the American Society for Information Science and Technology, 4
pages, Pittsburgh, PA, 2010. (PDF)
- An-Shou Cheng, Kenneth R. Fleischmann, Ping Wang, Emi Ishita and
Douglas W. Oard, "Values of Stakeholders in the Net Neutrality
Debate: Applying Content Analysis to Telecommunications Policy,"
in Hawaii International Conference on System Sciences, 10 pages,
Kauai, HI, 2010. (PDF)
- Chia-Jung Tsui, Ping Wang, Kenneth R. Fleischmann, Douglas
W. Oard and Asad B. Sayeed, Exploring the Relationships among
ICTs: A Scalable Computational Approach Using KL Divergence and
Hierarchical Clustering," in Hawaii International Conference on
System Sciences, 10 pages, Kauai, HI, 2010. (PDF)
- Emi Ishita, An-Shou Chen, Douglas W. Oard and Kenneth
R. Fleischmann, "Multi-label Classification for Human Values" (in
Japanese), in Annual Conference of the Japan Society of Library
and Information Science, 4 pages, Tokyo, Japan, 2009. (PDF)
- Chia-Jung Tsui, Ping Wang, Kenneth R. Fleischmann, Douglas
W. Oard and Asad B. Sayeed, "Understanding IT Innovations through
Computational Analysis of Discourse," in International Conference
on Information Systems, 9 pages, Phoenix, AZ, 2009. (PDF)
- Kenneth R. Fleischmann, Douglas W. Oard, An-Shou Cheng, Ping
Wang, and Emi Ishita, "Automatic Classification of Human Values:
Applying Computational Thinking to Information Ethics," Annual
Conference of the Association for Information Science and
Technology, Vancouver, 2009. (Publisher)
- Ping Wang, Chia-Jung Tsui, Kenneth R. Fleischmann, Douglas
W. Oard and Lidan Wang, "Understanding IT Innovations Through
Discourse Analysis," Fourth iSchools Conference, 3 pages, Chapel
Hill, 2009. (PDF)
- An-Shou Cheng, Kenneth R. Fleischmann, Ping Wang and Douglas
W. Oard, "Advancing Social Science Research by Applying
Computational Linguistics," in Proceedings of the Annual
Conference of the American Society for Information Science and
Technology, 12 pages, Columbus, 2008. (PDF)
Information Integration
These papers address issues that involve structured representation of
information found in (or that can be inferred from) unstructured
documents. This includes my work on the narrower problems of
information extraction, co-reference resolution, and text
classification. My principal interest is in how these techniques can
be employed in integrated systems that are designed to satisfy
specific types of information needs.
- Joe Barrow, Rajiv Jain, Nedim Lipka, Franck Dernoncourt, Vlad
I. Morariu, Varun Manjunatha, Douglas W. Oard, Philip Resnik and
Henning Wachsmuth, Syntopical Graphs for Computational
Argumentation Tasks, Joint Conference of the 59th Annual Meeting
of the Association for Computational Linguistics and the 11th
International Joint Conference on Natural Language Processing,
pp. 1583-1595. 2021. (PDF)
- Joe Barrow, Rajiv Jain, Vlad I. Morariu, Varun Manjunatha,
Douglas W. Oard, Philip Resnik, A Joint Model for Document
Segmentation and Segment Labeling, ACL, pp. 313-322, 2020. (PDF)
- Rashmi Sankepally; Tongfei Chen; Benjamin Van Durme; Douglas
W. Oard, "A Test Collection for Coreferent Mention Retrieval", 4
pages, SIGIR, Ann Arbor, MI, 2018. (PDF)
- Ning Gao, Mark Dredze and Douglas W. Oard, Enhancing Scientific
Collaboration Through Knowledge Base Population and Linking for
Meetings, Hawaii International Conference on System Sciences,
Waikoloa, HI, 2018. (PDF)
- Ning Gao, Mark Dredze and Douglas W. Oard, Person Entity Linking
in Email with NIL Detection, Journal for the Association for
Information Science and Technology, 68(10)2412-2424, 2017. (Publisher)
- Ning Gao, Mark Dredze and Douglas W. Oard, Knowledge-Based
Population for Organization Mention in Email, in 5th Workshop on
Automated Knowledge Base Conttruction,, 5 pages, 2016. (PDF)
- Tim Finin, Dawn Lawrie, Paul McNamee, James Mayfield, Douglas
Oard, Nanyun Peng, Ning Gao, Yiu-Chang Lin, Josh MacLin and Tim
Dowd, HLTCOE Participation in TAC KBK 2015: Cold Start and TEDL,
in Proceedings of the Text Analysis Conference, 14 pages,
Gaithersburg, MD, 2015. (PDF)
- Ning Gao, Douglas Oard and Mark Dredze, A Test Collection for
Email Entity Linking, 4th NIPS Workshop on Automated Knowledge
Base Construction (AKBC), 5 pages, Montreal, Canada, 2013. (PDF)
- Dawn Lawrie, James Mayfield, Paul McNamee and Douglas W. Oard,
"Cross-Language Person-Entity Linking from 20 Languages,"
Journal of the Association for Information Science and Technology
(JASIST), 66(6)2091-1105, 2015. (preprint: PDF) (Publisher)
- Hui Su, Adi Hajj-Ahmad, Min Wu and Douglas W. Oard, "Exploring
the Use of ENF for Multimedia Synchronization," IEEE
International Conference on Acoustics, Speech, and Signal
Processing, 5 pages, Florence, Italy, 2014. (PDF)
- Douglas W. Oard, Min Wu, Kari Kraus, Adi Hajj-ahmad, Hui Su and
avi Garg, "Its About Time: Projecting Temporal Metadata for
Historically Significant Recordings," 7 pages, iConference,
Berlin, Germany, 2014. (PDF)
- Paul McNamee, James Mayfield, Tim Finin, Tim Oates, Dawn Lawrie,
Tan Xu and Douglas Oard, "KELVIN: A Tool for Automated Knowledge
Base Construction," in Proceedings of the 2013 Conference of the
North American Chapter of the Association for Computational
Linguistics: Human Language Technologies, 4 page demonstration
paper, Atlanta, GA, 2013. (PDF)
- Paul McNamee, Veselin Stoyanov, James Mayfield, Tim Finin, Tim
Oates, Tan Xu, Douglas W. Oard and Dawn Lawrie, "HLTCOE Participation
at TAC 2012: Entity Linking and Cold Start Knowledge Base
Construction," in Proceedings of the Text Analysis Conference, 11
pages, Gaithersburg, MD, 2012. (PDF)
- Dawn Lawrie, James Mayfield, Paul McNamee and Douglas Oard,
"Creating and Curating a Cross-Language Entity Linking Collection,"
8th International Conference on Language Resources and Evaluation, 5
pages, Istanbul, Turkey, 2012. (PDF)
- Paul McNamee, James Mayfield, Douglas W. Oard, Tan Xu, Wu Ke,
Veselin Stoyanov and David Doermann, "Cross-Language Entity Linking in
Maryland During a Hurricane," in Proceedings of the Text Analysis
Conference, 11 pages, Gaithersburg, MD, 2011. (PDF)
- Jun Gong, Lidan Wang and Douglas W. Oard, "Matching Person Names
Through Name Transformation, in ACM Conference on Information and
Knowledge Management, 4 pages, Hong Kong, China, 2009. (PDF)
- Yejun Wu and Douglas W. Oard, "Beyond Topicality, Finding
Opinionated Documents," Annual Conference of the Association for
Information Science and Technology, Vancouver, 2009. (PDF)
- Jun Gong and Douglas W. Oard, "Selecting Hierarchical Clustering
Cut Points for Web Person-Name Disambiguation," Annual International
ACM-SIGIR Conference on Research and Development in Information
Retrieval, Boston, 2009. (PDF)
- Asad Sayeed, Tamer Elsayed, Nikesh Garera, David
Alexander, Tan Xu, Douglas W. Oard, David Yarowsky and Christine
Piatko, Arabic Cross-Document Coreference Resolution, Annual
Conference of the Association for Computational Linguistics /
International Joint Conference on Natural Language Processing,
pp. 357-360, Singapore, 2009. (PDF)
- James Mayfield, David Alexander, Bonnie Dorr, Jason Eisner, Tamer
Elsayed, Tim Finin, Clay Fink, Marjorie Freedman, Nikesh Garera, Paul
McNamee, Saif Mohammad, Douglas W. Oard, Christine Piatko, Asad
Sayeed, Zarem Syed, Ralph Weischedel, Tan Xu and David Yarowsky,
"Cross-Document Coreference Resolution: A Key Technology for Learning
by Reading," AAAI Spring Symposium on Learning by Reading and Learning
to Read, 6 pages, Stanford, 2009. (PDF)
- James Mayfield, Bonnie J. Dorr, Tim Finin, Douglas W. Oard and
Christine Piatko, "Knowledge Base Evaluation for Semantic Knowledge
Discovery," in Symposium on Syntactic Knowledge Discovery,
Organization and Use, New York, 2 pages, 2008. (PDF)
- Tan Xu, Douglas W. Oard, Tamer Elsayed and Asad Sayeed,
"Knowledge Representation from Information Extraction," Joint
Conference on Digital Libraries, Pittsburgh, p. 475, 2008. (PDF)
- Yejun Wu and Douglas W. Oard, "NTCIR-6 at Maryland: Chinese
Opinion Analysis Pilot Task," in Proceedings of the Sixth NTCIR
Workshop, Tokyo, 6 pages, 2007. (PDF)
- J. Scott Olsson and Douglas W. Oard, "Evaluating Feature
Selection Combination Methods for Automatic Text Classification," in
Conference on Information and Knowledge Management, Arlington, VA,
pp. 798-799, 2006. (PDF)
- Douglas W. Oard, "Integration of Natural Language with Structured
Data: Three Test Collections," Information Integration Workshop,
Philadelphia, 2 pages, 2006. (PDF)
- Dina Demner-Fushman, Philip Resnik and Douglas W. Oard. "Genomic
Entity Recognition at TREC," JCDL TREC Genomics Pre-Track Workshop,
Portland, 2002. (PDF)
- Paul Losiewicz, Douglas W. Oard and Ronald N. Kostoff, "Textual
Data Mining to Support Science and Technology Management," Journal of
Intelligent Information Systems, 15(2)99-119, 2000. (PDF)
Recommender Systems
These papers address techniques for recommending new content to users
based on learned representations of the stable interests of those
users. The term "recommender systems" is used expansively here to
include both content-based and behavior-based systems, and systems
that rely on either explicit or implicit feedback from the user.
- Melanie Gnasa, Armin B. Cremers and Douglas W. Oard, "ISKADOR:
Unified User Modeling for Integrated Searching," in 30th Annual
International ACM SIGIR Conference on Research and Development in
Information Retrieval, Amsterdam, p. 898, 2007. (PDF)
- Penelope Brooks, Khoo Yit Phang, Douglas W. Oard, Ryen W. White,
Rachael Bradley, and Francois Guimbretiere, "Measuring the Utility of
Gaze Detection for Task Modeling: A Preliminary Study," in IUI-2006
Workshop on Intelligent User Interfaces for Intelligence Analysis,
Sydney, Australia, 4 pages, 2006. (PDF)
- Tamer Elsayed and Douglas W. Oard, "On Evaluation of Adaptive
Topic Tracking Systems," in Proceedings of the 28th Annual ACM SIGIR
Conference on Research and Development in Information Retrieval,
poster paper, pp. 597-598, 2005. (PDF)
- Douglas W. Oard, Anton Leuski and Stuart Stubblebine, "Protecting
the Privacy of Observable Behavior in Distributed Recommender
Systems," ACM SIGIR Workshop on Implicit Methods, Toronto, Canada, 4
pages, 2003. (PDF)
- Jinmook Kim and Douglas W. Oard, "Observable Behavior for Implicit
User Modeling: A Framework for User Studies," in Journal of the Korean
Society for Library and Information Science, volume 35, pp. 173-189,
2001. (PDF)
- Douglas W. Oard and Jinmook Kim, "Modeling Information Content
Using Observable Behavior," in Proceedings of the 64th Annual
Conference of the American Society for Information Science and
Technology, pp. 481-488, Washington, 2001. (PDF)
- Jinmook Kim, Douglas W. Oard and Kathleen Romanik, "User Modeling
for Information Access Based on Implicit Feedback," in Third ISKO
Workshop on Information Filtering, pp. 25-37, Paris, 2001. (PDF)
- Jinmook Kim, Douglas W. Oard and Kathleen Romanik. Using implicit
feedback for user modeling in internet and intranet
searching. University of Maryland CLIS Technical Report 00-01,
2000. (PDF)
- Douglas W. Oard and Jinmook Kim, "Implicit Feedback for Recommender
Systems," in AAAI Workshop on Recommender Systems, pp. 81-83, Madison,
WI, 1998. (PDF)
- Douglas W. Oard, Nicholas DeClaris, Bonnie J. Dorr, and Christos
Faloutsos, "High Performance Cognitive and Interactive Text
Filtering," Proceedings of IEEE International Conference on Systems,
Man, and Cybernetics, Volume V, pp. 4398-4403, Vancouver, Canada,
1995. (PDF)
Other Topics
Papers on topics that are new to me will initially show up in this
category, and then ultimately perhaps become the anchor of a category
of their own.
- Dawn Larwrie, Efsun Kayi, Eugene Yang, James Mayfield and Douglas
W. Oard, PLAID SHIRTTT for Large-Scale Streaming Dense Retrieval,
Proceedings of the 47th International ACM SIGIR Conference on
Research and Development in Information Retrieval, Washington DC,
5 pages, 2024. (PDF)
- Nathaniel W. Rollings, Peter A. Rankel and Douglas W. Oard,
Multi-Faceted Question Fusion in the TREC 2022 CrisisFACTS Track,
TREC, 2022. (PDF)
- Xin Qian, Douglas W. Oard and Joel Chan, Conversational
Interaction with Historical Figures: What’s it good for?,
iConference, 17 pages, 2022. (PDF)
- Xin Qian and Douglas W. Oard, Full-Collection Search with Passage
and Document Evidence: Maryland at the TREC 2021 Conversational
Assistance Track, TREC, 9 pages, 2021. (PDF)
- Han-Chin Shing, Chaitanya Shivade, Nima Pourdamghani, Feng Nan,
Philip Resnik, Douglas Oard and Parminder Bhatia, Towards
Clinical Encounter Summarization: Learning to Compose Discharge
Summaries from Prior Notes, Preprint, CoRR abs/2104.13498, 12
pp., 2021. (PDF Preprint)
- Mahmoud F. Sayed and Douglas W. Oard, The University of Maryland
at the TREC 2020 Fair Ranking Track, TREC, 4 pages, 2020. (PDF)
- Han-Chin Shing, Philip Resnik, Douglas W. Oard, A Prioritization
Model for Suicidality Risk Assessment, ACL, pp. 8124-8137,
2020. (PDF)
- Tokinori Suzuki, Daisuke Ikeda, Petra Galuščáková, Douglas
W. Oard, Towards Automatic Cataloging of Image and Textual
Collections with Wikipedia, ICADL, pp. 167-180, Kuala Lumpur,
Malaysia, 2019. (PDF)
- Kristine Rogers and Douglas W. Oard, UMD_CLIP: Using Relevance
Feedback for Find Diverse Documents for TREC Dynamic Domain 2017,
Working Notes of the Twenty-Sixth Text Retrieval Conference,
Gaithersburg, MD, 2017. (PDF)
- Mossaab Bagdouri and Douglas W. Oard, Building Bridges Across
Social Platforms: Answering Twitter Questions with Yahoo!
Answers, 40th International ACM SIGIR Conference on Research and
Development in Information Retrieval, short paper, 4 pages,
Tokyo, Japan, 2017. (PDF)
- Jiaul H. Paik and Douglas W. Oard, A Fixed-Point Method for
Weighting Terms in Verbose Informational Queries, in ACM
International Conference on Information and Knowledge Management,
10 pages, Shanghai, China, 2014. (PDF)
- Tanya Clement, Kari Kraus, Jentery Sayers, Whitney Trettien,
David Tcheng, Loretta Auvil, Tony Borries, Min Wu, Doug Oard, Adi
Hajj-Ahmad, Hui Su, Mary Caton Lingold, Daren Mueller, William
J. Turkel, and Devon Elliott, "Digital Humanities: The
Intersections of Sound and Method," panel abstract in Digital
Humanities Conference, Lausanne, Switzerland, 2014. (PDF)
- Katie Shilton, Michael Kurtz, Bruce Ambacher, Erik Mitchell,
Douglas Oard and Ann Weeks, "Bridging By Design: The Curation and
Management of Digital Assets Specialization at the University of
Maryland," in Proceedings of the Framing the Digital Curation
Curriculum Conference (DigCurV), 5 pages, Florence, Italy,
2013. (PDF)
- Tan Xu, Paul McNamee and Douglas W. Oard, "HLTCOE at TREC 2013:
Temporal Submission," in The Twenty-Second Text Retrieval
Conference, 8 pages, Gaithersberg, MD, 2013. (PDF)
- Douglas W. Oard and Noriko Kando, "Extrinsic Evaluation of Patent
MT, in Fifth International Workshop on Evaluating Information
Access, 5 pages, Tokyo, Japan, 2013. (PDF)
- Keith C. Walker and Douglas W. Oard, "Extending Argument Maps to
Provide Decision Support for Rulemaking," in Hawaii International
Conference on System Sciences, 10 pages, Maui, HI, 2013. (PDF)
- Amalia S. Levi and Douglas W. Oard, "From Personal Narratives to
Collective Memory: Spinning a Web from Oral History," in XVII
International Oral History Association Conference, 31 pages,
Buenos Aires, Argentina, 2012. (PDF)
- Pengyi Zhang, Dagobert Soergel, Judith L. Klavans and Douglas
W. Oard, "Extending Sense-Making Models with Ideas from Cognition and
Learning Theories," in Proceedings of the Annual Conference of the
American Society for Information Science and Technology, 12 pages,
Columbus, 2008. (PDF)
- Tamer Elsayed, Jimmy Lin and Douglas W. Oard, "Pairwise Document
Similarity for Large Collections with MapReduce," Annual Conference of
the Association for Computational Linguistics-Human Language
Technology Conference, Columbus, OH, companion volume, pp. 265-268,
2008. (PDF)
- Ashwin Swaminathan, Yinian Mao, Guan-Ming Su, Hongmei Gou,
Avinash L Varna, Shan He, Min Wu and Douglas W. Oard,
"Confidentiality-Preserving Rank-Ordered Search," ACM Workshop on
Storage, Security and Survivability, Alexandria, VA, 6 pages,
2007. (PDF)
- Kareem Darwish and Douglas W. Oard, "Adapting Morphology for
Arabic Information Retrieval," in Abdelhadi Soudi, Gunter Neumann and
Antal Van den Bosch (eds.), Arabic Computational Morphology:
Knowledge-based and Empirical Methods, Kluwer/Springer Series on Text,
Speech, and Language Technology, 2006. (PDF) (Publisher)
- Wilma Bainbridge, Douglas W. Oard and Ryen White, "An Interface
to Search Human Movements Based on Geographic and Chronological
Metadata," in Proceedings of the 28th Annual ACM SIGIR Conference on
Research and Development in Information Retrieval, poster paper,
pp. 579-580, 2005. (PDF)
- Daqing He, Dina Demner-Fushman, Douglas W. Oard, Damianos
Karakos, and Sanjeev Khudanpur, "Improving Passage Retrieval Using
Interactive Elicitation and Statistical Modeling," in The Thirteenth
Text Retrieval Conference, Gaithersburg, MD, 8 pages, 2004. (PDF)
- Douglas W. Oard, Sheldon Wolk and Anthony Ephremides, "On The
Integrated Scheduling of Hardkill and Softkill Assets Using Dynamic
Programming," Naval Research Laboratory, 1994. (PDF)
Project Pages
When research projects create a project specific page, I will
generally include a link here. Some very old projects are not
included.
- Safely
Searching Among Sensitive Content
- ArQAT: Arabic Question
Answering in Twitter
- Text
Classification for Human Values
- E-Discovery
- Oral History in
the Digital Age
- PopIT
- JIKD
- MALACH
- US/EU
Digital Library Spoken Word Archive Group
Edited Works
- ACM
TALIP Special Issue on the TIDES Surprise Language
- A pair of special issues (June and September 2003) of the ACM
Transactions on Asian Language Information Processing that I
edited. Membership in the ACM Digital Library is needed to
access the articles.
- Team
TIDES Newsletter
- The newsletter for the DARPA Translingual Information Detection
Extraction and Summarization (TIDES) program. I edited the
first two (December 2002 and April 2003) and
helped out with the third (October 2003). The April 2003 and
October 2003 issues contain articles that I wrote about the
surprise language exercises.
Talk Videos
It is becoming more common to post recorded talks. Here are a few
from around the Web that I know of.
- Speaking
with the Past: Novel forms of access to spoken word
collections, Center for Archival Futures Speaker Series,
University of Maryland, College Park, 2022.
- Search the
World: Cross-Language Informaton Retrieval, Search Mastery
Speaker Series, University of Maryland, College Park, 2021
(correction: 2lingual is not owned by Google!)
- Search
Among Sensitive Content, European Conference on Information
Retrieval Tutorial, 2021. This was a jointly presented tutorial
with Graham McDonald, who spoke first.
- IR4All,
Building Search Engines for Everyone, AFIRM 2020 ACM
SIGIR/SIGKDD Africa Summer School on Machine Learning for Data
Mining and Search.
- Search
Among Secrets: Separating the Wheat from the Buzzsaw,
Intelligent Systems Dotoral Program Seminar, UNED, Madrid, Spain,
2014.
- Thinking
Big, 2012. This is a short video on using serch technology
for access to oral history made for the Oral History in the
Digital Age project, 2012.
- Who
'Dat: Identity Resolution in Large Email Collections,
Microsoft Research, 2009.
- Nobody Writes Letters
Anymore, MAVIR Seminar, UNED, Madrid, Spain, 2009.
- Oral History in the Digital Age, Library of
Congress, 2012. This is a joint seminar series talk with Mark
Kornbluh, who spoke first.
- Mandarin-English
Information, Johns Hopkins University, 2000. This is a team
talk led by Helen Meng, who spoke first.
Workshop Pages
These pages provide access to resources (e.g., papers) that were
assembled for workshops and evaluation campaigns that I helped to
organize.
- LREC 2020
Workshop on Cross-Language Search and Summarization of Text and
Speech
- ICAIL 2017
Workshop on Discovery of Electronically Stored Information
- ICAIL 2015
Workshop on Discovery of Electronically Stored Information
- FIRE 2013 Question Answering
for the Spoken Web (QASW) track.
- ICAIL 2013
Workshop on Discovery of Electronically Stored Information
- AAAI-2011
Workshop on Analyzing Microtext
- SIGIR 2011
Information Retrieval for E-Discovery Workshop
- ICAIL 2011
Workshop on Discovery of Electronically Stored Information
- First DC-area IR
Experts (DIRE) Meeting
- TREC Legal Track
- Second
Iternational Workshop on Supporting Search and Sense-making for
Electronically Stored Information in Discovery Proceedings
(DESI II)
- SIGIR 2007 Workshop on
Searching Spontaneous Conversational Speech
- ICAIL 2007
Workshop on Discovery of Electronically Stored Information
(DESI I)
- HLT
2004 Workshop on Interdisciplinary Approaches to Speech
Indexing and Retrieval
- CLEF
Interactive Track (iCLEF)
- TREC-2002
Arabic/English CLIR Track (TREC-2001
also available)
- 2001
Workshop on Evaluation of Interactive Cross-Language
Retrieval
- Summer
2000 Johns Hopkins Workshop on Mandarin-English Information
(MEI)
- 2000
Workshop on Interactive Searching of Foreign Language
Collections
- 1999 Joint ACM
Digital Library/SIGIR Workshop on Multilingual Information Discovery
and AccesS
- AAAI Spring 1997
Symposium on Cross-Language Text and Speech Retrieval
Research Software
Some software that I have developed for my research projects can be
downloaded from a page that describes the
available files. All of this is now quite old.
Research Directories
Community-wide resources on subjects that have been on interest me.
These pages are not actively maintained, so they are best thought of
as a snapshot of what a field looked like long ago near the time I
first built them.
Last modified: Wed Jun 8 08:18:28 2022
Doug Oard
oard@umd.edu