This paper will briefly review the history of computational scientific discovery that uses methods from artificial intelligence. (Non-cognitive, non-AI computational work is outside the scope of this paper.) The first part of the paper will concentrate on the work since 1990 (Shrager & Langley, Eds.). The extensive reference list provides a guide for further reading. The second half of the paper will compare three methods used in my own work on reasoning strategies in scientific change. Finally, I will point out advantages and disadvantages of the computational approach from my perspective as a philosopher of science.
"The traditional problem of finding an effective method for formulating true hypotheses that best explain phenomena has been transformed into finding heuristic methods that generate plausible explanations. The problem of giving rules for producing true scientific statements has been replaced by the problem of finding efficient heuristic rules for culling the reasonable candidates for an explanation from an appropriate set of possible candidates" [and finding methods for constructing the candidates] (Discovery as heuristic search in a search space enabled AI methods to be applied to discovery tasks.Buchanan 1985, 110-111).
The first expert system, DENDRAL, was a scientific discovery system.
It formed hypotheses about chemical compounds, given mass-spectrographic data
(Lindsay, Buchanan, Feigenbaum, & Lederberg, 1980;1993).
This was followed by Meta-DENDRAL, which discovered new rules in mass spectrographic analysis,
so as to by-pass the problem of getting rules from experts
(Buchanan & Feigenbaum, 1978).
Although its original algorithm was a computational realization of Lederberg's systematic scan strategy
(
A more historical-cognitive approach was the aim of the work on BACON, which rediscovered various scientific laws by finding patterns in numerical data (Langley, Simon, Bradshaw & Zytkow, 1987). Simon's early work on finding patterns in sequences (Simon & Kotovsky, 1963) was extended in BACON to heuristic search for patterns in numerical data. The most creative of BACON's abilities was the decomposition of relational data to conjecture intrinsic properties in one or more of the objects engaging in the relations. This step went beyond curve-fitting and was based on the metaphysical assumption that an entity's relational properties are caused by its intrinsic properties. In addition to the data-driven tasks modeled in BACON, the group also investigated theory-driven discovery in STAHL. One wonders to what extent these programs model actual cognitive processes of historical scientists, as opposed to finding strategies which are sufficient to reproduce the historical results. As with most simulations, they provide "how possibly" accounts. Using studies of notebook evidence, the KEKADA system (Kulkari & Simon, 1988) modeled reasoning patterns in some discoveries of the biochemist Hans Krebs and focused on responses to surprising experimental results, helping to dispel the mystery of serendipity in discovery.
A seminal conference on computational methods for scientific discovery, whose proceedings were published in 1990 (Shrager & Langley, Eds.) is a useful source for the state to the field at that time.
Buchanan (e.g., Lee et al., 1996) continues work on rule induction applied to various scientific databases. Simon is studying the difficult problems of constructing diagrammatic representations (Larkin & Simon, 1987; Qin & Simon, 1995) and of modeling relations between diagrammatic and verbal reasoning (Tabachneck-Schijf, Leonardo, & Simon, 1996). Zytkow continues to work on various aspects of discovery, including analyzing the components needed for an autonomous discovery agent (e.g., Zytkow, 1995/96) and knowledge discovery in databases (e.g., Zytkow & Zembowicz, 1996).
Much of the current work in computational discovery is occurring within applications to particular sciences. According to Peter Karp, the whole field of bioinformatics is doing computational scientific discovery but there is a gradient from computational discoveries that are not based on AI methods, to computational discoveries that are based on AI methods, to methods with a "cognitive flavor." Not much of the bioinformatics work falls into the last category. However, Karp (et al., 1996) applied reasoning by analogy to predict metabolic pathways in the bacterium, H. influenzae,based on the extensive knowledge base that he and Monica Riley, a bacterial geneticist, have developed for E. coli.
Larry Hunter, a frequent editor of publications in AI and molecular biology (e.g., Hunter 1993), recently informed me that there is a clear success is the application of AI technology to molecular biology: hidden Markov models (HMMs) for molecular sequence analysis. They are being applied to automatically build models of families of nucleotide and amino acid sequences. These models are useful as extremely sensitive classifiers of novel sequences, and also generate multiple sequence alignments of large numbers of sequences in a computationally efficient way. Tools based on this approach are now in wide use in the biological community. A review article is Eddy (1996). Also, AI-based qualitative reasoning technologies have produced several good applications in reasoning about metabolism. Perhaps somewhat surprising is that the work in intelligent systems in molecular biology, for the most part, does not employ discovery methods discussed at the Shrager and Langley (Eds. 1990) conference.
The extensive protein sequence database has provided a challenge for those seeking to find computational methods to predict how the linear amino acids will fold into the secondary and tertiary structures in proteins. The Human Genome Project, which is rapidly producing millions of bases of sequence information about both human and model organism genomes, presents a challenge for computational approaches. Good programs are needed for discovering genes, both coding regions and regulatory regions, in these linear sequences. Current programs are not good at finding introns, intervening sequences between the coding regions of genes. Since the genetic system has some means of detecting introns, one can expect computational systems to be able to discover the signal(s). Knowledge discovery in scientific databases (e.g., Fayyad, Haussler & Stolorz, 1996) promises to be an important area in coming years.
Raul Valdes-Perez's (1994) work in chemistry shows the power of computational systems in doing a systematic search of a hypothesis space, given certain constraints. MECHEM is able to find reaction pathways that chemists have missed.
Buchanan's work on rule discovery in scientific databases and Valdes-Perez's work on systematically conjecturing chemical reaction pathways illustrate the power of design AI systems that aim, not at realistically modeling human cognitive capacities, but using computational methods to circumvent human limitations. Humans are not good at searching massive databases and manipulating sets of rules with many features to make predictions. Cognitive science research has shown that humans have a tendency to focus too rapidly on one hypothesis before doing a systematic search of a hypothesis space. Discovery programs that are more systematic and more thorough than humans are an aid to scientists.
I visited in Joshua Lederberg's Laboratory for Molecular Genetics and Informatics and participated in episodes of anomaly resolution that exemplified some of the revision strategies I had proposed (Darden & Cook 1994). One difficulty with the live-in-the lab approach is that little may happen while you are there; fortunately, I was able to observe some anomaly resolution strategies in use. Although I have attempted to implement some of the strategies in AI programs in order to demonstrate their efficacy (e.g., Darden & Rada, 1988; Kettler & Darden 1993; Darden, 1997), I have returned to historical-philosophical work, testing whether strategies from the Mendelian case apply to molecular biology (Darden, 1995).
Computational discovery work has advantages and disadvantages. Finding an adequate knowledge representation for a scientific case is difficult. Early work attempted to represent the relations between genes and chromosomes in part-whole hierarchies and to implement reasoning via inheritance and upward propagation of properties (Darden & Rada, 1988). A much more fruitful method for knowledge representation in genetics was the functional representation (Josephsons, Eds., 1994) for genetic processes (Darden 1997). Furthermore, when one is designing a computational system to rediscover a historical hypothesis, one must navigate between designing a system that trivially reproduces exactly what one is seeking versus designing a system that is unable to accomplish the task at all. Analogy systems often suffer these problems: either the analog is represented in such a way that the system easily finds it or there are so many analogs that the task becomes impossible (for attempts to navigate between these problems, see Kettler & Darden, 1993; Holyoak & Thagard, 1995).
An advantage of computational methods is the precision and completeness that is required to build a working system. The philosopher-historian may neglect aspects that the programmer must specify in detail if the system is to run. A computational approach forces one to reexamine aspects that may be otherwise neglected. However, this advantage is purchased at the price of much time and effort to implement even small parts of a historical case. Various aspects of human discovery, such as the use of pictorial models (e.g., the beads on a string model for genes on chromosomes), provide substantial difficulties when designing an implementation. On the plus side, once one has invested the effort in building a running system, then there is the fun of running experiments, doing "what-if" analyses, testing alternative strategies.
The approach in our TRANSGENE system (Darden, Moberg, Thadani & Josephson, 1992; Darden, 1997) was also used by Karp (1990) in his GENSIM and HYPGEN systems and points to a fruitful way to design a computational discovery system. A qualitative simulator of biological (or other) processes is built and used to make predictions. Data is supplied to test the predictions and another component of the system compares the prediction with data, detects anomalies, and uses diagnosis/redesign strategies to localize the fault in the simulator and redesign a module to remove the anomaly. Perhaps this architecture may be of use in building future AI systems or perhaps more traditional simulation models might be coupled with a revision system to do diagnosis/redesign for anomaly resolution and model improvement.
It will be exciting to see what computational scientific discovery produces in the coming years.
www.cs.cmu.edu/~sci-disc
Baker, L. M. & Dunbar, K. (1996). "Constraints on the Experimental Design Process in Real-World Science," in G.W. Cottrell (Ed.), Proceedings of the Eighteenth Annual Conference of the Cognitive Science Society,pp. 154-159. Mahwah, NJ: Lawrence Erlbaum Associates.
Bechtel, W. & Richardson, R.C. (1993). Discovering Complexity: Decomposition and Localization as Strategies in Scientific Research. Princeton, N. J.: Princeton University.
Buchanan, B.G. (1982). "Mechanizing the Search for Explanatory Hypotheses," in P. Asquith & T. Nickles (Eds.), PSA 1982V. 2, pp. 129-146. Philosophy of Science Association, East Lansing, Michigan.
Buchanan, B.G. (1985). "Steps Toward Mechanizing Discovery," in K. Schaffner (Ed.), Logic of Discovery and Diagnosis in Medicine. Berkeley: University of California Press.
Buchanan, B.G. & Feigenbaum, E. A. (1978). "DENDRAL and Meta-DENDRAL: Their Applications Dimension," Artificial Intelligence11:5-24.
Darden, L. (1987). "Viewing the History of Science as Compiled Hindsight," AI Magazine8(2):33-41.
Darden, L. (1991). Theory Change in Science: Strategies from Mendelian Genetics. New York: Oxford University Press.
Darden, L. (1995). "Exemplars, Abstractions, and Anomalies: Representations and Theory Change in Mendelian and Molecular Genetics," in J. G. Lennox & G. Wolters (Eds.), Concepts, Theories, and Rationality in the Biological Sciences. Pittsburgh, PA: University of Pittsburgh Press.
Darden, L. (1997). "Anomaly-Driven Theory Redesign: Computational Philosophy of Science Experiments," in T. W. Bynum & J. Moor (Eds.), Digital Phoenix: How Computers are Changing Philosophy.Oxford: Blackwell.
Darden, L. (Ed.) (1996). PSA 1996, Proceedings of the 1996 Biennial Meeting of the Philosophy of Science Association, Part I: Contributed Papers, Philosophy of Science,Supplement to Volume 63, September 1996.
Darden, L. (Ed.) (1997). PSA 1996, Proceedings of the 1996 Biennial Meeting of the Philosophy of Science Association, Part II: Symposium Papers, Philosophy of Science,Supplement to Volume 64, forthcoming September 1997.
Darden, L. & Rada, R. (1988). "Hypothesis Formation Using Part-Whole Interrelations," in David Helman (Ed.), Analogical Reasoning.Dordrecht: Reidel.
Darden, L., Moberg, D., Thadani, S. & Josephson, J. (1992). A Computational Approach to Scientific Theory Revision: The TRANSGENE Experiments (Tech. Rep. 92-LD-TRANSGENE). Columbus, OH: Laboratory for Artificial Intelligence Research, Ohio State University.
Darden, L. & Cook, M. (1994). "Reasoning Strategies in Molecular Biology: Abstractions, Scans and Anomalies," in D. Hull, M. Forbes, & R. M. Burian (Eds.), PSA 1994,v. 2, pp. 179-191. East Lansing, Michigan: Philosophy of Science Association.
Dunbar, K. (1993). "Concept Discovery in a Scientific Domain," Cognitive Science17:397-434.
Dunbar, K. (1995). "How Scientists Really Reason: Scientific Reasoning in Real-World Laboratories," in R. J. Sternberg & J. E. Davidson (Eds.), The Nature of Insight.Cambridge, MA: MIT Press.
Eddy, S. R. (1996). "Hidden Markov Models," ftp://genome.wustl.edu/pub/eddy/hmm-review.ps.Z
Fayyad, U., Djorgovski, S. G., & Weir, N. (1996). "From Digitized Images to Online Catalogs," AI Magazine17 (2): 51-66.
Fayyad, U., Haussler, D. & Stolorz, Z. (1996). "KDD for Science Data Analysis: Issues and Examples," In Proceedings of the Second International Conference on Knowledge Discovery and Data Mining (KDD-96),pp. 50-56. Menlo Park, CA: AAAI Press.
Holyoak, K. J. and Thagard, P. (1995). Mental Leaps: Analogy in Creative Thought. Cambridge, Mass.: MIT Press.
Hunter, L. (Ed.) (1993). Artificial Intelligence and Molecular Biology. Cambridge, MA: MIT Press.
Josephson, J.R. & Josephson, S.G. (Eds.) (1994). Abductive Inference: Computation, Philosophy, Technology. New York: Cambridge University Press.
Karp, P. (1990). "Hypothesis Formation as Design," in J. Shrager and P. Langley (Eds.), Computational Models of Scientific Discovery and Theory Formation. San Mateo, California: Morgan Kaufmann.
Karp, P., Ouzouni, C. & Paley, S. M. (1996). "HinCyc: A Knowledge Base of the Complete Genome and Metabolic Pathways of H. influenzae," in D.J. States, P. Agarwal, T. Gaasterland, L. Hunter & R. Smith (Eds.), ISMB-96: Proceedings of the Fourth International Conference on Intelligent Systems for Molecular Biology, pp. 116-124. Menlo Park, CA: AAAI Press.
Kettler, B. & Darden, L. (1993). "Protein Sequencing Experiment Planning Using Analogy," in L. Hunter, D. Searls, & J. Shavlik (Eds.), ISMB-93, Proceedings of the First International Conference on Intelligent Systems for Molecular Biology, pp. 216-224. Menlo Park, CA: AAAI Press.
Kleiner, S. A. (1993). The Logic of Discovery: A Theory of the Rationality of Scientific Research. Dordrecht: Kluwer.
Kulkarni, D. and Simon, H.A. (1988). "The Processes of Scientific Discovery: The Strategy of Experimentation," Cognitive Science12:139-175.
Langley, P., Simon, H.A., Bradshaw, G.L. & Zytkow, J.M. (1987). Scientific Discovery: Computational Explorations of the Creative Process. Cambridge, Massachusetts: MIT Press.
Larkin, J. H. & Simon, H. A. (1987). "Why a Diagram is (Sometimes) Worth Ten Thousand Words," Cognitive Science11:65-99.
Lederberg, J. (1965). "Signs of Life: Criterion-System of Exobiology," Nature207:9-13.
Lee, Y., Buchanan, B. G., Klopman, G., Dimayuga, M. & Rosenkranz, H. S. (1996). "The Potential of Organ Specific Toxicity for Predicting Rodent Carcinogenicity," Mutation Research358:37-62.
Lindsay, R. K., Buchanan, B. G., Feigenbaum, E. A. & Lederberg, J. (1980). Applications of Artificial Intelligence for Organic Chemistry: The DENDRAL Project. New York: McGraw Hill.
Lindsay, R. K., Buchanan, B. G., Feigenbaum, E. A. & Lederberg, J. (1993). "DENDRAL: A Case Study of the First Expert System for Scientific Hypothesis Formation," Artificial Intelligence61:209-261.
Nersessian, N. J. (1992). "How Do Scientists Think? Capturing the Dynamics of Conceptual Change in Science," in R. N. Giere (Ed.), Cognitive Models of Science. Minneapolis: University of Minnesota Press.
Nickles, T. (Ed.) (1980). Scientific Discovery, Logic and Rationality. Dordrecht: Reidel.
Nickles, T. (1994). "Enlightenment Versus Romantic Models of Creativity in Science--and Beyond," Creativity Research Journal7:277-314.
Popper, Karl (1965). The Logic of Scientific Discovery.New York: Harper Torchbooks.
Qin, Y. & Simon, H.A. (1995). "Imagery and Mental Models," in J. Glasgow, H. H. Narayanan, & B. Chandrasekaran (Eds.), Diagrammatic Reasoning: Computational and Cognitive Perspectives. Menlo Park, CA: AAAI/MIT Press.
Schaffner, K. (1993). Discovery and Explanation in Biology and Medicine. Chicago: University of Chicago Press.
Schunn, C. D. & Dunbar, K. (1996). "Priming, Analogy, and Awareness in Complex Reasoning," Memory & Cognition24:271-284.
Shrager, J. & Langley, P. (Eds.) (1990). Computational Models of Scientific Discovery and Theory Formation. San Mateo, California: Morgan Kaufmann.
Simon, H.A. (1977). Models of Discovery.Dordrecht: Reidel.
Simon, H.A. and Kotovsky, K. (1963). "Human Acquisition of Concepts for Sequential Patterns," Psychological Review70:534-546.
Simon, H.A., Valdes-Perez, R.E. & Sleeman, D.H. (1997, forthcoming). Editorial: "Scientific Discovery and Simplicity of Method," Artificial Intelligence91(2).
Spirtes, P., Glymour, C. & Scheines, R. (1993). Causation, Prediction, and Search.New York: Springer Verlag.
Tabachneck-Schijf, H. J. M., Leonardo, A. M., Simon, H.A. (1996). CaMeRa: A Computational Model of Multiple Presentations(Tech. Rep. CIP 530). Pittsburgh, PA: Carnegie Mellon University.
Valdes-Perez, R.E. (1994). "Algorithm to Infer the Structures of Molecular Formulas within a Reaction Pathway," Journal of Computational Chemistry15:1266-1277.
Valdes-Perez, R.E. (1995). "Machine Discovery Praxis," Foundations of Science1:219-224.
Zytkow, J.M. (1995/96). "Creating a Discoverer: Autonomous Knowledge Seeking Agent," Foundations of Science2:253-283.
Zytkow, J.M. (1996). "Automated Discovery of Empirical Laws," Fundamenta Informaticae27:299-318.
Zytkow, J.M. (Ed.) (1996). Machine Discovery. Kluwer Academic Publishers: Dordrecht.
Zytkow, J.M. & Zembowicz, R. (1996). "Automated Pattern Mining with a Scale Dimension," in Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, pp. 889-894. Menlo Park, CA: AAAI Press.