Dan Gusfield

Born: Daniel Mier Gusfield
Alma mater: University of California, Berkeley (BS, PhD); University of California, Los Angeles (MS)
Known for: Stable marriage problem
Awards
Scientific career
Fields: Computer science; computational biology[1]
Institutions: University of California, Davis; Yale University
Thesis: Sensitivity analysis for combinatorial optimization (1980)
Doctoral advisor: Richard Karp[2][3]
Website: web.cs.ucdavis.edu/~gusfield

Daniel Mier Gusfield is an American computer scientist and Distinguished Professor of Computer Science at the University of California, Davis. He is known for his research in combinatorial optimization and computational biology.[1]

Education

Gusfield received his undergraduate degree in computer science at the University of California, Berkeley, in 1973,[citation needed] his Master of Science degree in computer science from the University of California, Los Angeles (UCLA), in 1975,[citation needed] and his Ph.D. in engineering science from Berkeley in 1980;[3] his doctoral advisor was Richard Karp.[2]

Career and research

Gusfield joined the computer science faculty at Yale University in 1980, and left in 1986 to join the Department of Computer Science at UC Davis as an associate professor. He was made Professor of Computer Science in 1992 and served as chair of the department from 2000 to 2004. In 2016 he was named distinguished professor, the highest campus-wide rank at the University of California at Davis.[4]

Gusfield's early work was in combinatorial optimization and its real-world applications. One of his early major results was in network flow, where he presented a simple technique to convert any network flow algorithm to one that builds a Gomory-Hu tree, using only five added lines of pseudo-code.[5] Another contribution was in stable matching, where he contributed to a polynomial-time algorithm[6] for the Egalitarian Stable Marriage Problem, a problem posed by Donald Knuth. Gusfield's work on stable marriage resulted in the book The Stable Marriage Problem: Structure and Algorithms, co-authored with Robert Irving.[7]

Starting in 1984, Gusfield branched out into computational biology, making him one of the first computer scientists to work in this field. His first result in computational biology appeared in the Yale technical report The Steiner-Tree Problem in Phylogeny, which has never been published in a journal. His first published paper in computational biology, "Efficient Algorithms for Inferring Evolutionary History", was initially issued as a technical report in 1988,[8] and was subsequently published in the journal Networks;[9] this paper is now the most cited of Gusfield's papers. Gusfield's 1993 paper on multiple sequence alignment[10] is the first publication indexed in PubMed under "computational biology".

Gusfield had a substantial influence on early computer science research in algorithmic computational biology. He was a member of the United States Department of Energy Human Genome Research Program Panel in 1991, and a member of the steering committee for the Rutgers-Princeton DIMACS center special year on Mathematical Support for Molecular Biology from 1994 to 1995. In 1995, he co-organized the Dagstuhl Conference on Molecular Bioinformatics. He has been a member of the editorial board of the Journal of Computational Biology since its inception in 1996. At the University of California at Davis, he was part of a three-person group that proposed the development of the UC Davis Genomics Center, served as a member of the Genomics Center Steering Committee (1999–2003), and helped to build an interdisciplinary community of biologists and computer scientists working together on genomics problems. In 2004, Gusfield helped propose the IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB), one of the few journals specifically oriented towards computer science and mathematical researchers working in computational biology. He served as its founding editor-in-chief until 2009,[11] and later as chair of the TCBB Steering Committee. More recently, he was an invited visiting scientist at the Simons Institute for the Theory of Computing at UC Berkeley during two of its semester-long programs (first on Evolution, and later on Algorithmic Challenges in Genomics).

Gusfield has also been the PhD advisor or postdoctoral mentor for many well-known computer scientists working in computational biology, including Prof. Oliver Eulenstein (Iowa State University),[citation needed] Dr. Paul Horton (Tokyo),[citation needed] Prof. Ming-Yang Kao (Northwestern University),[citation needed] Prof. John Kececioglu (Arizona),[citation needed] Prof. Yun S. Song (UC Berkeley and Univ. of Pennsylvania),[citation needed] Prof. R. Ravi (CMU), Prof. Jens Stoye (Bielefeld), Prof. Lusheng Wang (City University of Hong Kong),[citation needed] and Prof. Yufeng Wu (U. Connecticut).[citation needed]

Gusfield has made significant contributions to molecular sequence comparison and analysis,[12] phylogenetic tree and phylogenetic network inference,[13] haplotyping in DNA sequences,[14][15][16] the multi-state perfect phylogeny problem using chordal graph theory,[17] and fast algorithms for RNA folding.[18] Since 2014 he has focused on the application and development of integer linear programming in computational biology.

Gusfield is best known for his book Algorithms on Strings, Trees and Sequences: Computer Science and Computational Biology,[19] which provides a comprehensive presentation of the algorithmic foundations of molecular sequence analysis for computer scientists, and has been cited more than 8,000 times.[1] This book has helped to define and develop the intersection of computer science and computational biology. His second book in computational biology is on phylogenetic networks,[20] which are graph-theoretic models of evolution that go beyond the classical tree model to address biological processes such as hybridization, recombination, and horizontal gene transfer.

His third book on computational biology, Integer Linear Programming in Computational and Systems Biology: An Entry-Level Text and Course (Cambridge University Press, 2019, ISBN 9781108421768), explains why and how integer linear programming is a valuable technique for addressing and solving computational problems in biology. It is accompanied by over fifty computer programs that generate the needed inequalities for most of the topics discussed in the book. Subsequently, Gusfield and his students explored the use of satisfiability (SAT) solvers to efficiently solve biological problems where integer programming was not effective.

His fifth book[clarification needed] was published by Cambridge University Press in January 2024. It is entitled Proven Impossible: Elementary Proofs of Profound Impossibility from Arrow, Bell, Chaitin, Gödel, Turing and more. It presents full, rigorous proofs of deep theorems establishing impossibility in a range of topic areas (physics, economics, data science, computer science, mathematics, and logic) using only arithmetic and simple logic. The proofs are built on the simplest, clearest proofs found in the literature for theorems that were originally considered very difficult and for specialists only. The premise of the book is that more modern proofs of these theorems are much simpler and, when presented for non-specialists, can be understood by anyone with no more than a junior-high education and the discipline to follow a rigorous logical argument (pen in hand).[citation needed]

Awards and honors

Gusfield was named Fellow of the Institute of Electrical and Electronics Engineers (IEEE) in 2015[21] for contributions to combinatorial optimization and computational biology. In 2016, Gusfield was elected a Fellow of the International Society for Computational Biology (ISCB)[22] for "his notable contributions to computational biology, particularly his algorithmic work on building evolutionary trees, molecular sequence analysis, optimization problems in population genetics, RNA folding, and integer programming in biology." In 2016, Gusfield was named a distinguished professor at the University of California at Davis, which is the highest campus-wide rank. He was elected an ACM Fellow in 2017.[23]

References

  1. ^ a b c Dan Gusfield publications indexed by Google Scholar
  2. ^ a b Dan Gusfield at the Mathematics Genealogy Project
  3. ^ a b Gusfield, Daniel Mier (1980). Sensitivity analysis for combinatorial optimization (PhD thesis). University of California, Berkeley. OCLC 40134251.
  4. ^ "Dan Gusfield". web.cs.ucdavis.edu. Archived from the original on 17 June 2017. Retrieved 23 January 2019.
  5. ^ Gusfield. Very Simple Methods for All Pairs Network Flow Analysis. SIAM J. Comput. 1990
  6. ^ R.W. Irving, P. Leather, and D. Gusfield, "An efficient algorithm for the 'optimal' stable marriage", Journal of the ACM, Vol. 34 Issue 3, July 1987, Pages 532-543
  7. ^ Gusfield, Dan; Irving, Robert (1989). The stable marriage problem: structure and algorithms. MIT Press. ISBN 0-262-07118-5.
  8. ^ "Computer Science- UC Davis". Cs.ucdavis.edu. 4 October 2018. Retrieved 23 January 2019.
  9. ^ D. Gusfield, "Efficient algorithms for inferring evolutionary trees", Networks 1991 doi:10.1002/net.3230210104
  10. ^ D. Gusfield, "Efficient Methods for Multiple Sequence Alignment with Guaranteed Error Bounds", Bulletin of Mathematical Biology, Vol. 55, No. 1, 141-154, 1993
  11. ^ Dan Gusfield. "Introduction to the IEEE/ACM Transactions on Computational Biology and Bioinformatics" (PDF). Computer.org. Archived from the original (PDF) on 3 April 2015. Retrieved 23 January 2019.
  12. ^ Gusfield and J. Stoye. "Linear time algorithms for finding and representing all the tandem repeats in a string", JCSS, 2004
  13. ^ Gusfield, D., Eddhu, S. and Langley, C., 2004. "Optimal, efficient reconstruction of phylogenetic networks with constrained recombination". Journal of bioinformatics and computational biology, 2(01), pp.173-213.
  14. ^ Gusfield. "Haplotyping as Perfect Phylogeny: Conceptual framework and efficient solutions." Proceedings of RECOMB 2002.
  15. ^ Gusfield, D. (2003). "Haplotype inference by pure parsimony." In Combinatorial Pattern Matching (pp. 144-155). Springer Berlin/Heidelberg.
  16. ^ D. Gusfield, "Inference of haplotypes from samples of diploid populations: complexity and algorithms." Journal of computational biology 8, no. 3 (2001): 305-323.
  17. ^ Gusfield. "The multi-state perfect phylogeny problem with missing and removable data: solutions via integer linear programming and chordal graph theory." Journal of Computational Biology, 2010.
  18. ^ Y. Frid and Gusfield. "A simple, practical, and complete O(n^3/log n)-time algorithm for RNA folding using the Four-Russians speedup". Algorithms for Molecular Biology, 2010
  19. ^ Gusfield, Dan (1997). Algorithms on Strings, Trees and Sequences: Computer Science and Computational Biology. Cambridge University Press. doi:10.1017/CBO9780511574931. ISBN 0-521-58519-8. S2CID 61800864.
  20. ^ Gusfield, Dan (2014). ReCombinatorics: The Algorithmics of Ancestral Recombination Graphs and Explicit Phylogenetic Networks. MIT Press. ISBN 9780262027526.
  21. ^ "2015 elevated fellow" (PDF). IEEE Fellows Directory. Archived from the original (PDF) on March 30, 2015.
  22. ^ "ISCB Fellows". Iscb.org. Retrieved 23 January 2019.
  23. ^ ACM Recognizes 2017 Fellows for Making Transformative Contributions and Advancing Technology in the Digital Age, Association for Computing Machinery, December 11, 2017, retrieved 2017-11-13