Picture of William Cohen

William W. Cohen

Associate Research Professor, Machine Learning Department, Carnegie Mellon University

Member of the Language Technology Institute; the joint CMU-Pitt Program in Computational Biology; the Lane Center for Computational Biology, and the Center for Bioimage Informatics.

[ Bio | Teaching | Projects | Publications (recent, all) | Software | Datasets | Talks | Colleagues | Blog | Contact Info | Other Stuff ]

Announcements

From June 2008 through June 2009, I am on sabbatical at Google. Expect delays in email response time to my CMU addresses.

Biography

William Cohen received his bachelor's degree in Computer Science from Duke University in 1984, and a PhD in Computer Science from Rutgers University in 1990. From 1990 to 2000 Dr. Cohen worked at AT&T Bell Labs and later AT&T Labs-Research, and from April 2000 to May 2002 Dr. Cohen worked at Whizbang Labs, a company specializing in extracting information from the web. Dr. Cohen is member of the board of the International Machine Learning Society, is an Associate Editor for the journal Artificial Intelligence, and has served as an action editor for the Journal of Machine Learning Research, the journal Machine Learning and the Journal of Artificial Intelligence Research. He was Program Co-Chair of the 2006 International Machine Learning Conference and Co-Chair of the 1994 International Machine Learning Conference, and has served on more than 20 program committees or advisory committees.

Dr. Cohen is also General Chair for the 2008 International Machine Learning Conference, which will be held July 6-9 at the University of Helsinki, in Finland.

Dr. Cohen's research interests include information integration and machine learning, particularly information extraction, text categorization and learning from large datasets. He holds seven patents related to learning, discovery, information retrieval, and data integration, and is the author of more than 100 publications.

Projects

I'm currently involved with:

Software and demos

Demos: Software:

Datasets

The following datasets are available for anyone to use for research purposes:

Recent talks and presentations

Teaching

Publications

Recent papers I'm keeping in HTML or PDF (which requires Adobe Acrobat Reader to view). Older papers are mostly in Postscript. For Windows, I use the GSView reader for postscript. Most of these papers are viewable in several formats in ResearchIndex.

Students and other colleagues

Contact Info

William Cohen
Associate Research Professor
Machine Learning Department
Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA 15213
Wean Hall 5317 / 412-268-7664 (voice) / 412-268-2005 (fax)
Assistant: Sharon Cavlovich, sharonw+@cs.cmu.edu, 412-268-5196

Official CMU Contact Info

My preferred email address is: wcohen AT cs DOT cmu DOT edu

Other Stuff

Obscure fact: two of my papers made the Citeseer's list of most-cited machine learning papers.

For those many friends whose research I have built on, be warned. My full name, "William Weston Cohen", is an anagram of the phrase "I now cite shallow men". (From Sara Cohen - no relation! - comes this warning: "Women's rights activists would probably request you to use the following anagram instead: 'I shall now cite women'".)

I am often praised for my highly artistic and functional web site designs. An example is the site for SC Indexing, a professional book indexer. However, I accept few clients - this one happens to be my wife.

Through my advisor, Alex Borgida, I can trace my "academic lineage" back to luminaries like Leibniz and Alfred Whitehead.

Poetry anyone?