Association for Computing Machinery
ACM Special Interest Group on Knowledge Discovery & Data Mining

 

 

KDD-2000

Sixth ACM SIGKDD International Conference on
Knowledge Discovery & Data Mining

August 20-23, 2000
Boston, MA, USA

Weaving the Semantic Web:Data Mining for Hypertext

Soumen Chakrabarti

 

Abstract:

With over 800 million pages covering most areas of human endeavor, the World-wide Web is a fertile ground for data mining research to make a difference to the effectiveness of information search.Today, Web surfers access the Web through two dominant interfaces: clicking on hyperlinks and searching via keyword queries.This process is often tentative and unsatisfactory.Better support is needed for expressing one's information need and dealing with a search result in more structured ways than available now.Data mining and machine learning have significant roles to play towards this end.

 

In this tutorial we will survey recent advances in learning and mining problems related to hypertext in general and the Web in particular. We will review the continuum of supervised to semi-supervised to unsupervised learning problems, highlight the specific challenges which distinguish data mining in the hypertext domain from data mining in the context of data warehouses, and summarize the key areas of recent and ongoing research.

 

Biography of Organizer:

Soumen Chakrabarti received his B.Tech in Computer Science from the Indian Institute of Technology, Kharagpur, in 1991 and his M.S. and Ph.D. in Computer Science from the University of California, Berkeley in 1992 and 1996.At Berkeley he worked on compilers and runtime systems for running scalable parallel scientific software on message passing multiprocessors.He was a Research Staff Member at IBM Almaden Research Center between 1996 and 1999. He is currently an Assistant Professor in the Department of Computer Science and Engineering at the Indian Institute of Technology, Bombay.

 

His current research interests include hypertext information retrieval, web analysis and data mining.He designed the Focused Crawler and part of the Clever

search engine, filing several patents in the process.

His work on focused crawling got the Best Paper award at the8th International World Wide Web Conference in 1999.He has served on the program committees for KDD 1998, KDD 1999, and WWW 2000.

KDD-2000 Home