|
|
KDD-2000 Sixth ACM SIGKDD International Conference on Weaving the Semantic
Web:� Data Mining for Hypertext
Soumen Chakrabarti
Abstract: With over 800 million pages covering most areas of human endeavor, the World-wide Web is a fertile ground for data mining research to make a difference to the effectiveness of information search.� Today, Web surfers access the Web through two dominant interfaces: clicking on hyperlinks and searching via keyword queries.� This process is often tentative and unsatisfactory.� Better support is needed for expressing one's information need and dealing with a search result in more structured ways than available now.� Data mining and machine learning have significant roles to play towards this end. In this tutorial we will survey recent advances in
learning and mining problems related to hypertext in general and the Web in
particular. We will review the continuum of supervised to semi-supervised to
unsupervised learning problems, highlight the specific challenges which
distinguish data mining in the hypertext domain from data mining in the
context of data warehouses, and summarize the key areas of recent and ongoing
research. Biography
of Organizer: Soumen Chakrabarti
received his B.Tech in Computer Science from the Indian Institute of
Technology, Kharagpur, in 1991 and his M.S. and Ph.D. in Computer Science
from the University of California, Berkeley in 1992 and 1996.� At Berkeley he worked on compilers and
runtime systems for running scalable parallel scientific software on message
passing multiprocessors.� He was a
Research Staff Member at IBM Almaden Research Center between 1996 and 1999.
He is currently an Assistant Professor in the Department of Computer Science
and Engineering at the Indian Institute of Technology, Bombay.� His current research
interests include hypertext information retrieval, web analysis and data
mining.� He designed the Focused Crawler and part
of the Clever search engine, filing
several patents in the process. His work on focused crawling got the Best Paper award
at the� 8th International World Wide
Web Conference in 1999.� He has served
on the program committees for KDD 1998, KDD 1999, and WWW 2000. |
|