KDD 2005 - Research Track Accepted Papers: Aug 21-24, Chicago, IL. USA

Instructions for Authors

POSTER SPECIFICATIONS The poster boards available at the conference will be: 4 ft x 8 ft. Each poster board will hold TWO actual posters (i.e., papers). Each poster will have an area of 3.75 ft x 3 ft available for its presentation. You are free to use a single large sheet of paper or a set of slides, as long as you restrict yourself to the above area in the board. Poster space will be pre-assigned (you will know about your location on-site) and the space will be available for setup a few hours before the poster session.

In addition, each poster will be presented for 2 minutes in the poster session on Monday August 22nd from 3:30-5:45. You will have 2 minutes to present your poster slides in a moderated format. You will receive instructions via email on where to upload your slides so that the moderator will have them ready for you in the Poster Session.

Any questions about style, formatting or content should go to:

Any technical difficulties should be reported to: Michal Sabala,

Equipment Note
Each room for workshops, tutorials, research track and industrial track, will be equipped with microphones, speakers and a data projector. Presenters will be able to test their laptops with the projectors 20 minutes before their scheduled session.

Research Track Accepted Papers

Full papers:

#107 Improving Discriminative Sequential Learning with Rare-but-Important Associations
Authors: Phan Xuan-Hieu, Nguyen Le-Minh, Ho Tu-Bao, Horiguchi Susumu
#113 Reasoning about Sets using Redescription Mining
Authors: Mohammed Zaki, Naren Ramakrishnan
#164 Robust Boosting and its Relation to Bagging
Authors: Saharon Rosset
#172 SVM Selective Sampling for Ranking with Application to Data Retrieval
Authors: Hwanjo Yu
#189 Fast Discovery of Unexpected Patterns in Data Relative to a Bayesian Network
Authors: Szymon Jaroszewicz, Tobias Scheffer
#211 Probabilistic Workflow Mining
Authors: Ricardo Silva, Jiji Zhang, James G. Shanahan
#216 Mining Images on Semantics via Statistical Learning
Authors: Jianping Fan Fan, Mohand-Said Hacid
#234 Streaming Feature Selection using alpha investing
Authors: Jing Zhou, Dean Foster, Robert Stine, Lyle Ungar
#242 On the Use of Linear Programming for Unsupervised Text Classification
Authors: Mark Sandler
#248 Consistent Bipartite Graph Co-Partitioning for Star-Structured High-Order Heterogeneous Data Co-Clustering
Authors: Bin GAO, Tie-Yan LIU, Xin Zheng, Qian-sheng Chen, Wei-Ying Ma
#261 Detection of Emerging Space-Time Clusters
Authors: Daniel Neill, Andrew Moore, Maheshkumar Sabhnani, Kenny Daniel
#283 Mining tree queries in a graph
Authors: Bart Goethals, Eveline Hoekx, Jan Van den Bussche
#287 Rule extraction from Hyperplane-based Classifiers
Authors: Glenn Fung, Sathyakama Sandilya, Bharat Rao
#291 Graphs over time: densification laws, shrinking diameters and possible explanations
Authors: Jure Leskovec, Jon Kleinberg, Christos Faloutsos
#292 Sampling-Based Sequential Subgroup Mining
Authors: Martin Scholz
#309 Nomograms for Visualizing Support Vector Machines
Authors: Aleks Jakulin, Martin Mozina, Janez Demsar, Ivan Bratko, Blaz Zupan
#330 Cross-Relational Clustering with User's Guidance
Authors: Xiaoxin Yin, Jiawei Han, Philip Yu
#333 A General Model for Clustering Binary Data
Authors: Tao Li
#339 A Bayesian Network Classifier with Inverse Tree Structure for Voxelwise Magnetic Resonance Image Analysis
Authors: Rong Chen, Edward Herskovits
#365 A New Scheme on Privacy-Preserving Data Classification
Authors: Nan Zhang, Shengquan Wang, Wei Zhao
#372 Web Object Indexing Using Domain Knowledge
Authors: Muyuan Wang, Zhiwei Li, Lie Lu, Wei-Ying Ma, Naiyao Zhang
#373 Query Chains: Learning to Rank From Implicit Feedback
Authors: Filip Radlinski, Thorsten Joachims
#391 Anonymity-Preserving Data Collection
Authors: zhiqiang yang, Sheng Zhong, Rebecca N. Wright
#392 The Predictive Power of Online Chatter
Authors: Daniel Gruhl, R. Guha, Ravi Kumar, Jasmine Novak, Andrew Tomkins
#397 On Mining Cross-Graph Quasi-Cliques
Authors: Jian Pei, Daxin Jiang, Aidong Zhang
#410 Discovering Evolutionary Theme Patterns from Text - An Exploration of Temporal Text Mining
Authors: Qiaozhu Mei, ChengXiang Zhai
#411 Finding partial orders from unordered 0-1 data
Authors: Antti Ukkonen, Mikael Fortelius, Heikki Mannila
#415 Simple and Effective Visual Models for Gene Expression Cancer Diagnostics
Authors: Gregor Leban, Minca Mramor, Ivan Bratko, Blaz Zupan
#448 Dimension Induced Clustering
Authors: Aris Gionis, Alexander Hinneburg, Spiros Papadimitriou, Panayiotis Tsaparas
#479 Local sparsity control for Naive Bayes with extreme misclassification costs
Authors: Aleksander Kolcz
#484 Summarizing Itemset Patterns: A Profile-Based Approach
Authors: Xifeng Yan, Hong Cheng, Dong Xin, Jiawei Han
#513 Mining Closed Relational Graphs with Connectivity Constraints
Authors: Xifeng Yan, X. Jasmine Zhou, Jiawei Han
#537 A Multiple Tree Algorithm for the Efficient Association of Asteroid Observations
Authors: Jeremy Kubica, Andrew Moore, Andrew Connolly, Robert Jedicke
#541 A distributed learning framework based on probabilistic models
Authors: Srujana Merugu, Joydeep Ghosh
#565 Variable Latent Semantic Indexing
Authors: Anirban Dasgupta, Ravi Kumar, Prabhakar Raghavan, Andrew Tomkins
#594 Feature Bagging for Outlier Detection
Authors: Aleksandar Lazarevic, Vipin Kumar
#599 Wavelet Synopsis for Data Streams: Minimizing Non-Euclidean Error
Authors: Sudipto Guha, Boulos Harb
#622 Non-Redundant Clustering with Conditional Ensembles
Authors: David Gondek, Thomas Hofmann
#636 Combining Partitions by Probabilistic Label Aggregation
Authors: Tilman Lange, Joachim Buhmann
#661 Combining Email Models for False Positive Reduction
Authors: Shlomo Hershkop, Salvatore Stolfo

Short (poster) papers:

#112 CLICKS: An Effective Algorithm for Mining Subspace Clusters in Categorical Datasets
Authors: Mohammed Zaki, Markus Peters, Ira Assent, Thomas Seidl
#138 Web Mining from Competitors Websites
Authors: Xin Chen, Yi-fang Wu
#141 Evaluating Similarity Measures: A Large Scale Study in the Orkut Social Network
Authors: Ellen Spertus, Mehran Sahami, Orkut Buyukkokten
#152 Key semantics extraction by dependency tree mining
Authors: Satoshi Morinaga, Hiroki Arimura, Takahiro Ikeda, Yosuke Sakao, Susumu Akamine
#154 Regression Error Characteristic Surfaces
Authors: Luis Torgo
#219 Privacy-Preserving Distributed k-means Clustering over Arbitrarily Partitioned Data
Authors: Geetha Jagannathan, Rebecca N. Wright
#241 LIPED: HMM-based Life Profiles for Adaptive Event Detection
Authors: Chien Chin Chen, Meng Chang Chen, Ming-Syan Chen
#254 Estimating missed actual positives using independent classifiers
Authors: Sandeep Mane, Jaideep Srivasta, San-Yih Hwang
#274 A Hybrid Unsupervised Approach for Document Clustering
Authors: Mihai Surdeanu, Jordi Turmo, Alicia Ageno
#276 Mining in Anticipation: Proactive-Reactive Prediction for Data Streams
Authors: Ying Yang, Xindong Wu, Xingquan Zhu
#289 Optimizing time series discretization for knowledge discovery
Authors: Fabian M�rchen, Alfred Ultsch
#304 A Generalized Framework For Mining Spatio-temporal Patterns in Scientific Data
Authors: hui yang, sameep mehta, Srinivasan Parthasarathy
#315 Density-Based Clustering of Uncertain Data
Authors: Martin Pfeifle, Hans-Peter Kriegel
#320 Information Retrieval Based on Collaborative Filtering With Latent Interest Semantic Map
Authors: Noriaki Kawamae
#340 Parallel Mining of Closed Sequential Patterns
Authors: Shengnan Cong, Jiawei Han, David Padua
#342 Determining an Author's Native Language by Mining a Text for Errors
Authors: Moshe Koppel, Jonathan Schler, Kfir Zigdon
#343 Pattern Lattice Traversal by Selective Jumps
Authors: Osmar Zaiane, Mohammad El Hajj
#354 Adversarial Learning
Authors: Daniel Lowd, Chris Meek
#368 Co-clustering by Block Value Decomposition
Authors: Bo Long, Zhongfei Zhang, Philip Yu
#377 Application of kernels to link analysis
Authors: Takahiko Ito, Masashi Shimbo, Taku Kudo, Yuji Matsumoto
#380 Model-based Overlapping Clustering
Authors: Arindam Banerjee, Chase Krumpelman, Sugato Basu, Raymond Mooney, Joydeep Ghosh
#385 Building Connected Neighborhood Graphs for Isometric Data Embedding
Authors: Li Yang
#399 Integration of Profile Hidden Markov Model Output into Association Rule Mining
Authors: Christopher Besemann, Anne Denton
#403 Towards Exploratory Test Instance Specific Algorithms for High Dimensional Classification
Authors: Charu Aggarwal
#440 Simultaneous Optimization of Complex Mining Tasks with a Knowledgeable Cache
Authors: Ruoming Jin, Kaushik Sinha, Gagan Agrawal
#441 Disovering Frequent Topological Structures from Graph Datasets
Authors: Ruoming Jin, Chao Wang, Dmitrii Polshakov, Srinivasan Parthasarathy, Gagan Agrawal
#446 Efficient Computations via Scalable Sparse Kernel Partial Least Squares and Boosted Latent Features
Authors: Michinari Momma
#470 Scalable Discovery of hidden Emails from Large Folders
Authors: giuseppe carenini, Raymond Ng, Xiaodong Zhou
#498 Formulating Distance Functions via the Kernel Trick
Authors: Gang Wu, Navneet Panda, Edward Chang
#518 Fast Window Correlations Over Uncooperative Time Series
Authors: Xiaojian Zhao, Dennis Shasha, Richard Cole
#522 A Maximum Entropy Web Recommendation System: Combining Collaborative and Content Features
Authors: Xin Jin, Yanzan Zhou, Bamshad Mobasher
#530 Mining Comparable Bilingual Text Corpora for Cross-Language Information Integration
Authors: Tao Tao, ChengXiang Zhai
#531 Creating social networks to improve peer-to-peer networking
Authors: Andrew Fast, David Jensen, Brian Neil Levine
#533 A Fast Kernel-based Multilevel Algorithm for Graph Clustering
Authors: Brian Kulis, Yuqiang Guan, Inderjit Dhillon
#564 Unweaving a Web of Documents
Authors: R. Guha, Ravi Kumar, D. Sivakumar, Ravi Sundaram
#568 Maximal Boasting
Authors: Cinda Heeren, Leonard Pitt

Webmaster: Michal Sabala