Instructions for Authors

POSTER SPECIFICATIONS The poster boards available at the conference will be: 4 ft x 8 ft. Each poster board will hold TWO actual posters (i.e., papers). Each poster will have an area of 3.75 ft x 3 ft available for its presentation. You are free to use a single large sheet of paper or a set of slides, as long as you restrict yourself to the above area in the board. Poster space will be pre-assigned (you will know about your location on-site) and the space will be available for setup a few hours before the poster session.

In addition, each poster will be presented for 2 minutes in the poster session on Monday August 22nd from 3:30-5:45. You will have 2 minutes to present your poster slides in a moderated format. You will receive instructions via email on where to upload your slides so that the moderator will have them ready for you in the Poster Session.

Any questions about style, formatting or content should go to:

Any technical difficulties should be reported to: Michal Sabala,

Equipment Note
Each room for workshops, tutorials, research track and industrial track, will be equipped with microphones, speakers and a data projector. Presenters will be able to test their laptops with the projectors 20 minutes before their scheduled session.

Research Track Accepted Papers

Full papers:

  1. #107 Improving Discriminative Sequential Learning with Rare-but-Important Associations
    Authors: Phan Xuan-Hieu, Nguyen Le-Minh, Ho Tu-Bao, Horiguchi Susumu
  2. #113 Reasoning about Sets using Redescription Mining
    Authors: Mohammed Zaki, Naren Ramakrishnan
  3. #164 Robust Boosting and its Relation to Bagging
    Authors: Saharon Rosset
  4. #172 SVM Selective Sampling for Ranking with Application to Data Retrieval
    Authors: Hwanjo Yu
  5. #189 Fast Discovery of Unexpected Patterns in Data Relative to a Bayesian Network
    Authors: Szymon Jaroszewicz, Tobias Scheffer
  6. #211 Probabilistic Workflow Mining
    Authors: Ricardo Silva, Jiji Zhang, James G. Shanahan
  7. #216 Mining Images on Semantics via Statistical Learning
    Authors: Jianping Fan Fan, Mohand-Said Hacid
  8. #234 Streaming Feature Selection using alpha investing
    Authors: Jing Zhou, Dean Foster, Robert Stine, Lyle Ungar
  9. #242 On the Use of Linear Programming for Unsupervised Text Classification
    Authors: Mark Sandler
  10. #248 Consistent Bipartite Graph Co-Partitioning for Star-Structured High-Order Heterogeneous Data Co-Clustering
    Authors: Bin GAO, Tie-Yan LIU, Xin Zheng, Qian-sheng Chen, Wei-Ying Ma
  11. #261 Detection of Emerging Space-Time Clusters
    Authors: Daniel Neill, Andrew Moore, Maheshkumar Sabhnani, Kenny Daniel
  12. #283 Mining tree queries in a graph
    Authors: Bart Goethals, Eveline Hoekx, Jan Van den Bussche
  13. #287 Rule extraction from Hyperplane-based Classifiers
    Authors: Glenn Fung, Sathyakama Sandilya, Bharat Rao
  14. #291 Graphs over time: densification laws, shrinking diameters and possible explanations
    Authors: Jure Leskovec, Jon Kleinberg, Christos Faloutsos
  15. #292 Sampling-Based Sequential Subgroup Mining
    Authors: Martin Scholz
  16. #309 Nomograms for Visualizing Support Vector Machines
    Authors: Aleks Jakulin, Martin Mozina, Janez Demsar, Ivan Bratko, Blaz Zupan
  17. #330 Cross-Relational Clustering with User's Guidance
    Authors: Xiaoxin Yin, Jiawei Han, Philip Yu
  18. #333 A General Model for Clustering Binary Data
    Authors: Tao Li
  19. #339 A Bayesian Network Classifier with Inverse Tree Structure for Voxelwise Magnetic Resonance Image Analysis
    Authors: Rong Chen, Edward Herskovits
  20. #365 A New Scheme on Privacy-Preserving Data Classification
    Authors: Nan Zhang, Shengquan Wang, Wei Zhao
  21. #372 Web Object Indexing Using Domain Knowledge
    Authors: Muyuan Wang, Zhiwei Li, Lie Lu, Wei-Ying Ma, Naiyao Zhang
  22. #373 Query Chains: Learning to Rank From Implicit Feedback
    Authors: Filip Radlinski, Thorsten Joachims
  23. #391 Anonymity-Preserving Data Collection
    Authors: zhiqiang yang, Sheng Zhong, Rebecca N. Wright
  24. #392 The Predictive Power of Online Chatter
    Authors: Daniel Gruhl, R. Guha, Ravi Kumar, Jasmine Novak, Andrew Tomkins
  25. #397 On Mining Cross-Graph Quasi-Cliques
    Authors: Jian Pei, Daxin Jiang, Aidong Zhang
  26. #410 Discovering Evolutionary Theme Patterns from Text - An Exploration of Temporal Text Mining
    Authors: Qiaozhu Mei, ChengXiang Zhai
  27. #411 Finding partial orders from unordered 0-1 data
    Authors: Antti Ukkonen, Mikael Fortelius, Heikki Mannila
  28. #415 Simple and Effective Visual Models for Gene Expression Cancer Diagnostics
    Authors: Gregor Leban, Minca Mramor, Ivan Bratko, Blaz Zupan
  29. #448 Dimension Induced Clustering
    Authors: Aris Gionis, Alexander Hinneburg, Spiros Papadimitriou, Panayiotis Tsaparas
  30. #479 Local sparsity control for Naive Bayes with extreme misclassification costs
    Authors: Aleksander Kolcz
  31. #484 Summarizing Itemset Patterns: A Profile-Based Approach
    Authors: Xifeng Yan, Hong Cheng, Dong Xin, Jiawei Han
  32. #513 Mining Closed Relational Graphs with Connectivity Constraints
    Authors: Xifeng Yan, X. Jasmine Zhou, Jiawei Han
  33. #537 A Multiple Tree Algorithm for the Efficient Association of Asteroid Observations
    Authors: Jeremy Kubica, Andrew Moore, Andrew Connolly, Robert Jedicke
  34. #541 A distributed learning framework based on probabilistic models
    Authors: Srujana Merugu, Joydeep Ghosh
  35. #565 Variable Latent Semantic Indexing
    Authors: Anirban Dasgupta, Ravi Kumar, Prabhakar Raghavan, Andrew Tomkins
  36. #594 Feature Bagging for Outlier Detection
    Authors: Aleksandar Lazarevic, Vipin Kumar
  37. #599 Wavelet Synopsis for Data Streams: Minimizing Non-Euclidean Error
    Authors: Sudipto Guha, Boulos Harb
  38. #622 Non-Redundant Clustering with Conditional Ensembles
    Authors: David Gondek, Thomas Hofmann
  39. #636 Combining Partitions by Probabilistic Label Aggregation
    Authors: Tilman Lange, Joachim Buhmann
  40. #661 Combining Email Models for False Positive Reduction
    Authors: Shlomo Hershkop, Salvatore Stolfo

Short (poster) papers:

  1. #112 CLICKS: An Effective Algorithm for Mining Subspace Clusters in Categorical Datasets
    Authors: Mohammed Zaki, Markus Peters, Ira Assent, Thomas Seidl
  2. #138 Web Mining from Competitors Websites
    Authors: Xin Chen, Yi-fang Wu
  3. #141 Evaluating Similarity Measures: A Large Scale Study in the Orkut Social Network
    Authors: Ellen Spertus, Mehran Sahami, Orkut Buyukkokten
  4. #152 Key semantics extraction by dependency tree mining
    Authors: Satoshi Morinaga, Hiroki Arimura, Takahiro Ikeda, Yosuke Sakao, Susumu Akamine
  5. #154 Regression Error Characteristic Surfaces
    Authors: Luis Torgo
  6. #219 Privacy-Preserving Distributed k-means Clustering over Arbitrarily Partitioned Data
    Authors: Geetha Jagannathan, Rebecca N. Wright
  7. #241 LIPED: HMM-based Life Profiles for Adaptive Event Detection
    Authors: Chien Chin Chen, Meng Chang Chen, Ming-Syan Chen
  8. #254 Estimating missed actual positives using independent classifiers
    Authors: Sandeep Mane, Jaideep Srivasta, San-Yih Hwang
  9. #274 A Hybrid Unsupervised Approach for Document Clustering
    Authors: Mihai Surdeanu, Jordi Turmo, Alicia Ageno
  10. #276 Mining in Anticipation: Proactive-Reactive Prediction for Data Streams
    Authors: Ying Yang, Xindong Wu, Xingquan Zhu
  11. #289 Optimizing time series discretization for knowledge discovery
    Authors: Fabian M�rchen, Alfred Ultsch
  12. #304 A Generalized Framework For Mining Spatio-temporal Patterns in Scientific Data
    Authors: hui yang, sameep mehta, Srinivasan Parthasarathy
  13. #315 Density-Based Clustering of Uncertain Data
    Authors: Martin Pfeifle, Hans-Peter Kriegel
  14. #320 Information Retrieval Based on Collaborative Filtering With Latent Interest Semantic Map
    Authors: Noriaki Kawamae
  15. #340 Parallel Mining of Closed Sequential Patterns
    Authors: Shengnan Cong, Jiawei Han, David Padua
  16. #342 Determining an Author's Native Language by Mining a Text for Errors
    Authors: Moshe Koppel, Jonathan Schler, Kfir Zigdon
  17. #343 Pattern Lattice Traversal by Selective Jumps
    Authors: Osmar Zaiane, Mohammad El Hajj
  18. #354 Adversarial Learning
    Authors: Daniel Lowd, Chris Meek
  19. #368 Co-clustering by Block Value Decomposition
    Authors: Bo Long, Zhongfei Zhang, Philip Yu
  20. #377 Application of kernels to link analysis
    Authors: Takahiko Ito, Masashi Shimbo, Taku Kudo, Yuji Matsumoto
  21. #380 Model-based Overlapping Clustering
    Authors: Arindam Banerjee, Chase Krumpelman, Sugato Basu, Raymond Mooney, Joydeep Ghosh
  22. #385 Building Connected Neighborhood Graphs for Isometric Data Embedding
    Authors: Li Yang
  23. #399 Integration of Profile Hidden Markov Model Output into Association Rule Mining
    Authors: Christopher Besemann, Anne Denton
  24. #403 Towards Exploratory Test Instance Specific Algorithms for High Dimensional Classification
    Authors: Charu Aggarwal
  25. #440 Simultaneous Optimization of Complex Mining Tasks with a Knowledgeable Cache
    Authors: Ruoming Jin, Kaushik Sinha, Gagan Agrawal
  26. #441 Disovering Frequent Topological Structures from Graph Datasets
    Authors: Ruoming Jin, Chao Wang, Dmitrii Polshakov, Srinivasan Parthasarathy, Gagan Agrawal
  27. #446 Efficient Computations via Scalable Sparse Kernel Partial Least Squares and Boosted Latent Features
    Authors: Michinari Momma
  28. #470 Scalable Discovery of hidden Emails from Large Folders
    Authors: giuseppe carenini, Raymond Ng, Xiaodong Zhou
  29. #498 Formulating Distance Functions via the Kernel Trick
    Authors: Gang Wu, Navneet Panda, Edward Chang
  30. #518 Fast Window Correlations Over Uncooperative Time Series
    Authors: Xiaojian Zhao, Dennis Shasha, Richard Cole
  31. #522 A Maximum Entropy Web Recommendation System: Combining Collaborative and Content Features
    Authors: Xin Jin, Yanzan Zhou, Bamshad Mobasher
  32. #530 Mining Comparable Bilingual Text Corpora for Cross-Language Information Integration
    Authors: Tao Tao, ChengXiang Zhai
  33. #531 Creating social networks to improve peer-to-peer networking
    Authors: Andrew Fast, David Jensen, Brian Neil Levine
  34. #533 A Fast Kernel-based Multilevel Algorithm for Graph Clustering
    Authors: Brian Kulis, Yuqiang Guan, Inderjit Dhillon
  35. #564 Unweaving a Web of Documents
    Authors: R. Guha, Ravi Kumar, D. Sivakumar, Ravi Sundaram
  36. #568 Maximal Boasting
    Authors: Cinda Heeren, Leonard Pitt
Webmaster: Michal Sabala