KDD-2003
The Ninth ACM SIGKDD International Conference on
Knowledge Discovery and Data Mining
Washington, DC, USA August 24 - 27, 2003

HOME
Organization
Program
For Authors
KDD-Cup
Registration
Hotel


KDD 2003 >> Program >> Accepted Papers

ACM SIGKDD

KDD 2003 - Accepted Papers

Full Papers (Research Track)

Full Papers (Industrial/Government Track)

Poster Papers (Research Track)

Poster Papers (Industrial/Government Track)


 
FULL PAPERS (Research Track)

  • #117 Efficient Elastic Burst Detection in Data Streams
    Authors: Yunyue Zhu, Dennis Shasha

  • #120 Mining Distance-Based Outliers in Near Linear Time with Randomization and a Simple Pruning Rule
    Authors: Stephen Bay, Mark Schwabacher

  • #146 Fragments of Order
    Authors: Aristides Gionis, Teija Kujala, Heikki Mannila

  • #151 CloseGraph: Mining Closed Frequent Graph Patterns
    Authors: Xifeng Yan, Jiawei Han

  • #153 Proximus: A Framework for Analyzing Very High Dimensional Discrete-Attributed Datasets
    Authors: Mehmet Koyuturk, Ananth Grama

  • #170 Screening and Interpreting Multi-item Associations Based on Loglinear Modeling
    Authors: Xintao Wu, Daniel Barbara, Yong Ye

  • #178 XRules: An Effective Structural Classifier for XML Data
    Authors: Mohammed Zaki, Charu Aggarwal

  • #180 Fast Vertical Mining Using Diffsets
    Authors: Mohammed Zaki, Karam Gouda

  • #194 Extracting Semantics from Datacubes using Cube Transversals and Closures
    Authors: Alain Casali, Rosine Cicchetti, Lotfi Lakhal

  • #204 Generating English Summaries of Time Series Data Using the Gricean Maxims
    Authors: Somayajulu Sripada, Ehud Reiter, Jim Hunter, Jin Yu

  • #213 Visualizing Changes in the Inherent Structure of Data for Exploratory Feature Extraction
    Authors: Elias Pampalk, Werner Goebl, Gerhard Widmer

  • #264 Towards Systematic Design of Distance Functions for Data Mining Applications
    Authors: Charu Aggarwal

  • #274 Classifying Large Data Sets Using SVM with Hierarchical Clusters
    Authors: Hwanjo Yu, Jiong Yang, Jiawei Han

  • #282 CLOSET+: Searching for the Best Strategies for Mining Frequent Closed Itemsets
    Authors: Jianyong Wang, Jiawei Han, Jian Pei

  • #287 Aggregation-Based Feature Invention and Relational Concept Classes
    Authors: Claudia Perlich, Foster Provost

  • #290 Inverted Matrix: Efficient Discovery of Frequent Items in Large Datasets in the Context of Interactive Mining
    Authors: Mohammad El-Hajj, Osmar R. Zaiane

  • #292 On Detecting Differences Between Groups
    Authors: Geoff Webb, Shane Butler, Douglas Newlands

  • #298 Indexing Multi-Dimensional Time-Series with Support for Multiple Distance Measures
    Authors: Michail Vlachos, Marios Hadjieleftheriou, Dimitrios Gunopulos, Eamonn Keogh

  • #326 An Iterative Hypothesis-Testing Strategy for Pattern Discovery
    Authors: Richard Bolton, Niall Adams

  • #329 Cross-Training: Learning Probabilistic Mappings Between Topics
    Authors: Sunita Sarawagi, Soumen Chakrabarti, Shantanu Godbole

  • #340 SEWeP: Using Site Semantics and a Taxonomy to Enhance the Web Personalization Process
    Authors: Magdalini Eirinaki, Michalis Vazirgiannis, Iraklis Varlamis

  • #358 Eliminating Noisy Information in Web Pages for Data Mining
    Authors: Lan Yi, Bing Liu, Xiaoli Li

  • #375 Mining Concept-Drifting Data Streams using Ensemble Classifiers
    Authors: Haixun Wang, Wei Fan, Philip Yu, Jiawei Han

  • #390 Maximizing the Spread of Influence through a Social Network
    Authors: David Kempe, Jon Kleinberg, Eva Tardos

  • #399 Mining Unexpected Rules by Pushing User Dynamics
    Authors: Ke Wang, Yuelong Jiang, Laks Lakshmanan

  • #400 Translation-Invariant Mixture Models for Curve Clustering
    Authors: Darya Chudova, Scott Gaffney, Eric Mjolsness, Padhraic Smyth

  • #401 Assessment and Pruning of Hierarchical Model Based Clustering
    Authors: Jeremy Tantrum, Alejandro Murua, Werner Stuetzle

  • #407 Algorithms for Discovering Relative Authority in Graphs
    Authors: Scott White, Padhraic Smyth

  • #422 Efficient Data Reduction with EASE
    Authors: Herve Bronnimann, Bin Chen, Manoranjan Dash, Peter Haas, Peter Scheuermann

  • #431 Adaptive Duplicate Detection Using Learnable String Similarity Measures
    Authors: Mikhail Bilenko, Raymond Mooney

  • #433 Generative Model-Based Clustering of Directional Data
    Authors: Arindam Banerjee, Inderjit Dhillon, Joydeep Ghosh, Suvrit Sra

  • #457 Privacy-Preserving K-Means Clustering over Vertically Partitioned Data
    Authors: Jaideep Vaidya, Chris Clifton

  • #461 Information-Theoretic Co-clustering
    Authors: Inderjit Dhillon, Subramanyam Mallela, Dharmendra Modha

  • #469 To Buy or Not to Buy: Mining Airline Fare Data to Minimize Ticket Purchase Price
    Authors: Oren Etzioni, Craig Knoblock, Rattapoon Tuchinda, Alexander Yates
 
FULL PAPERS (Industrial/Government Track)

  • I3. Capturing Best Practice for Microarray Gene Expression Data Analysis
    Authors: G. Piatetsky-Shapiro, T. Khabaza, S. Ramaswamy

  • I4. Passenger-Based Predictive Modeling of Airline No-show Rates
    Authors: R. D. Lawrence, S. J. Hong, J. Cherrier

  • I6. Golden Path Analyzer: Using Divide-and-Conquer to Cluster Web Clickstream
    Authors: K. Ali, S. P. Ketchpel

  • I12. The Anatomy of A Multimodal Information Filter
    Authors: Y.-L. Wu, K.-S. Goh, B. Li, H. You, E. Y. Chang

  • I15. Knowledge-Based Data Mining
    Authors: S. M. Weiss, S. J. Buckley, S. Kapoor, S. Damgaard

  • I19. Mining Hepatitis Data with Temporal Abstraction
    Authors: T. B. Ho, T. D. Nguyen, S. Kawasaki, S. Q. Le, H. Yokoi, K. Takabayashi

  • I20. Empirical Bayesian Data Mining for Discovering Patterns in Post-Marketing Drug Safety
    Authors: D. M. Fram, J. S. Almenoff, W. DuMouchel

  • I27. Discovery of Climate Indices using Clustering
    Authors: M. Steinbach, P.-N. Tan, V. Kumar, S. Klooster, C. Potter

  • I30. Clinical and Financial Outcomes Analysis with Existing Hospital Patient Records
    Authors: R. B. Rao, S. Sandilya, R. S. Niculescu, C. Germond, H. Rao

  • I33. The Data Mining Approach to Automated Software Testing
    Authors: M. Last, M. Friedman, A. Kandel

  • I37. Critical Event Prediction for Proactive Management in Large-scale Computer Clusters
    Authors: R. K. Sahoo, A. J. Oliner, I. Rish, M. Gupta, J. E. Moreira, S. Ma

  • I38. Frequent-Subsequence-Based Prediction of Outer Membrane Proteins
    Authors: R. She, F. Chen, K. Wang, M. Ester, J. L. Gardy, F. S. L. Brinkman

  • I475. Information Awareness: A Prospective Technical Assessment
    Authors: D. Jensen, M. Rattigan, and H. Blau
 
POSTER PAPERS (Research Track)

  • #121 CARPENTER: Finding Closed Patterns in Long Biological Datasets
    Authors: Feng Pan, Gao Cong, Anthony K. H. Tung, Jiong Yang, Mohammed Zaki

  • #127 Nantonac Collaborative Filtering: Recommendation Based on Order Responses
    Authors: Toshihiro Kamishima

  • #137 Graph-Based Anomaly Detection
    Authors: Caleb Noble, Diane Cook

  • #150 Efficient Decision Tree Construction on Streaming Data
    Authors: Ruoming Jin, Gagan Agrawal

  • #164 Empirical Comparisons of Various Voting Schemes in Boosting and Bagging
    Authors: Kelvin Leung, D. Stott Parker

  • #168 New Unsupervised Clustering Algorithm for Large Datasets
    Authors: William Peter, John Chiochetti

  • #174 Time and Sample Efficient Discovery of Markov Blankets and Direct Causal Relations
    Authors: Ioannis Tsamardinos, Constantin F. Aliferis, Alexander Statnikov

  • #177 Experiments with Random Projections for Machine Learning
    Authors: Dmitriy Fradkin, David Madigan

  • #184 PaintingClass: Interactive Construction, Visualization and Exploration of Decision Trees
    Authors: Soon Tee Teoh, Kwan-Liu Ma

  • #188 A Web Page Prediction Model Based On Click-Stream Tree
    Authors: Sule Gunduz, M. Tamer Ozsu

  • #195 Distributed Multivariate Regression Based on Influential Observations
    Authors: Hang Yu, Ee-Chien Chang

  • #200 A Two-Way Visualization Method for Clustered Data
    Authors: Yehuda Koren, David Harel

  • #208 Finding Recent Frequent Itemsets Adaptively over Online Data Streams
    Authors: Joong Hyuk Chang, Won Suk Lee

  • #216 On Computing, Storing and Querying Frequent Patterns
    Authors: Guimei Liu, Hongjun Lu, Wenwu Lou, Jeffrey Xu Yu

  • #225 Accurate Decision Trees for Mining High-Speed Data Streams
    Authors: Joao Gama, Ricardo Rocha, Pedro Medas

  • #240 Mining Data Records in Web Pages
    Authors: Bing Liu, Robert Grossman, Yanhong Zhai

  • #259 Stylistic Text Mining of Electronic Messages
    Authors: Shlomo Argamon, Marin Saric, Sterling Stein

  • #268 Mining Viewpoint Patterns in Image Databases
    Authors: Wynne Hsu, Jing Dai, Mong Li Lee

  • #269 Correlating Synchronous and Asynchronous Data Streams
    Authors: Sudipto Guha, Dimitrios Gunopulos, Nick Koudas

  • #273 Interactive Exploration of Coherent Patterns in Time-Series Gene Expression Data
    Authors: Daxin Jiang, Jian Pei, Aidong Zhang

  • #281 Probabilistic Discovery of Time Series Motifs
    Authors: Bill Chiu, Eamonn Keogh, Stefano Lonardi

  • #283 Mining Phenotypes and Informative Genes from Gene Expression Data
    Authors: Chun Tang, Aidong Zhang, Jian Pei

  • #297 Distributed Cooperative Mining for Information Consortium
    Authors: Satoshi Morinaga, Kenji Yamanishi, Jun-ichi Takeuchi

  • #311 Efficiently Handling Feature Redundancy in High-Dimensional Data
    Authors: Lei Yu, Huan Liu

  • #328 Navigating Massive Data Sets via Local Clustering
    Authors: Michael E. Houle

  • #331 Mining Associations in "Weighted Support - Significant" Framework
    Authors: Feng Tao, Fionn Murtagh

  • #337 Using Randomized Response Techniques for Privacy-Preserving Data Mining
    Authors: Wenliang Du, Zhijun Zhan

  • #343 A Bag of Paths Model for Representing Document Structure with Application to Web Mining
    Authors: Sachindra Joshi, Neeraj Agrawal, Raghu Krishnapuram, Sumit Negi

  • #365 Understanding Captions in Biomedical Publications
    Authors: William Cohen, Richard Wang, Robert Murphy

  • #369 Learning Relational Probability Trees
    Authors: Jennifer Neville, David Jensen, Lisa Friedland, Michael Hay

  • #395 Playing Hide-And-Seek with Correlations
    Authors: Christopher Jermaine

  • #417 Online Novelty Detection on Temporal Sequences
    Authors: Junshui Ma, Simon Perkins

  • #459 Mining High Dimensional Data for Classifier Knowledge
    Authors: Raj Bhatnagar, Goutham Kurra, Wen Niu

  • #465 Applications of Sampling and Fractional Factorial Designs to Model-Free Data Squashing
    Authors: William DuMouchel, Deepak K. Agarwal

  • #471 Tracking Evolving Communities in Large Linked Networks
    Authors: John Hopcroft, Omar Khan, Brian Kulis, Bart Selman

  • #483 Improving Spatial Locality using Data Mining
    Authors: Karlton Sequeira, Mohammed Zaki, Boleslaw Szymanski, Christopher Carothers
 
POSTER PAPERS (Industrial/Government Track)

  • I5. Similarity Analysis on Government Regulations
    Authors: G. T. Lau, K. H. Law, G. Wiederhold

  • I7. Architecting a Knowledge Discovery Engine for the Military Commander Utilizing Massive Runs of Agent Based Simulations
    Authors: P. Barry, J. Zhang, M. McDonald

  • I9. Applying Data Mining in Investigating Money Laundering Crimes
    Authors: Z. Zhang, J. J. Salerno, P. S. Yu, J. Hua, R. Zhang, M. Regan, D. Cutler

  • I10. Visualizing Concept Drift
    Authors: K. B. Pratt, G. Tschapek

  • I14. Data Quality through Knowledge Engineering
    Authors: T. Dasu, G. T. Vesonder, J. R. Wright

  • I22. An Adaptive Nearest Neighbor Search for a Parts Acquisition ePortal
    Authors: R. Alonso, J. A. Bloom, H. Li, C. Basu

  • I29. Experimental Design for Solicitation Campaigns
    Authors: U. F. Mayer, A. Sarkissian

  • I31. Experimental Study of Discovering Essential Information from Customer Inquiry
    Authors: K. Shimazu, A. Momma, K. Furukawa

  • I36. Towards NIC-based Intrusion Detection
    Authors: M. Otey, S. Parthasarathy, A. Ghoting, G. Li, S. Narravula

  • I39. Data-Driven Validation, Completion and Construction of Event Relation Networks
    Authors: C.-S. Perng, D. Thoenen, S. Ma, G. Brabarnik, J. Hellerstein

Webmaster: Osmar R. Zaļane
Last updated: May 23, 2003