KDD Cup  

KDD Cup 2000: Tasks

Held in conjunction with the Sixth ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

The KDD Cup 2000 domain contains clickstream and purchase data from Gazelle.com, a legwear and legcare web retailer that closed its online store on August 18, 2000.

You are required to sign a non-disclosure agreement in order to receive a password to access the data, although the original restrictions were dramatically relaxed in April 2002 to allow wider use of the data. Any use of the data is allowed, provided that proper acknowledgment is given and a copy of the work is sent to Blue Martini Software.

In order to access the data, you must fill out the form on this page. Your username and password will be emailed to you.

When you have received a username and password (see above), you can go to the confidential section of the site, which contains a description of the tasks, the data, background information, and more.

The reference for KDD Cup 2000 is as follows (a PDF is available at the URL in the citation):

Ron Kohavi, Carla Brodley, Brian Frasca, Llew Mason, and Zijian Zheng. KDD-Cup 2000 organizers' report: Peeling the onion. SIGKDD Explorations, 2(2):86-98, 2000. http://robotics.stanford.edu/users/ronnyk/kddOrganizerReport.pdf

The bibtex entry is:

@Article{kddcup2000,
author = {Ron Kohavi and Carla Brodley and Brian Frasca and Llew Mason and Zijian Zheng},
title = {{KDD-Cup} 2000 Organizers' Report: Peeling the Onion},
journal = {SIGKDD Explorations},
volume = {2},
number = {2},
pages = {86--98},
url = {http://robotics.stanford.edu/users/ronnyk/kddOrganizerReport.pdf},
year = 2000}

A paper describing the Blue Martini architecture is also available:

Suhail Ansari, Ron Kohavi, Llew Mason, and Zijian Zheng, Integrating E-Commerce and Data Mining: Architecture and Challenges, ICDM 2001.

Please remember the restrictions on the data.