ACM Thirteenth Conference on
Information and Knowledge Management (CIKM)
CIKM and Workshops 2004
Conference Program

Monday November 8, 2004
8:30 am - 12:00 pm Tutorials
12:00 pm - 1:30 pm Lunch
1:30 pm - 5:00 pm Tutorials
Tuesday November 9, 2004
9:00 am - 10:15 am Keynote Address
10:15 am - 10:30 am Coffee Break
10:30 am - 12:00 pm Paper Session DB-1 (Databases): Data Integration
Chair - Luis Gravano Columbia University
  • Composable XML Integration Grammars
    Wenfei Fan, Minos Garofalakis, Ming Xiong (Bell Laboratories, Lucent Technologies),
    Xibei Jia (University of Edinburgh)

  • Extending and Inferring Functional Dependencies in Schema Transformation
    Qi He, Tok Wang Ling (National University of Singapore)

  • Organizing Structured Web Sources by Query Schemas: A Clustering Approach
    Bin He, Tao Tao, Kevin Chen-Chuan Chang (University of Illinois at Urbana-Champaign)

10:30 am - 12:00 pm Paper Session IR-1 (Information Retrieval): Information Retrieval Models
Chair - Ian Soboroff National Institute of Standards and Technology
  • Unified Utility Maximization Framework for Resource Selection
    Luo Si, Jamie Callan (Carnegie Mellon University)

  • Simple BM25 Extension to Multiple Weighted Fields
    Stephen Robertson, Hugo Zaragoza, Michael Taylor (Microsoft Research, Cambridge, U.K.)

  • Scoring Missing Terms in Information Retrieval Tasks
    Egidio Terra, Charles L. A. Clarke (University of Waterloo, Canada)

10:30 am - 12:00 pm Paper Session KM-1 (Knowledge Management): Clustering I
Chair - Tao Li University of Rochester
  • Goal-oriented Methods and Meta Methods for Document Classification and their Parameter Tuning
    Stefan Siersdorfer, Sergej Sizov, Gerhard Weikum (Max-Planck-Institut für Informatik, Saarbrücken, Germany)

  • Using Bi-modal Alignment and Clustering Techniques for Documents and Speech Thematic Segmentations
    Dalila Mekhaldi, Denis Lalanne, Rolf Ingold (Département d’Informatique Chemin du Musée, Fribourg, Switzerland)

  • Hierarchical Document Categorization with Support Vector Machines
    Lijuan Cai, Thomas Hofmann (Brown University)

12:00 pm - 1:30 pm Lunch
1:30 pm - 3:00 pm Paper Session DB-2 (Databases): Data Streams
Chair - Farnoush Banaei-Kashani University of Southern California
  • Interval Query Indexing for Efficient Stream Processing
    Kun-Lung Wu, Shyh-Kwei Chen, Philip S. Yu (IBM T.J. Watson Research Center)

  • Evaluating Window Joins over Punctuated Streams
    Luping Ding, Elke A. Rundensteiner (Worcester Polytechnic Institute)

  • EXPedite: A System for Encoded XML Processing
    Yi Chen (University of Pennsylvania),
    George A. Mihaila (IBM T.J. Watson Research Center),
    Susan B. Davidson (University of Pennsylvania),
    Sriram Padmanabhan (IBM Silicon Valley Labs)
1:30 pm - 3:00 pm Paper Session IR-2 (Information Retrieval): Web Information Retrieval
Chair - Charles L. A. Clarke University of Waterloo
  • Optimizing Web Search Using Web Click-through Data
    Gui-Rong Xue (Shanghai Jiao-Tong University, P.R.China),
    Hua-Jun Zeng, Zheng Chen (Microsoft Research Asia 5F, Beijing, P.R.China),
    Yong Yu (Shanghai Jiao-Tong University, P.R.China),
    Wei-Ying Ma (Microsoft Research Asia 5F, Beijing, P.R.China),
    WenSi Xi, WeiGuo Fan (Virginia Polytechnic Institute and State University)

  • A Practical Web-based Approach to Generating Topic Hierarchy for Text Segments
    Shui-Lung Chuang, Lee-Feng Chien (Institute of Information Science, Academia Sinica, Taiwan, R.O.C.)

  • Acquisition of Categorized Named Entities for Web Search
    Marius Pasca (Google Inc.)
1:30 pm - 3:00 pm Poster Session P-1
  • BioDIFF: An Effective Fast Change Detection Algorithm for Genomic and Proteomic Data
    Yang Song, Sourav S Bhowmick (Nanyang Technological University, Singapore)

  • Protein Structure Alignment using Geometrical Features
    S. Alireza Aghili, Divyakant Agrawal, Amr El Abbadi (University of California at Santa Barbara)

  • Mining Gene Expression Datasets using Density-based Clustering
    Seokkyung Chung, Jongeun Jun, Dennis McLeod (University of Southern California)

  • Semi-supervised Learning for Music Artists Style Identification
    Tao Li (Florida International University),
    Mitsunori Ogihara (University of Rochester)

  • Integrating Heterogeneous Features for Efficient Content Based Music Retrieval
    Jialie Shen, John Shepherd (The University of New South Wales, Sydney, Australia),
    Anne H. H. Ngu (Texas State University)

  • Unified Filtering by Combining Collaborative Filtering and Content-based Filtering via Mixture Model and Exponential Model
    Luo Si (Carnegie Mellon University)
    Rong Jin (Michigan State University)

  • A Framework for Refining Similarity Queries Using Learning Techniques
    Yiming Ma, Qi Zhong, Sharad Mehrotra, Dawit Yimam Seid (University of California at Irvine)

  • A Dimensionality Reduction Technique for Efficient Similarity Analysis of Time Series Databases
    Vasileios Megalooikonomou, Guo Li, Qiang Wang (Temple University)

  • Combining Structural and Citation-Based Evidence for Text Classification
    Baoping Zhang, Marcos André Gonçalves, Weiguo Fan, Yuxin Chen, Edward A. Fox (Virginia Tech)
    Pável Calado, Marco Cristo (Federal University of Minas Gerais, Belo Horizonte, MG, Brazil)

  • Using Relevance Feedback to Detect Misuse for Information Retrieval Systems
    Ling Ma, Nazli Goharian (Illinois Institute of Technology)

  • An Extended Logic Programming Based Multi-Agent System Formalization in Mobile Environments
    Jianwen Chen (IBM Australia),
    Yan Zhang (University of Western Sydney, Australia)
3:00 pm - 3:30 pm Coffee Break
3:30 pm - 5:00 pm Paper Session DB-3 (Databases): Data Mining
Chair - Parvathi Chundi University of Nebraska at Omaha
  • Framework and Algorithms for Trend Analysis in Massive Temporal Data Sets
    Sreenivas Gollapudi (Oracle Corporation),
    D. Sivakumar (IBM Almaden Research Center)

  • Scalable Sequential Pattern Mining for Biological Sequences
    Ke Wang (Simon Fraser University, Canada)
    Yabo Xu (Simon Fraser University, Canada & Chinese University of Hong Kong)
    Jeffrey Xu Yu (Chinese University of Hong Kong)

  • Discovering Frequently Changing Structures from Historical Structural Deltas of Unordered XML
    Qiankun Zhao, Sourav S Bhowmick (Nanyang Technological University, Singapore),
    Mukesh Mohania (IBM India Research Lab),
    Yahiko Kambayashi (Kyoto University, Japan)

3:30 pm - 5:00 pm Paper Session DB-IR-1 (Databases and Information Retrieval): Indexing and Query Processing Efficiency
Chair - Nazli Goharian Illinois Institute of Technology
  • Indexing Text Data under Space Constraints
    Bijit Hore (University of California at Irvine),
    Hakan Hacigumus (IBM Almaden Research Center),
    Bala Iyer (IBM Silicon Valley Lab),
    Sharad Mehrotra (University of California at Irvine)

  • Image Similarity Search with Compact Data Structures
    Qin Lv, Moses Charikar, Kai Li (Princeton University)

  • Energy Management Schemes for Memory-Resident Database Systems
    Jayaprakash Pisharath, Alok Choudhary (Northwestern University),
    Mahmut Kandemir (Pennsylvania State University)

3:30 pm - 5:00 pm Poster Session P-2
  • Restructuring Batch View Maintenance Efficiently
    Bin Liu, Elke A. Rundensteiner, David Finkel (Worcester Polytechnic Institute)

  • On Semantic Matching of Multilingual Attributes in Relational Systems
    A. Kumaran, Jayant R. Haritsa (Indian Institute of Science, Bangalore, India)

  • Compression Schemes for Differential Categorical Stream Clustering
    Weiyun Huang, Edward Omiecinski, Leo Mark (Georgia Institute of Technology)

  • Using a Compact Tree to Index and Query XML Data
    Qinghua Zou, Shaorong Liu, Wesley W. Chu (University of California at Los Angeles)

  • A Framework for Selective Query Expansion
    Steve Cronen-Townsend, Yun Zhou, W. Bruce Croft (University of Massachusetts)

  • Exploiting Hierarchical Relationships in Conceptual Search
    Devanand Ravindran, Susan Gauch (University of Kansas at Lawrence)

  • MRSSA: An Iterative Algorithm for Similarity Spreading over Interrelated Objects
    Gui-Rong Xue (Shanghai Jiao-Tong University, P.R. China),
    Hua-Jun Zeng, Zheng Chen (Microsoft Research Asia 5F, Beijing, P.R.China),
    Yong Yu (Shanghai Jiao-Tong University, P.R. China),
    Wei-Ying Ma (Microsoft Research Asia 5F, Beijing, P.R.China),
    WenSi Xi, Edward Fox (Virginia Polytechnic Institute and State University)

  • Web Page Clustering Enhanced by Summarization
    Xuanhui Wang (University of Illinois at Urbana-Champaign),
    Dou Shen (Tsinghua University, Beijing, P.R.China),
    Hua-Jun Zeng, Zheng Chen, Wei-Ying Ma (Microsoft Research Asia 5F, Beijing, P.R.China)

  • Grammar-Based Task Analysis of Web Logs
    Savitha Srinivasan, Arnon Amir, Prasad Deshpande, Vladimir Zbarsky (IBM Almaden Research Center)

  • Soft Clustering Criterion Functions for Partitional Document Clustering: A Summary of Results
    Ying Zhao, George Karypis (University of Minnesota)

  • Calculating Similarity Between Texts using Graph-based Text Representation Model
    Junji Tomita, Hidekazu Nakawatase, Megumi Ishii (NTT Corporation, Kanagawa, Japan)

7:00 pm - 8:00 pm Reception
Wednesday November 10, 2004
9:00 am - 10:15 am Keynote Address
10:15 am - 10:30 am Coffee Break
10:30 am - 12:00 pm Paper Session IR-3 (Information Retrieval): Fusion of Retrieval Systems
Chair - Stephen Robertson Microsoft Research Cambridge
  • A Design Space Approach to Analysis of Information Retrieval Adaptive Filtering Systems
    Dmitriy Fradkin, Paul Kantor (The Center for Discrete Mathematics & Theoretical Computer Science)

  • A Multi-System Analysis of Document and Term Selection for Blind Feedback
    Thomas R. Lynam (University of Waterloo, Canada),
    Chris Buckley (Sabir Research Inc.)
    Charles L. A. Clarke, Gordon V. Cormack (University of Waterloo, Canada),

  • Improving Document Representations Using Relevance Feedback: The RFA Algorithm
    Razvan Stefan Bot, Yi-fang Brook Wu (New Jersey Institute of Technology)

10:30 am - 12:00 pm Paper Session KM-2 (Knowledge Management): Clustering II
Chair - Stefan Siersdorfer MPI Saarbruecken
  • A Vertical Distance-based Outlier Detection Method with Local Pruning
    Dongmei Ren, Imad Rahal, William Perrizo (North Dakota State University),
    Kirk Scott (University of Alaska Anchorage)

  • ClusterMap: Labeling Clusters in Large Datasets via Visualization
    Keke Chen, Ling Liu (Georgia Institute of Technology)

  • On Combining Multiple Clusterings
    Tao Li (Florida International University),
    Mitsunori Ogihara (University of Rochester),
    Sheng Ma (IBM T.J. Watson Research Center)

10:30 am - 12:00 pm Poster Session P-2
12:00 pm - 1:30 pm Lunch
1:30 pm - 3:00 pm Paper Session DB-4 (Databases): Similarity Search
Chair - Vasileios Megalooikonomou Temple University
  • SWAM: A Family of Access Methods for Similarity-Search in Peer-to-Peer Data Networks
    Farnoush Banaei-Kashani, Cyrus Shahabi (University of Southern California)

  • Localized Signature Table: Fast Similarity Search on Transaction Data
    Qiang Jing, Rui Yang, Panos Kalnis, Anthony K. H. Tung (National University of Singapore)

  • Distance-Function Design and Fusion for Sequence Data
    Yi Wu, Edward Y. Chang (University of California at Santa Barbara)

1:30 pm - 3:00 pm Paper Session IR-4 (Information Retrieval): Machine Learning in Information Retrieval
Chair - Rosie Jones Yahoo! Inc
  • Learning Similarity Measures in Non-orthogonal Space
    Ning Liu (Tsinghua University, Beijing, P.R. China),
    Benyu Zhang (Microsoft Research Asia, Beijing, P.R. China),
    Jun Yan (Peking University, Beijing, P.R. China),
    Qiang Yang (Hong Kong University of Science and Technology),
    Shuicheng Yan, Zheng Chen (Microsoft Research Asia, Beijing, P.R. China),
    Fengshan Bai (Tsinghua University, Beijing, P.R. China),
    Wei-Ying Ma (Microsoft Research Asia, Beijing, P.R. China)

  • Feature Selection with Conditional Mutual Information MaxiMin in Text Categorization
    Gang Wang, Frederick H. Lochovsky, Qiang Yang (Hong Kong University of Science and Technology)

  • Regularizing Translation Models for Better Automatic Image Annotation
    Feng Kang, Rong Jin, Joyce Y. Chai (Michigan State University)

1:30 pm - 3:00 pm Panel on "Key Problems in Integrating Structured and Unstructured Information"
3:00 pm - 3:30 pm Coffee Break
3:30 pm - 5:00 pm Paper Session DB-IR-2 (Databases and Information Retrieval): Web and XML Text Search
Chair - Min-Yen Kan National University of Singapore
  • Providing Consistent and Exhaustive Relevance Assessments for XML Retrieval Evaluation
    Benjamin Piwowarski (University Paris 6, France),
    Mounia Lalmas (University of London, England)

  • Processing Content-Oriented XPath Queries
    Börkur Sigurbjörnsson, Jaap Kamps, Maarten de Rijke (University of Amsterdam, The Netherlands)

  • Local Methods for Estimating PageRank Values
    Yen-Yu Chen, Qingqing Gan, Torsten Suel (Polytechnic University)

3:30 pm - 5:00 pm Paper Session IR-5 (Information Retrieval): Information Retrieval Applications
Chair - Susan Gauch University of Kansas
  • The Liberal Media and Right-Wing Conspiracies: Using Cocitation Information to Estimate Political Orientation in Web Documents
    Miles Efron (University of Texas at Austin)

  • Associative Document Retrieval by Query Subtopic Analysis and its Application to Invalidity Patent Search
    Toru Takaki (NTT DATA Corporation, Tokyo, Japan),
    Atsushi Fujii, Tetsuya Ishikawa (University of Tsukuba, Japan)

  • Taxonomy-driven Computation of Product Recommendations
    Cai-Nicolas Ziegler, Georg Lausen, Lars Schmidt-Thieme (Universität Freiburg, Germany)

3:30 pm - 5:00 pm Poster Session P-1
7:00 pm Banquet
  • Speaker: Alan Wade
    Chief Information Officer, CIA

Thursday November 11, 2004
9:00 am - 10:15 am Keynote Address
10:15 am - 10:30 am Coffee Break
10:30 am - 12:00 pm Paper Session DB-5 (Databases): Potpourri
Chair - Luis Gravano Columbia University
  • Computing Consistent Query Answers using Conflict Hypergraphs
    Jan Chomicki (University at Buffalo, State University of New York),
    Jerzy Marcinkowski (Wroclaw University, Poland),
    Slawomir Staworko (University at Buffalo, State University of New York)

  • Motion Adaptive Indexing for Moving Continual Queries over Moving Objects
    Bugra Gedik (Georgia Institute of Technology),
    Kun-Lung Wu, Philip Yu (IBM T.J. Watson Research Center),
    Ling Liu (Georgia Institute of Technology)

  • On Lossy Time Decompositions of Time Stamped Documents
    Parvathi Chundi (University of Nebraska at Omaha),
    Daniel J. Rosenkrantz (University at Albany, State University of New York)

10:30 am - 12:00 pm Paper Session IR-KM-1 (Information Retrieval and Knowledge Management): Text Mining
Chair - James G. Shanahan Clairvoyance Corporation
  • Event Threading within News Topics
    Ramesh Nallapati, Ao Feng, Fuchun Peng, James Allan (University of Massachusetts)

  • Approximating the Top-m Passages in a Parallel Question Answering System
    Charles L. A. Clarke, Egidio L. Terra (University of Waterloo, Canada)

  • Dynamic Extraction of Topic Descriptors and Discriminators: Towards Automatic Context-based Topic Search
    Ana Maguitman, David Leake, Thomas Reichherzer, Filippo Menczer (Indiana University)

10:30 am - 12:00 pm Industry Track Poster Session
  • Design of a Data Warehouse System for Network/Web Services
    Anoop Singhal (George Mason University)

  • InfoAnalyzer: A Computer-Aided Tool for Building Enterprise Taxonomies
    Li Zhang, ShiXia Liu, Yue Pan, LiPing Yang (IBM China Research Laboratory)

  • RStar: An RDF Storage and Query System for Enterprise Resource Management
    Li Ma, Zhong Su, Yue Pan, Li Zhang, Tao Liu (IBM China Research Laboratory)

  • Processing Search Queries in a Distributed Environment
    Frederick Knabe, Daniel Tunkelang (Endeca Technologies)

  • Intelligent Agent For Automated Manufacturing Rule Generation
    Alan Clark, Dimitar Filev (Ford Motor Company)

  • Document Clustering Based on Cluster Validation
    Zheng-Yu Niu, Dong-Hong Ji (Institute for Infocomm Research, Singapore),
    Chew-Lim Tan (National University of Singapore)

  • Circumstance-Based Categorization Analysis of Knowledge Management Systems for the Japanese Market
    Makoto Sano (Justsystem Corporation),
    David A. Evans (Clairvoyance Corporation)

  • Database Support for Species Extraction from the Biosystematics Literature - a Feasibility Demonstration
    Ralf Duckstein, Klemens Böhm (Otto-von-Guericke-University Magdeburg, Germany)

12:00 pm - 1:30 pm Lunch
1:30 pm - 3:00 pm Paper Session DB-6 (Databases): XML Query Processing
Chair - Alireza Aghili University of California at Santa Barbara
  • Efficient Processing of XML Twig Patterns with Parent Child Edges: A Look-ahead Approach
    Jiaheng Lu, Ting Chen, Tok Wang Ling (National University of Singapore)

  • QFilter: Fine-Grained Run-Time XML Access Control via NFA-based Query Rewriting
    Bo Luo, Dongwon Lee, Wang-Chien Lee, Peng Liu (Pennsylvania State University)

  • Virtual Cursors for XML Joins
    Beverly Yang (Stanford University),
    Marcus Fontoura, Eugene Shekita, Sridhar Rajagopalan, Kevin Beyer (IBM Almaden Research Center)

1:30 pm - 3:00 pm Paper Session IR-6 (Information Retrieval): Digital Libraries
Chair - Andrea S. LaPaugh Princeton University
  • CiteSeer-API: Towards Seamless Resource Location and Interlinking for Digital Libraries
    Yves Petinot, C. Lee Giles, Vivek Bhatnagar, Pradeep B. Teregowda, Hui Han, Isaac Councill (Pennsylvania State University)

  • The Robustness of Content-based Search in Hierarchical Peer to Peer Networks
    M. Elena Renda (I.S.T.I. - C.N.R. and Scuola Superiore Sant’Anna, Pisa, Italy),
    Jamie Callan (Carnegie Mellon University)

  • SERF: Integrating Human Recommendations with Search
    Seikyung Jung, Kevin Harris, Janet Webster, Jonathan L. Herlocker (Oregon State University),

1:30 pm - 3:00 pm Paper Session KM-3 (Knowledge Management): Knowledge Extraction
Chair - Ophir Frieder Illinois Institute of Technology
  • Weakly-Supervised Relation Classification for Information Extraction
    Zhu Zhang (University of Michigan)

  • TEG - A Hybrid Approach to Information Extraction
    Benjamin Rosenfeld, Ronen Feldman, Moshe Fresko, Jonathan Schler, Yonatan Aumann (Bar-Ilan University, Ramat Gan, Israel)

  • Node Ranking in Labeled Directed Graphs
    Krishna P. Chitrapura (Indian Institute of Technology),
    Srinivas R. Kashyap (University of Maryland)

3:00 pm - 3:30 pm Coffee Break
3:30 pm - 5:00 pm Paper Session IR-7 (Information Retrieval): Natural Language Processing for IR
Chair - David A. Evans Clairvoyance Corporation
  • Unsupervised Question Answering Data Acquisition From Local Corpora
    Lucian Vlad Lita, Jaime Carbonell (Carnegie Mellon University)

  • Distributional Term Representations: An Experimental Comparison
    Alberto Lavelli (ITC-irst, Povo di Trento, Italy),
    Fabrizio Sebastiani (ISTI-CNR, Pisa, Italy),
    Roberto Zanoli (ITC-irst, Povo di Trento, Italy)

  • Stemming and Lemmatization in the Clustering of Finnish Text Documents
    Tuomo Korenius, Jorma Laurikkala, Kalervo Järvelin, Martti Juhola (University of Tampere, Finland)

3:30 pm - 5:00 pm Paper Session KM-4 (Knowledge Management): Distributed Knowledge Management
Chair - Marc Ronthaler University of Bremen
  • Towards Smarter Documents
    Vikas Krishna, Prasad M. Deshpande, Savitha Srinivasan (IBM Almaden Research Center)

  • On Structuring Formal, Semi-Formal and Informal Data to Support Traceability in Systems Engineering Environments
    Paul Mason (Shinawatra University, Pathumthani, Thailand),
    Ken Cosh, Pulyamon Vihakapirom (Asian University of Science & Technology, Chon Buri, Thailand)

  • Swoogle: A Search and Metadata Engine for the Semantic Web
    Li Ding, Tim Finin, Anupam Joshi, Rong Pan, R. Scott Cost, Yun Peng, Pavan Reddivari, Vishal Doshi, Joel Sachs (University of Maryland at Baltimore County)

Friday November 12, 2004
8:00 am - 10:00 am Session
10:30 am - 12:00 pm Workshops
12:00 pm - 1:30 pm Lunch
1:30 pm - 4:30 pm Workshops
Saturday November 13, 2004
8:00 am - 10:00 am Session
10:30 am - 12:00 pm Workshops
12:00 pm - 1:30 pm Lunch
1:30 pm - 4:30 pm Workshops