Conference Program
 |
|
Monday November 8, 2004
|
8:30 am - 12:00 pm |
Tutorials |
|
12:00 pm - 1:30 pm |
Lunch
|
1:30 pm - 5:00 pm |
Tutorials
|
|
Tuesday November 9, 2004
|
9:00 am - 10:15 am |
Keynote Address
|
|
10:15 am - 10:30 am |
Coffee Break
|
10:30 am - 12:00 pm |
Paper Session DB-1 (Databases): Data Integration
Chair -
Luis Gravano
Columbia University
|
- Composable XML Integration Grammars
Wenfei Fan, Minos Garofalakis, Ming Xiong
(Bell Laboratories, Lucent Technologies),
Xibei Jia
(University of Edinburgh)
- Extending and Inferring Functional Dependencies in Schema Transformation
Qi He, Tok Wang Ling
(National University of Singapore)
- Organizing Structured Web Sources by Query Schemas: A Clustering Approach
Bin He, Tao Tao, Kevin Chen-Chuan Chang
(University of Illinois at Urbana-Champaign)
|
10:30 am - 12:00 pm |
Paper Session IR-1 (Information Retrieval): Information Retrieval Models
Chair -
Ian Soboroff
National Institute of Standards and Technology
|
- Unified Utility Maximization Framework for Resource Selection
Luo Si, Jamie Callan
(Carnegie Mellon University)
- Simple BM25 Extension to Multiple Weighted Fields
Stephen Robertson, Hugo Zaragoza, Michael Taylor
(Microsoft Research, Cambridge, U.K.)
- Scoring Missing Terms in Information Retrieval Tasks
Egidio Terra, Charles L. A. Clarke
(University of Waterloo, Canada)
|
10:30 am - 12:00 pm |
Paper Session KM-1 (Knowledge Management): Clustering I
Chair -
Tao Li
University of Rochester
|
- Goal-oriented Methods and Meta Methods for Document Classification and their Parameter Tuning
Stefan Siersdorfer, Sergej Sizov, Gerhard Weikum
(Max-Planck-Institut für Informatik, Saarbrücken, Germany)
- Using Bi-modal Alignment and Clustering Techniques for Documents and Speech Thematic Segmentations
Dalila Mekhaldi, Denis Lalanne, Rolf Ingold
(Département d’Informatique, Chemin du Musée, Fribourg, Switzerland)
- Hierarchical Document Categorization with Support Vector Machines
Lijuan Cai, Thomas Hofmann
(Brown University)
|
12:00 pm - 1:30 pm |
Lunch
|
1:30 pm - 3:00 pm |
Paper Session DB-2 (Databases): Data Streams
Chair -
Farnoush Banaei-Kashani
University of Southern California
|
- Interval Query Indexing for Efficient Stream Processing
Kun-Lung Wu, Shyh-Kwei Chen, Philip S. Yu
(IBM T.J. Watson Research Center)
- Evaluating Window Joins over Punctuated Streams
Luping Ding, Elke A. Rundensteiner
(Worcester Polytechnic Institute)
- EXPedite: A System for Encoded XML Processing
Yi Chen
(University of Pennsylvania),
George A. Mihaila
(IBM T.J. Watson Research Center),
Susan B. Davidson
(University of Pennsylvania),
Sriram Padmanabhan
(IBM Silicon Valley Labs)
|
1:30 pm - 3:00 pm |
Paper Session IR-2 (Information Retrieval): Web Information Retrieval
Chair -
Charles L. A. Clarke
University of Waterloo
|
- Optimizing Web Search Using Web Click-through Data
Gui-Rong Xue
(Shanghai Jiao-Tong University, P.R. China),
Hua-Jun Zeng, Zheng Chen
(Microsoft Research Asia 5F, Beijing, P.R. China),
Yong Yu
(Shanghai Jiao-Tong University, P.R. China),
Wei-Ying Ma
(Microsoft Research Asia 5F, Beijing, P.R. China),
WenSi Xi, WeiGuo Fan
(Virginia Polytechnic Institute and State University)
- A Practical Web-based Approach to Generating Topic Hierarchy for Text Segments
Shui-Lung Chuang, Lee-Feng Chien
(Institute of Information Science, Academia Sinica, Taiwan, R.O.C.)
- Acquisition of Categorized Named Entities for Web Search
Marius Pasca
(Google Inc.)
|
1:30 pm - 3:00 pm |
Poster Session P-1
|
- BioDIFF: An Effective Fast Change Detection Algorithm for Genomic and Proteomic Data
Yang Song, Sourav S Bhowmick
(Nanyang Technological University, Singapore)
- Protein Structure Alignment using Geometrical Features
S. Alireza Aghili, Divyakant Agrawal, Amr El Abbadi
(University of California at Santa Barbara)
- Mining Gene Expression Datasets using Density-based Clustering
Seokkyung Chung, Jongeun Jun, Dennis McLeod
(University of Southern California)
- Semi-supervised Learning for Music Artists Style Identification
Tao Li
(Florida International University),
Mitsunori Ogihara
(University of Rochester)
- Integrating Heterogeneous Features for Efficient Content Based Music Retrieval
Jialie Shen, John Shepherd
(The University of New South Wales, Sydney, Australia),
Anne H. H. Ngu
(Texas State University)
- Unified Filtering by Combining Collaborative Filtering and Content-based Filtering via Mixture Model and Exponential Model
Luo Si
(Carnegie Mellon University),
Rong Jin
(Michigan State University)
- A Framework for Refining Similarity Queries Using Learning Techniques
Yiming Ma, Qi Zhong, Sharad Mehrotra, Dawit Yimam Seid
(University of California at Irvine)
- A Dimensionality Reduction Technique for Efficient Similarity Analysis of Time Series Databases
Vasileios Megalooikonomou, Guo Li, Qiang Wang
(Temple University)
- Combining Structural and Citation-Based Evidence for Text Classification
Baoping Zhang, Marcos André Gonçalves, Weiguo Fan, Yuxin Chen, Edward A. Fox
(Virginia Tech),
Pável Calado, Marco Cristo
(Federal University of Minas Gerais, Belo Horizonte, MG, Brazil)
- Using Relevance Feedback to Detect Misuse for Information Retrieval Systems
Ling Ma, Nazli Goharian
(Illinois Institute of Technology)
- An Extended Logic Programming Based Multi-Agent System Formalization in Mobile Environments
Jianwen Chen
(IBM Australia),
Yan Zhang
(University of Western Sydney, Australia)
|
3:00 pm - 3:30 pm |
Coffee Break
|
3:30 pm - 5:00 pm |
Paper Session DB-3 (Databases): Data Mining
Chair -
Parvathi Chundi
University of Nebraska at Omaha
|
- Framework and Algorithms for Trend Analysis in Massive Temporal Data Sets
Sreenivas Gollapudi
(Oracle Corporation),
D. Sivakumar
(IBM Almaden Research Center)
- Scalable Sequential Pattern Mining for Biological Sequences
Ke Wang
(Simon Fraser University, Canada),
Yabo Xu
(Simon Fraser University, Canada & Chinese University of Hong Kong),
Jeffrey Xu Yu
(Chinese University of Hong Kong)
- Discovering Frequently Changing Structures from Historical Structural Deltas of Unordered XML
Qiankun Zhao, Sourav S Bhowmick
(Nanyang Technological University, Singapore),
Mukesh Mohania
(IBM India Research Lab),
Yahiko Kambayashi
(Kyoto University, Japan)
|
3:30 pm - 5:00 pm |
Paper Session DB-IR-1 (Databases and Information Retrieval): Indexing and Query Processing Efficiency
Chair -
Nazli Goharian
Illinois Institute of Technology
|
- Indexing Text Data under Space Constraints
Bijit Hore
(University of California at Irvine),
Hakan Hacigumus
(IBM Almaden Research Center),
Bala Iyer
(IBM Silicon Valley Lab),
Sharad Mehrotra
(University of California at Irvine)
- Image Similarity Search with Compact Data Structures
Qin Lv, Moses Charikar, Kai Li
(Princeton University)
- Energy Management Schemes for Memory-Resident Database Systems
Jayaprakash Pisharath, Alok Choudhary
(Northwestern University),
Mahmut Kandemir
(Pennsylvania State University)
|
3:30 pm - 5:00 pm |
Poster Session P-2
|
- Restructuring Batch View Maintenance Efficiently
Bin Liu, Elke A. Rundensteiner, David Finkel
(Worcester Polytechnic Institute)
- On Semantic Matching of Multilingual Attributes in Relational Systems
A. Kumaran, Jayant R. Haritsa
(Indian Institute of Science, Bangalore, India)
- Compression Schemes for Differential Categorical Stream Clustering
Weiyun Huang, Edward Omiecinski, Leo Mark
(Georgia Institute of Technology)
- Using a Compact Tree to Index and Query XML Data
Qinghua Zou, Shaorong Liu, Wesley W. Chu
(University of California at Los Angeles)
- A Framework for Selective Query Expansion
Steve Cronen-Townsend, Yun Zhou, W. Bruce Croft
(University of Massachusetts)
- Exploiting Hierarchical Relationships in Conceptual Search
Devanand Ravindran, Susan Gauch
(University of Kansas at Lawrence)
- MRSSA: An Iterative Algorithm for Similarity Spreading over Interrelated Objects
Gui-Rong Xue
(Shanghai Jiao-Tong University, P.R. China),
Hua-Jun Zeng, Zheng Chen
(Microsoft Research Asia 5F, Beijing, P.R. China),
Yong Yu
(Shanghai Jiao-Tong University, P.R. China),
Wei-Ying Ma
(Microsoft Research Asia 5F, Beijing, P.R. China),
WenSi Xi, Edward Fox
(Virginia Polytechnic Institute and State University)
- Web Page Clustering Enhanced by Summarization
Xuanhui Wang
(University of Illinois at Urbana-Champaign),
Dou Shen
(Tsinghua University, Beijing, P.R. China),
Hua-Jun Zeng, Zheng Chen, Wei-Ying Ma
(Microsoft Research Asia 5F, Beijing, P.R. China)
- Grammar-Based Task Analysis of Web Logs
Savitha Srinivasan, Arnon Amir, Prasad Deshpande, Vladimir Zbarsky
(IBM Almaden Research Center)
- Soft Clustering Criterion Functions for Partitional Document Clustering: A Summary of Results
Ying Zhao, George Karypis
(University of Minnesota)
- Calculating Similarity Between Texts using Graph-based Text Representation Model
Junji Tomita, Hidekazu Nakawatase, Megumi Ishii
(NTT Corporation, Kanagawa, Japan)
|
7:00 pm - 8:00 pm |
Reception
|
Wednesday November 10, 2004
|
9:00 am - 10:15 am |
Keynote Address
|
|
10:15 am - 10:30 am |
Coffee Break
|
10:30 am - 12:00 pm |
Paper Session IR-3 (Information Retrieval): Fusion of Retrieval Systems
Chair -
Stephen Robertson
Microsoft Research Cambridge
|
- A Design Space Approach to Analysis of Information Retrieval Adaptive Filtering Systems
Dmitriy Fradkin, Paul Kantor
(The Center for Discrete Mathematics & Theoretical Computer Science)
- A Multi-System Analysis of Document and Term Selection for Blind Feedback
Thomas R. Lynam
(University of Waterloo, Canada),
Chris Buckley
(Sabir Research Inc.),
Charles L. A. Clarke, Gordon V. Cormack
(University of Waterloo, Canada)
- Improving Document Representations Using Relevance Feedback: The RFA Algorithm
Razvan Stefan Bot, Yi-fang Brook Wu
(New Jersey Institute of Technology)
|
10:30 am - 12:00 pm |
Paper Session KM-2 (Knowledge Management): Clustering II
Chair -
Stefan Siersdorfer
MPI Saarbruecken
|
- A Vertical Distance-based Outlier Detection Method with Local Pruning
Dongmei Ren, Imad Rahal, William Perrizo
(North Dakota State University),
Kirk Scott
(University of Alaska Anchorage)
- ClusterMap: Labeling Clusters in Large Datasets via Visualization
Keke Chen, Ling Liu
(Georgia Institute of Technology)
- On Combining Multiple Clusterings
Tao Li
(Florida International University),
Mitsunori Ogihara
(University of Rochester),
Sheng Ma
(IBM T.J. Watson Research Center)
|
10:30 am - 12:00 pm |
Poster Session P-2
|
|
12:00 pm - 1:30 pm |
Lunch
|
1:30 pm - 3:00 pm |
Paper Session DB-4 (Databases): Similarity Search
Chair -
Vasileios Megalooikonomou
Temple University
|
- SWAM: A Family of Access Methods for Similarity-Search in Peer-to-Peer Data Networks
Farnoush Banaei-Kashani, Cyrus Shahabi
(University of Southern California)
- Localized Signature Table: Fast Similarity Search on Transaction Data
Qiang Jing, Rui Yang, Panos Kalnis, Anthony K. H. Tung
(National University of Singapore)
- Distance-Function Design and Fusion for Sequence Data
Yi Wu, Edward Y. Chang
(University of California at Santa Barbara)
|
1:30 pm - 3:00 pm |
Paper Session IR-4 (Information Retrieval): Machine Learning in Information Retrieval
Chair -
Rosie Jones
Yahoo! Inc
|
- Learning Similarity Measures in Non-orthogonal Space
Ning Liu
(Tsinghua University, Beijing, P.R. China),
Benyu Zhang
(Microsoft Research Asia, Beijing, P.R. China),
Jun Yan
(Peking University, Beijing, P.R. China),
Qiang Yang
(Hong Kong University of Science and Technology),
Shuicheng Yan, Zheng Chen
(Microsoft Research Asia, Beijing, P.R. China),
Fengshan Bai
(Tsinghua University, Beijing, P.R. China),
Wei-Ying Ma
(Microsoft Research Asia, Beijing, P.R. China)
- Feature Selection with Conditional Mutual Information MaxiMin in Text Categorization
Gang Wang, Frederick H. Lochovsky, Qiang Yang
(Hong Kong University of Science and Technology)
- Regularizing Translation Models for Better Automatic Image Annotation
Feng Kang, Rong Jin, Joyce Y. Chai
(Michigan State University)
|
1:30 pm - 3:00 pm |
Panel on "Key Problems in Integrating Structured and Unstructured Information"
|
3:00 pm - 3:30 pm |
Coffee Break
|
3:30 pm - 5:00 pm |
Paper Session DB-IR-2 (Databases and Information Retrieval): Web and XML Text Search
Chair -
Min-Yen Kan
National University of Singapore
|
- Providing Consistent and Exhaustive Relevance Assessments for XML Retrieval Evaluation
Benjamin Piwowarski
(University Paris 6, France),
Mounia Lalmas
(University of London, England)
- Processing Content-Oriented XPath Queries
Börkur Sigurbjörnsson, Jaap Kamps, Maarten de Rijke
(University of Amsterdam, The Netherlands)
- Local Methods for Estimating PageRank Values
Yen-Yu Chen, Qingqing Gan, Torsten Suel
(Polytechnic University)
|
3:30 pm - 5:00 pm |
Paper Session IR-5 (Information Retrieval): Information Retrieval Applications
Chair -
Susan Gauch
University of Kansas
|
- The Liberal Media and Right-Wing Conspiracies: Using Cocitation Information to Estimate Political Orientation in Web Documents
Miles Efron
(University of Texas at Austin)
- Associative Document Retrieval by Query Subtopic Analysis and its Application to Invalidity Patent Search
Toru Takaki
(NTT DATA Corporation, Tokyo, Japan),
Atsushi Fujii, Tetsuya Ishikawa
(University of Tsukuba, Japan)
- Taxonomy-driven Computation of Product Recommendations
Cai-Nicolas Ziegler, Georg Lausen, Lars Schmidt-Thieme
(Universität Freiburg, Germany)
|
3:30 pm - 5:00 pm |
Poster Session P-1
|
|
7:00 pm |
Banquet
|
- Speaker: Alan Wade
Chief Information Officer, CIA
|
Thursday November 11, 2004
|
9:00 am - 10:15 am |
Keynote Address
|
|
10:15 am - 10:30 am |
Coffee Break
|
10:30 am - 12:00 pm |
Paper Session DB-5 (Databases): Potpourri
Chair -
Luis Gravano
Columbia University
|
- Computing Consistent Query Answers using Conflict Hypergraphs
Jan Chomicki
(University at Buffalo, State University of New York),
Jerzy Marcinkowski
(Wroclaw University, Poland),
Slawomir Staworko
(University at Buffalo, State University of New York)
- Motion Adaptive Indexing for Moving Continual Queries over Moving Objects
Bugra Gedik
(Georgia Institute of Technology),
Kun-Lung Wu, Philip Yu
(IBM T.J. Watson Research Center),
Ling Liu
(Georgia Institute of Technology)
- On Lossy Time Decompositions of Time Stamped Documents
Parvathi Chundi
(University of Nebraska at Omaha),
Daniel J. Rosenkrantz
(University at Albany, State University of New York)
|
10:30 am - 12:00 pm |
Paper Session IR-KM-1 (Information Retrieval and Knowledge Management): Text Mining
Chair -
James G. Shanahan
Clairvoyance Corporation
|
- Event Threading within News Topics
Ramesh Nallapati, Ao Feng, Fuchun Peng, James Allan
(University of Massachusetts)
- Approximating the Top-m Passages in a Parallel Question Answering System
Charles L. A. Clarke, Egidio L. Terra
(University of Waterloo, Canada)
- Dynamic Extraction of Topic Descriptors and Discriminators: Towards Automatic Context-based Topic Search
Ana Maguitman, David Leake, Thomas Reichherzer, Filippo Menczer
(Indiana University)
|
10:30 am - 12:00 pm |
Industry Track Poster Session
|
- Design of a Data Warehouse System for Network/Web Services
Anoop Singhal
(George Mason University)
- InfoAnalyzer: A Computer-Aided Tool for Building Enterprise Taxonomies
Li Zhang, ShiXia Liu, Yue Pan, LiPing Yang
(IBM China Research Laboratory)
- RStar: An RDF Storage and Query System for Enterprise Resource Management
Li Ma, Zhong Su, Yue Pan, Li Zhang, Tao Liu
(IBM China Research Laboratory)
- Processing Search Queries in a Distributed Environment
Frederick Knabe, Daniel Tunkelang
(Endeca Technologies)
- Intelligent Agent For Automated Manufacturing Rule Generation
Alan Clark, Dimitar Filev
(Ford Motor Company)
- Document Clustering Based on Cluster Validation
Zheng-Yu Niu, Dong-Hong Ji
(Institute for Infocomm Research, Singapore),
Chew-Lim Tan
(National University of Singapore)
- Circumstance-Based Categorization Analysis of Knowledge Management Systems for the Japanese Market
Makoto Sano
(Justsystem Corporation),
David A. Evans
(Clairvoyance Corporation)
- Database Support for Species Extraction from the Biosystematics Literature - a Feasibility Demonstration
Ralf Duckstein, Klemens Böhm
(Otto-von-Guericke-University Magdeburg, Germany)
|
12:00 pm - 1:30 pm |
Lunch
|
1:30 pm - 3:00 pm |
Paper Session DB-6 (Databases): XML Query Processing
Chair -
Alireza Aghili
University of California at Santa Barbara
|
- Efficient Processing of XML Twig Patterns with Parent Child Edges: A Look-ahead Approach
Jiaheng Lu, Ting Chen, Tok Wang Ling
(National University of Singapore)
- QFilter: Fine-Grained Run-Time XML Access Control via NFA-based Query Rewriting
Bo Luo, Dongwon Lee, Wang-Chien Lee, Peng Liu
(Pennsylvania State University)
- Virtual Cursors for XML Joins
Beverly Yang
(Stanford University),
Marcus Fontoura, Eugene Shekita, Sridhar Rajagopalan, Kevin Beyer
(IBM Almaden Research Center)
|
1:30 pm - 3:00 pm |
Paper Session IR-6 (Information Retrieval): Digital Libraries
Chair -
Andrea S. LaPaugh
Princeton University
|
- CiteSeer-API: Towards Seamless Resource Location and Interlinking for Digital Libraries
Yves Petinot, C. Lee Giles, Vivek Bhatnagar, Pradeep B. Teregowda, Hui Han, Isaac Councill
(Pennsylvania State University)
- The Robustness of Content-based Search in Hierarchical Peer to Peer Networks
M. Elena Renda
(I.S.T.I. - C.N.R. and Scuola Superiore Sant’Anna, Pisa, Italy),
Jamie Callan
(Carnegie Mellon University)
- SERF: Integrating Human Recommendations with Search
Seikyung Jung, Kevin Harris, Janet Webster, Jonathan L. Herlocker
(Oregon State University)
|
1:30 pm - 3:00 pm |
Paper Session KM-3 (Knowledge Management): Knowledge Extraction
Chair -
Ophir Frieder
Illinois Institute of Technology
|
- Weakly-Supervised Relation Classification for Information Extraction
Zhu Zhang
(University of Michigan)
- TEG - A Hybrid Approach to Information Extraction
Benjamin Rosenfeld, Ronen Feldman, Moshe Fresko, Jonathan Schler, Yonatan Aumann
(Bar-Ilan University, Ramat Gan, Israel)
- Node Ranking in Labeled Directed Graphs
Krishna P. Chitrapura
(Indian Institute of Technology),
Srinivas R. Kashyap
(University of Maryland)
|
3:00 pm - 3:30 pm |
Coffee Break
|
3:30 pm - 5:00 pm |
Paper Session IR-7 (Information Retrieval): Natural Language Processing for IR
Chair -
David A. Evans
Clairvoyance Corporation
|
- Unsupervised Question Answering Data Acquisition From Local Corpora
Lucian Vlad Lita, Jaime Carbonell
(Carnegie Mellon University)
- Distributional Term Representations: An Experimental Comparison
Alberto Lavelli
(ITC-irst, Povo di Trento, Italy),
Fabrizio Sebastiani
(ISTI-CNR, Pisa, Italy),
Roberto Zanoli
(ITC-irst, Povo di Trento, Italy)
- Stemming and Lemmatization in the Clustering of Finnish Text Documents
Tuomo Korenius, Jorma Laurikkala, Kalervo Järvelin, Martti Juhola
(University of Tampere, Finland)
|
3:30 pm - 5:00 pm |
Paper Session KM-4 (Knowledge Management): Distributed Knowledge Management
Chair -
Marc Ronthaler
University of Bremen
|
- Towards Smarter Documents
Vikas Krishna, Prasad M. Deshpande, Savitha Srinivasan
(IBM Almaden Research Center)
- On Structuring Formal, Semi-Formal and Informal Data to Support Traceability in Systems Engineering Environments
Paul Mason
(Shinawatra University, Pathumthani, Thailand),
Ken Cosh, Pulyamon Vihakapirom
(Asian University of Science & Technology, Chon Buri, Thailand)
- Swoogle: A Search and Metadata Engine for the Semantic Web
Li Ding, Tim Finin, Anupam Joshi, Rong Pan, R. Scott Cost, Yun Peng, Pavan Reddivari, Vishal Doshi, Joel Sachs
(University of Maryland at Baltimore County)
|
Friday November 12, 2004
|
8:00 am - 10:00 am |
Session
|
10:30 am - 12:00 pm |
Workshops
|
|
12:00 pm - 1:30 pm |
Lunch
|
1:30 pm - 4:30 pm |
Workshops
|
|
Saturday November 13, 2004
|
8:00 am - 10:00 am |
Session
|
10:30 am - 12:00 pm |
Workshops
|
|
12:00 pm - 1:30 pm |
Lunch
|
1:30 pm - 4:30 pm |
Workshops
|
|