Top VLDB publications per year (1994 - 2003)
The following list was generated based on Google Scholar and ACM citation counts in August 2005. For more information see our project on citation analysis.
| Title | Authors | Citations (GS) | Citations (ACM) |
|---|---|---|---|
| 1994 | |||
| Fast Algorithms for Mining Association Rules | Rakesh Agrawal, Ramakrishnan Srikant | 2261 | 392 |
| Efficient and Effective Clustering Methods for Spatial Data Mining | Raymond T. Ng, Jiawei Han | 531 | 141 |
| Hilbert R-tree: An Improved R-tree using Fractals | Ibrahim Kamel, Christos Faloutsos | 192 | 46 |
| Composite Events for Active Databases: Semantics, Contexts and Detection | Sharma Chakravarthy, V. Krishnaprasad, Eman Anwar, S.-K. Kim | 161 | 21 |
| Including Group-By in Query Optimization | Surajit Chaudhuri, Kyuseok Shim | 117 | 29 |
| #references top 5 | | 3421 | 629 |
| #references overall (of 63 papers) | split by quartiles: 4037 / 656 / 330 / 85 | 5108 | 1007 |
| 1995 | |||
| An Efficient Algorithm for Mining Association Rules in Large Databases | Ashok Savasere, Edward Omiecinski, Shamkant B. Navathe | 515 | 96 |
| Mining Generalized Association Rules | Ramakrishnan Srikant, Rakesh Agrawal | 513 | 90 |
| Discovery of Multiple-Level Association Rules from Large Databases | Jiawei Han, Yongjian Fu | 357 | 86 |
| Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time-Series Databases | Rakesh Agrawal, King-Ip Lin, Harpreet S. Sawhney, Kyuseok Shim | 274 | 67 |
| W3QS: A Query System for the World-Wide Web | David Konopnicki, Oded Shmueli | 261 | 39 |
| #references top 5 | | 1987 | 378 |
| #references overall (of 66 papers) | split by quartiles: 3197 / 501 / 213 / 40 | 3951 | 820 |
| 1996 | |||
| Querying Heterogeneous Information Sources Using Source Descriptions | Alon Y. Levy, Anand Rajaraman, Joann J. Ordille | 692 | 120 |
| The X-tree : An Index Structure for High-Dimensional Data | Stefan Berchtold, Daniel A. Keim, Hans-Peter Kriegel | 486 | 116 |
| Sampling Large Databases for Association Rules | Hannu Toivonen | 366 | 79 |
| SPRINT: A Scalable Parallel Classifier for Data Mining | John C. Shafer, Rakesh Agrawal, Manish Mehta | 289 | 64 |
| On the Computation of Multidimensional Aggregates | Sameet Agarwal, Rakesh Agrawal, Prasad Deshpande, Ashish Gupta, Jeffrey F. Naughton, Raghu Ramakrishnan, Sunita Sarawagi | 256 | 65 |
| #references top 5 | | 2177 | 444 |
| #references overall (of 66 papers) | split by quartiles: 3441 / 643 / 215 / 10 | 4309 | 933 |
| 1997 | |||
| DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases | Roy Goldman, Jennifer Widom | 439 | 84 |
| M-tree: An Efficient Access Method for Similarity Search in Metric Spaces | Paolo Ciaccia, Marco Patella, Pavel Zezula | 308 | 67 |
| Optimizing Queries Across Diverse Data Sources | Laura M. Haas, Donald Kossmann, Edward L. Wimmers, Jun Yang | 247 | 44 |
| Don't Scrap It, Wrap It! A Wrapper Architecture for Legacy Data Sources | Mary Tork Roth, Peter M. Schwarz | 197 | 40 |
| To Weave the Web | Paolo Atzeni, Giansalvatore Mecca, Paolo Merialdo | 160 | 27 |
| #references top 5 | | 1447 | 262 |
| #references overall (of 63 papers) | split by quartiles: 2474 / 723 / 278 / 71 | 3546 | 846 |
| 1998 | |||
| A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces | Roger Weber, Hans-Jörg Schek, Stephen Blott | 314 | 77 |
| Using Schema Matching to Simplify Heterogeneous Data Translation | Tova Milo, Sagit Zohar | 159 | 26 |
| Algorithms for Mining Distance-Based Outliers in Large Datasets | Edwin M. Knorr, Raymond T. Ng | 147 | 33 |
| WaveCluster: A Multi-Resolution Clustering Approach for Very Large Spatial Databases | Gholamhosein Sheikholeslami, Surojit Chatterjee, Aidong Zhang | 137 | 38 |
| MindReader: Querying Databases Through Multiple Examples | Yoshiharu Ishikawa, Ravishankar Subramanya, Christos Faloutsos | 130 | 43 |
| #references top 5 | | 920 | 217 |
| #references overall (of 68 papers) | split by quartiles: 1903 / 556 / 214 / 32 | 2705 | 707 |
| 1999 | |||
| Relational Databases for Querying XML Documents: Limitations and Opportunities | Jayavel Shanmugasundaram, Kristin Tufte, Chun Zhang, Gang He, David J. DeWitt, Jeffrey F. Naughton | 484 | 70 |
| Query Optimization for XML | Jason McHugh, Jennifer Widom | 170 | 34 |
| Similarity Search in High Dimensions via Hashing | Aristides Gionis, Piotr Indyk, Rajeev Motwani | 118 | 37 |
| DBMSs on a Modern Processor: Where Does Time Go? | Anastassia Ailamaki, David J. DeWitt, Mark D. Hill, David A. Wood | 111 | 29 |
| Extracting Large-Scale Knowledge Bases from the Web | Ravi Kumar, Prabhakar Raghavan, Sridhar Rajagopalan, Andrew Tomkins | 106 | 22 |
| #references top 5 | | 1032 | 192 |
| #references overall (of 66 papers) | split by quartiles: 1712 / 495 / 218 / 48 | 2473 | 588 |
| 2000 | |||
| Efficient Filtering of XML Documents for Selective Dissemination of Information | Mehmet Altinel, Michael J. Franklin | 182 | 40 |
| Efficiently Publishing Relational Data as XML Documents | Jayavel Shanmugasundaram, Eugene J. Shekita, Rimon Barr, Michael J. Carey, Bruce G. Lindsay, Hamid Pirahesh, Berthold Reinwald | 149 | 30 |
| A Scalable Algorithm for Answering Queries Using Views | Rachel Pottinger, Alon Y. Levy | 114 | 21 |
| Focused Crawling Using Context Graphs | Michelangelo Diligenti, Frans Coetzee, Steve Lawrence, C. Lee Giles, Marco Gori | 114 | 29 |
| Schema Mapping as Query Discovery | Renée J. Miller, Laura M. Haas, Mauricio A. Hernández | 112 | 27 |
| #references top 5 | | 720 | 147 |
| #references overall (of 70 papers) | split by quartiles: 1491 / 541 / 152 / 17 | 2201 | 586 |
| 2001 | |||
| Generic Schema Matching with Cupid | Jayant Madhavan, Philip A. Bernstein, Erhard Rahm | 222 | 37 |
| Indexing and Querying XML Data for Regular Path Expressions | Quanzhong Li, Bongki Moon | 208 | 37 |
| RoadRunner: Towards Automatic Data Extraction from Large Web Sites | Valter Crescenzi, Giansalvatore Mecca, Paolo Merialdo | 137 | 34 |
| A Fast Index for Semistructured Data | Brian Cooper, Neal Sample, Michael J. Franklin, Gísli R. Hjaltason, Moshe Shadmon | 123 | 25 |
| Comparing Hybrid Peer-to-Peer Systems | Beverly Yang, Hector Garcia-Molina | 113 | 14 |
| #references top 5 | | 852 | 147 |
| #references overall (of 76 papers) | split by quartiles: 1778 / 544 / 158 / 20 | 2500 | 531 |
| 2002 | |||
| Monitoring Streams - A New Class of Data Management Applications | Donald Carney, Ugur Çetintemel, Mitch Cherniack, Christian Convey, Sangdon Lee, Greg Seidman, Michael Stonebraker, Nesime Tatbul, Stanley B. Zdonik | 171 | 0 |
| Approximate Frequency Counts over Data Streams | Gurmeet Singh Manku, Rajeev Motwani | 109 | 0 |
| Efficient Algorithms for Processing XPath Queries | Georg Gottlob, Christoph Koch, Reinhard Pichler | 79 | 0 |
| COMA - A System for Flexible Combination of Schema Matching Approaches | Hong Hai Do, Erhard Rahm | 75 | 0 |
| Streaming Queries over Streaming Data | Sirish Chandrasekaran, Michael J. Franklin | 74 | 0 |
| #references top 5 | | 543 | 0 |
| #references overall (of 89 papers) | split by quartiles: 1295 / 344 / 175 / 37 | 1851 | 0 |
| 2003 | |||
| Querying the Internet with PIER | Ryan Huebsch, Joseph M. Hellerstein, Nick Lanham, Boon Thau Loo, Scott Shenker, Ion Stoica | 105 | 0 |
| Load Shedding in a Data Stream Manager | Nesime Tatbul, Ugur Çetintemel, Stanley B. Zdonik, Mitch Cherniack, Michael Stonebraker | 55 | 0 |
| A Framework for Clustering Evolving Data Streams | Charu C. Aggarwal, Jiawei Han, Jianyong Wang, Philip S. Yu | 39 | 0 |
| Operator Scheduling in a Data Stream Manager | Donald Carney, Ugur Çetintemel, Alex Rasin, Stanley B. Zdonik, Mitch Cherniack, Michael Stonebraker | 39 | 0 |
| Composing Mappings Among Data Sources | Jayant Madhavan, Alon Y. Halevy | 32 | 0 |
| #references top 5 | | 304 | 0 |
| #references overall (of 88 papers) | split by quartiles: 625 / 253 / 84 / 21 | 983 | 0 |
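
The quartile figures in the "#references overall" rows can be cross-checked against the Google Scholar totals in the same rows. The following is a minimal sketch, not part of the original analysis: the numbers are copied from the table, while the names `QUARTILES`, `check_totals`, and `top_quartile_share` are hypothetical. It assumes the four figures are listed from the most-cited quarter of papers to the least-cited quarter, which the list does not state explicitly.

```python
# Google Scholar quartile sums and overall totals, copied from the
# "#references overall" rows of the table (one entry per year).
# Assumption: quartiles are ordered from most-cited to least-cited papers.
QUARTILES = {
    1994: ((4037, 656, 330, 85), 5108),
    1995: ((3197, 501, 213, 40), 3951),
    1996: ((3441, 643, 215, 10), 4309),
    1997: ((2474, 723, 278, 71), 3546),
    1998: ((1903, 556, 214, 32), 2705),
    1999: ((1712, 495, 218, 48), 2473),
    2000: ((1491, 541, 152, 17), 2201),
    2001: ((1778, 544, 158, 20), 2500),
    2002: ((1295, 344, 175, 37), 1851),
    2003: ((625, 253, 84, 21), 983),
}


def check_totals(data):
    """Return the years whose quartile sums differ from the overall total."""
    return [year for year, (quartiles, total) in data.items()
            if sum(quartiles) != total]


def top_quartile_share(quartiles, total):
    """Fraction of a year's citations that falls into the first quartile."""
    return quartiles[0] / total


if __name__ == "__main__":
    print("Inconsistent years:", check_totals(QUARTILES))
    for year, (quartiles, total) in sorted(QUARTILES.items()):
        share = top_quartile_share(quartiles, total)
        print(f"{year}: first quartile holds {share:.1%} of {total} citations")
```

Running the script prints the share of each year's citations held by the first quartile, which gives a rough sense of how strongly citations concentrate on a few papers.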

