Affiliation Analysis
Bibliometric studies of computer science and database publications to date mainly focus on the number of papers and citations per author or per journal. Because (commercial) bibliographic systems concentrate on journals, the affiliations of authors in computer science and database research have received little analysis.
We analyze author affiliations of publications to determine the main institutions contributing research to a specific field. For instance, we determine the top affiliations in terms of number of papers (productivity) and also aggregate the numbers at varying levels of detail, e.g. cities, countries, and continents.
Author affiliations in publications are given in quite heterogeneous forms. Before these data can be analyzed, the affiliation mentions denoting the same real-world institution have to be aligned. To this end, we investigated web-based affiliation recognition, matching, and clustering (cf. our publications).
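The following Python sketch illustrates the general idea of aligning affiliation mentions: strings are normalized and then greedily grouped when they are similar enough. It is a simplified stand-in, not the web-based approach described in our publications. The similarity measure (`difflib.SequenceMatcher`), the threshold of 0.85, the stop-word list, and all function names are assumptions chosen for illustration.

```python
import re
from difflib import SequenceMatcher

# Hypothetical stop words that rarely help to tell institutions apart.
STOP_WORDS = {"of", "the", "at", "for", "and", "dept", "department"}


def normalize(affiliation: str) -> str:
    """Lowercase, strip punctuation, and drop stop words."""
    tokens = re.findall(r"[a-z0-9]+", affiliation.lower())
    return " ".join(t for t in tokens if t not in STOP_WORDS)


def similarity(a: str, b: str) -> float:
    """Character-based similarity in [0, 1] of the normalized strings."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def cluster(affiliations, threshold=0.85):
    """Greedy clustering: add a string to the first cluster whose
    representative (its first member) is similar enough, otherwise
    open a new cluster."""
    clusters = []
    for aff in affiliations:
        for group in clusters:
            if similarity(aff, group[0]) >= threshold:
                group.append(aff)
                break
        else:
            clusters.append([aff])
    return clusters


groups = cluster([
    "University of Leipzig",
    "university of leipzig.",
    "Stanford University",
])
```

A greedy pass like this depends on input order and on the chosen threshold, so in practice its output would need to be checked against known correspondences such as those in the dataset below.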
Interpreting multiple-author papers as collaborations makes bonds within and across institutions, cities, countries, and continents visible (see the illustration for an example).
Illustrating collaborations within and across major countries publishing database research
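One simple way to derive such collaboration bonds is to count, for every paper, each unordered pair of co-authors by country, with same-country pairs representing collaboration within a country. The sketch below assumes that each paper is available as a list with one country per author. The input format, the function name `collaboration_counts`, and the sample data are hypothetical.

```python
from collections import Counter
from itertools import combinations


def collaboration_counts(papers):
    """Count co-authorship links between countries.

    `papers` is an iterable of lists, one list of author countries per
    paper. Every pair of co-authors contributes one link; a link between
    two authors from the same country is a within-country collaboration.
    """
    links = Counter()
    for countries in papers:
        # Sorting makes (a, b) and (b, a) fall onto the same key.
        for a, b in combinations(sorted(countries), 2):
            links[(a, b)] += 1
    return links


# Made-up sample data, not taken from the dataset.
papers = [
    ["Germany", "USA"],
    ["USA", "USA", "Canada"],
    ["Germany"],  # single-author paper: no collaboration
]
links = collaboration_counts(papers)
```

The same function works at other levels of detail if the lists hold institutions, cities, or continents instead of countries.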
Project Members
Publications
See also Citation Analysis and Semantic Content
Dataset example
The following archive provides some of our data for download. It contains:
- affiliation strings, mostly as available from ACM, though in some cases the original PDFs were also taken into account
- correspondences between affiliation strings at the institution level, i.e. ignoring departments etc.
Download: affiliationstrings.zip
Note: Other object matching datasets are available via Benchmark datasets for entity resolution.
Example results from ten years of database publications
The following tables present initial results of an affiliation analysis of publications from the decade 2000–2009 that appeared in the top conferences SIGMOD and VLDB and in the VLDBJ and TODS journals. The publications can also be browsed by affiliation via our publication categorizer.
Notes on table headings:
- papers: productivity of the entity in question, using total counting of papers
- frac: productivity using fractional counting (all other columns always use total counting)
- affils: number of affiliations within the entity
- first and second: the year spans 2000–2004 and 2005–2009, respectively
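To make the difference between the two counting schemes concrete, the sketch below computes both for a small made-up sample. It assumes that fractional counting splits one credit per paper equally among its authors; the weighting actually used for the tables may differ. The function name `count_papers` and the input format (one entity per author) are hypothetical.

```python
from collections import defaultdict


def count_papers(papers):
    """Return (total, fractional) paper counts per entity.

    `papers` is an iterable of lists holding one entity (e.g. a country)
    per author. Assumption: fractional counting gives each author an
    equal share of one credit per paper.
    """
    total = defaultdict(int)
    frac = defaultdict(float)
    for entities in papers:
        if not entities:
            continue  # skip papers without affiliation information
        share = 1.0 / len(entities)
        for entity in entities:
            frac[entity] += share
        for entity in set(entities):
            total[entity] += 1  # one full count per entity and paper
    return dict(total), dict(frac)


# Made-up sample data, not taken from the dataset.
papers = [["USA", "USA", "Germany", "Germany"], ["USA"], ["Germany", "China"]]
total, frac = count_papers(papers)
```

In this sample the total counts add up to 5, whereas the fractional counts add up to 3, the number of papers. This is why a papers column can sum to more than the number of publications when entities collaborate.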
continent | affils | papers | frac | 2000_2004 | 2005_2009 | research | industrial | demo | vldb | sigmod | vldbj | tods |
---|---|---|---|---|---|---|---|---|---|---|---|---|
North America | 284 | 1983 | 1767 | 830 | 1153 | 1397 | 258 | 328 | 870 | 771 | 188 | 154 |
Europe | 217 | 642 | 504 | 257 | 385 | 436 | 46 | 160 | 319 | 167 | 88 | 68 |
Asia | 95 | 513 | 390 | 188 | 325 | 407 | 29 | 77 | 219 | 188 | 60 | 46 |
S.H. | 28 | 79 | 51 | 21 | 58 | 63 | 3 | 13 | 36 | 12 | 17 | 14 |
Data overview at the continental level (Africa, Oceania, and South America are subsumed under Southern Hemisphere, S.H.)
period | papers | vldb | sigmod | vldbj | tods | research | conf_res | industrial | demo | conf | journal |
---|---|---|---|---|---|---|---|---|---|---|---|
first half | 1120 | 534 | 416 | 99 | 71 | 757 | 587 | 151 | 212 | 950 | 170 |
second half | 1596 | 701 | 564 | 190 | 141 | 1147 | 816 | 160 | 289 | 1265 | 331 |
decade | 2716 | 1235 | 980 | 289 | 212 | 1904 | 1403 | 311 | 501 | 2215 | 501 |
Summary per five-year span and for the decade
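The derived columns are not defined explicitly, but the numbers suggest these relationships: `conf` = `vldb` + `sigmod`, `journal` = `vldbj` + `tods`, `research` = `conf_res` + `journal` (i.e. all journal papers count as research papers), and `papers` = `research` + `industrial` + `demo`. The short check below tests this reading against the three rows of the summary table. The relationships are inferred from the data, and the names `COLUMNS`, `ROWS`, and `is_consistent` are only illustrative.

```python
# Values copied from the summary table above.
COLUMNS = ["papers", "vldb", "sigmod", "vldbj", "tods", "research",
           "conf_res", "industrial", "demo", "conf", "journal"]
ROWS = {
    "first half":  [1120, 534, 416, 99, 71, 757, 587, 151, 212, 950, 170],
    "second half": [1596, 701, 564, 190, 141, 1147, 816, 160, 289, 1265, 331],
    "decade":      [2716, 1235, 980, 289, 212, 1904, 1403, 311, 501, 2215, 501],
}


def is_consistent(values):
    """Check the inferred column relationships for one table row."""
    r = dict(zip(COLUMNS, values))
    return (
        r["conf"] == r["vldb"] + r["sigmod"]
        and r["journal"] == r["vldbj"] + r["tods"]
        and r["research"] == r["conf_res"] + r["journal"]
        and r["papers"] == r["research"] + r["industrial"] + r["demo"]
        and r["papers"] == r["conf"] + r["journal"]
    )


results = {period: is_consistent(values) for period, values in ROWS.items()}

# The two five-year spans should add up to the decade in every column.
halves_add_up = [
    a + b for a, b in zip(ROWS["first half"], ROWS["second half"])
] == ROWS["decade"]
```

The same row check can be applied to the per-year table, which uses identical columns.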
venue | papers | 2000_2004 | 2005_2009 | research | industrial | demo |
---|---|---|---|---|---|---|
vldb | 1235 | 534 | 701 | 805 | 180 | 250 |
sigmod | 980 | 416 | 564 | 598 | 131 | 251 |
vldbj | 289 | 99 | 190 | 289 | 0 | 0 |
tods | 212 | 71 | 141 | 212 | 0 | 0 |
Base data by venue
year | papers | vldb | sigmod | vldbj | tods | research | conf_res | industrial | demo | conf | journal |
---|---|---|---|---|---|---|---|---|---|---|---|
2000 | 188 | 86 | 76 | 14 | 12 | 121 | 95 | 31 | 36 | 162 | 26 |
2001 | 203 | 92 | 76 | 23 | 12 | 138 | 103 | 28 | 37 | 168 | 35 |
2002 | 212 | 106 | 74 | 21 | 11 | 143 | 111 | 32 | 37 | 180 | 32 |
2003 | 225 | 111 | 79 | 20 | 15 | 163 | 128 | 17 | 45 | 190 | 35 |
2004 | 292 | 139 | 111 | 21 | 21 | 192 | 150 | 43 | 57 | 250 | 42 |
2005 | 293 | 133 | 108 | 24 | 28 | 202 | 150 | 38 | 53 | 241 | 52 |
2006 | 276 | 126 | 94 | 20 | 36 | 197 | 141 | 26 | 53 | 220 | 56 |
2007 | 317 | 139 | 127 | 25 | 26 | 226 | 175 | 27 | 64 | 266 | 51 |
2008 | 360 | 146 | 123 | 64 | 27 | 270 | 179 | 30 | 60 | 269 | 91 |
2009 | 350 | 157 | 112 | 57 | 24 | 252 | 171 | 39 | 59 | 269 | 81 |
Base data by year
country | affils | papers | frac | 2000_2004 | 2005_2009 | research | industrial | demo | vldb | sigmod | vldbj | tods |
---|---|---|---|---|---|---|---|---|---|---|---|---|
USA | 260 | 1868 | 1631 | 787 | 1081 | 1316 | 247 | 305 | 816 | 733 | 179 | 140 |
Germany | 69 | 243 | 184 | 108 | 135 | 147 | 23 | 73 | 129 | 68 | 26 | 20 |
Canada | 23 | 228 | 136 | 95 | 133 | 160 | 23 | 45 | 104 | 83 | 18 | 23 |
China | 29 | 211 | 151 | 49 | 162 | 176 | 3 | 32 | 81 | 75 | 31 | 24 |
Singapore | 5 | 116 | 75 | 34 | 82 | 102 | 1 | 13 | 44 | 49 | 17 | 6 |
France | 32 | 88 | 58 | 50 | 38 | 56 | 6 | 26 | 49 | 18 | 16 | 5 |
Italy | 28 | 88 | 58 | 37 | 51 | 57 | 3 | 28 | 36 | 24 | 12 | 16 |
India | 20 | 87 | 61 | 51 | 36 | 56 | 13 | 18 | 44 | 34 | 3 | 6 |
Switzerland | 10 | 67 | 50 | 15 | 52 | 41 | 6 | 20 | 40 | 18 | 6 | 3 |
Australia | 17 | 59 | 40 | 12 | 47 | 46 | 3 | 10 | 28 | 10 | 13 | 8 |
United Kingdom | 11 | 56 | 35 | 13 | 43 | 46 | 2 | 8 | 26 | 15 | 6 | 9 |
Israel | 9 | 55 | 41 | 17 | 38 | 43 | 1 | 11 | 24 | 15 | 10 | 6 |
Korea | 11 | 50 | 35 | 22 | 28 | 43 | 6 | 1 | 22 | 21 | 4 | 3 |
Greece | 10 | 48 | 32 | 17 | 31 | 44 | 0 | 4 | 14 | 15 | 16 | 3 |
Denmark | 5 | 34 | 21 | 15 | 19 | 27 | 3 | 4 | 20 | 4 | 5 | 5 |
The Netherlands | 10 | 33 | 24 | 15 | 18 | 26 | 2 | 5 | 16 | 8 | 6 | 3 |
Japan | 13 | 25 | 19 | 17 | 8 | 17 | 4 | 4 | 11 | 9 | 2 | 3 |
Austria | 5 | 15 | 10 | 8 | 7 | 8 | 2 | 5 | 11 | 1 | 0 | 3 |
Belgium | 4 | 13 | 8 | 2 | 11 | 12 | 0 | 1 | 3 | 2 | 0 | 8 |
Spain | 9 | 10 | 6 | 5 | 5 | 8 | 1 | 1 | 3 | 2 | 3 | 2 |
Top countries
author | papers | 2000_2004 | 2005_2009 | research | industrial | demo |
---|---|---|---|---|---|---|
Surajit Chaudhuri | 50 | 22 | 28 | 38 | 4 | 8 |
Divesh Srivastava | 49 | 25 | 24 | 39 | 2 | 8 |
Nick Koudas | 44 | 13 | 31 | 36 | 2 | 6 |
Jiawei Han | 41 | 16 | 25 | 34 | 0 | 7 |
H. V. Jagadish | 38 | 16 | 22 | 33 | 0 | 5 |
Beng Chin Ooi | 36 | 15 | 21 | 32 | 0 | 4 |
Alon Halevy | 36 | 18 | 18 | 30 | 3 | 3 |
Minos Garofalakis | 36 | 19 | 17 | 35 | 0 | 1 |
Raghu Ramakrishnan | 35 | 8 | 27 | 32 | 2 | 1 |
Yufei Tao | 34 | 13 | 21 | 34 | 0 | 0 |
Philip S. Yu | 33 | 10 | 23 | 28 | 4 | 1 |
Kian-Lee Tan | 32 | 14 | 18 | 27 | 0 | 5 |
Jeffrey Naughton | 32 | 18 | 14 | 31 | 1 | 0 |
Dimitris Papadias | 32 | 16 | 16 | 32 | 0 | 0 |
Gerhard Weikum | 29 | 9 | 20 | 17 | 0 | 12 |
Laks V. S. Lakshmanan | 28 | 21 | 7 | 23 | 0 | 5 |
Jennifer Widom | 27 | 18 | 9 | 22 | 1 | 4 |
Samuel Madden | 27 | 6 | 21 | 22 | 1 | 4 |
Dan Suciu | 27 | 13 | 14 | 25 | 0 | 2 |
Elke A. Rundensteiner | 27 | 12 | 15 | 10 | 0 | 17 |
Top authors
country | affils | papers | frac | first | second |
---|---|---|---|---|---|
USA | 180 | 1316 | 1132 | 545 | 771 |
China | 24 | 176 | 124 | 37 | 139 |
Canada | 15 | 160 | 93 | 61 | 99 |
Germany | 50 | 147 | 109 | 69 | 78 |
Singapore | 4 | 102 | 64 | 24 | 78 |
Italy | 23 | 57 | 36 | 22 | 35 |
France | 24 | 56 | 37 | 32 | 24 |
India | 14 | 56 | 38 | 30 | 26 |
Australia | 14 | 46 | 31 | 5 | 41 |
United Kingdom | 10 | 46 | 29 | 10 | 36 |
Countries by research papers only
country | affils | papers | frac | first | second |
---|---|---|---|---|---|
USA | 115 | 247 | 231 | 118 | 129 |
Canada | 10 | 23 | 16 | 14 | 9 |
Germany | 22 | 23 | 17 | 11 | 12 |
India | 10 | 13 | 9 | 7 | 6 |
France | 8 | 6 | 4 | 2 | 4 |
Countries by industrial papers only
country | affils | papers | frac | first | second |
---|---|---|---|---|---|
USA | 116 | 305 | 267 | 124 | 181 |
Germany | 35 | 73 | 58 | 28 | 45 |
Canada | 11 | 45 | 27 | 20 | 25 |
China | 19 | 32 | 23 | 12 | 20 |
Italy | 16 | 28 | 20 | 13 | 15 |
Countries by demo papers only