The numbers on this page were established with the version of the dblp.xml of the 17th Apr 2018.

Specification of pages

SELECT COUNT(*) FROM papers WHERE pagesTo-pagesFrom < 0;
 count 
-------
  1092

With this query, you can determine the number of papers where the specification of the pages is in the wrong order. For example, instead of 1-10, the pages are specified as 10-1, which results in a negative number of pages.

Let's show an example: the publication with the key journals/ijseke/IyerKJU05.
The column 'pages' contains the value 189-182.
If we look up the journal on the dblp website and look for the given article, we see the following:
Francis Eng Hock Tay, Myo Naing Nyan, Teck Hong Koh, K. H. W. Seah, Yih Yiow Sitoh: Smart Shirt That Can Call for Help After A Fall. 183-188
Neelima Iyer, Rupali Khatavkar, Bhagyashree Joshi, Vikram Upadhye: Lilliputian Hardware Platform For Scientific Applications. 189-182
T. Y. Jiang, J. Guzman, Huaizhong Li, Zhiming Gong: The Development of an Embedded, Integrated Ins/Gps System. 193-198
The second article is the one we were looking for, and we can see that the pages shouldn't be 189-182, but 189-192.

Another example is the publicaion with the key journals/tgis/SmithMKW04.
The dblp website shows the following information:
Jessica Smith, William A. Mackaness, Allison Kealy, Ian Williamson: Spatial Data Infrastructure Requirements for Mobile Location Based Journey Planning. 23-22
However, when we visit the article at https://onlinelibrary.wiley.com/doi/abs/10.1111/j.1467-9671.2004.00166.x, we can see that the specification of pages is also 23-22, so the error originates there and not on the dblp website.


Longest titles

SELECT COUNT(*) FROM papers WHERE LENGTH(title) > 350;
 count 
-------
   327

More than 300 papers have a title that is longer than 350 characters, which is about as long as the text in the next section of this website, shortest titles (including the queries).


Shortest titles

SELECT COUNT(*) FROM papers WHERE LENGTH(title) <= 3;
 count 
-------
    58

This query shows the number of titles that are shorter than 4 characters. They are mostly just characters with a point, which is probably not a valid name.


SELECT COUNT(*) FROM papers WHERE LENGTH(title) <= 4;
 count 
-------
   458

More than 400 papers have a very short title of four or less characters.


Null value as journal

SELECT COUNT(*) FROM papers WHERE etype='article' AND journal IS NULL;
 count 
-------
   230

In more than 200 articles, the journal is not specified.


Booktitles

SELECT COUNT(*) FROM papers WHERE etype='article' AND NOT(booktitle IS NULL);
 count 
-------
   223

More than 200 articles have a specified booktitle even though articles should not have one.


SELECT COUNT(*) FROM papers WHERE etype='proceedings' AND booktitle IS NULL;
 count 
-------
   479

More than 400 proceedings do not have a specified booktitle.


SELECT COUNT(*) FROM papers WHERE LENGTH(booktitle) <= 2 AND booktitle IS NOT NULL;
 count 
-------
 27331

The booktitle of more than 27000 publications is only up to two characters long.


SELECT COUNT(*) FROM papers WHERE LENGTH(booktitle) <= 3 AND booktitle IS NOT NULL;
 count  
--------
 292869

More than 280000 publications have a booktitle that is not longer than three characters.


Pages

SELECT COUNT(*) FROM papers WHERE LENGTH(pages) >= 15;
 count 
-------
 14267

In more than 13000 publications, the description of the pages is at least 15 characters long.


SELECT COUNT(*) FROM papers WHERE pages LIKE '%e%';
 count 
-------
  2128

Some papers have an "e" in the pages column.


Short publishers

SELECT COUNT(*) FROM papers WHERE LENGTH(publisher) <= 1;
 count 
-------
     3

In 3 papers, the specified publisher is only up to one character long.


Papers without year of publication

SELECT COUNT(*) FROM papers WHERE publYear IS NULL;
 count 
-------
     6

Six papers that aren't websites don't have a year of publication specified.


Least important columns

Some columns are rarely used. The following ones are the columns that include the least amount of values.

SELECT COUNT(*) FROM papers WHERE reviewid IS NOT NULL;

Number of specified reviewids: 0

SELECT COUNT(*) FROM papers WHERE rating IS NOT NULL;

Number of specified ratings: 0

SELECT COUNT(*) FROM papers WHERE chapter IS NOT NULL;

Number of specified chapters: 2

SELECT COUNT(*) FROM papers WHERE address IS NOT NULL;

Number of specified addresses: 3

SELECT count(*) FROM papers WHERE cite IS NOT NULL;

Number of specified cites: 8263