Wikipedia categories (en)

This is the bipartite network of English Wikipedia articles and the categories they are contained in. Left nodes are articles and right nodes are categories.

Metadata

CodeWC
Internal namewiki-en-cat
NameWikipedia categories (en)
Data sourcehttp://dumps.wikimedia.org/
AvailabilityDataset is available for download
Consistency checkDataset passed all tests
Category
Feature network
Node meaningArticle, category
Edge meaningInclusion
Network formatBipartite, undirected
Edge typeUnweighted, no multiple edges

Statistics

Size n =2,036,440
Left size n1 =1,853,493
Right size n2 =182,947
Volume m =3,795,796
Wedge count s =898,164,591
Claw count z =1,363,824,377,144
Cross count x =3,001,843,059,062,188
Square count q =160,976,541
4-Tour count T4 =4,888,065,352
Maximum degree dmax =11,593
Maximum left degree d1max =54
Maximum right degree d2max =11,593
Average degree d =3.727 87
Average left degree d1 =2.047 91
Average right degree d2 =20.748 1
Fill p =1.119 40 × 10−5
Size of LCC N =1,893,657
Diameter δ =46
50-Percentile effective diameter δ0.5 =11.377 1
90-Percentile effective diameter δ0.9 =15.307 2
Mean distance δm =11.750 1
Gini coefficient G =0.640 075
Balanced inequality ratio P =0.257 484
Left balanced inequality ratio P1 =0.368 946
Right balanced inequality ratio P2 =0.197 671
Relative edge distribution entropy Her =0.902 290
Power law exponent γ =2.614 68
Tail power law exponent γt =2.381 00
Degree assortativity ρ =+0.093 530 5
Degree assortativity p-value pρ =0.000 00
Spectral norm α =147.912
Algebraic connectivity a =0.000 224 266
Spectral separation 1[A] / λ2[A]| =1.095 29

Plots

Degree distribution

Cumulative degree distribution

Lorenz curve

Spectral distribution of the adjacency matrix

Spectral distribution of the normalized adjacency matrix

Spectral distribution of the Laplacian

Spectral graph drawing based on the adjacency matrix

Spectral graph drawing based on the Laplacian

Spectral graph drawing based on the normalized adjacency matrix

Hop distribution

Downloads

References

[1] Jérôme Kunegis. KONECT – The Koblenz Network Collection. In Proc. Int. Conf. on World Wide Web Companion, pages 1343–1350, 2013. [ http ]
[2] Wikimedia Foundation. Wikimedia downloads. http://dumps.wikimedia.org/, January 2010.