geom {netdata}    R Documentation

Geom – Collaboration network in computational geometry

Description

The network Geom is based on the file geombib.bib, which contains the Computational Geometry Database, version February 2002.

The author collaboration network in computational geometry was produced from the BibTeX bibliography [Beebe, 2002] obtained from the Computational Geometry Database geombib, version February 2002 [Jones, 2002].

Two authors are linked with an edge iff they wrote a common work (paper, book, ...). The value of an edge is the number of common works. A simple program written in Python transformed the BibTeX data into the corresponding network and wrote it to a file in Pajek format.
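The transformation described above can be sketched as follows. This is not the original program, which is not part of this documentation. It is a minimal illustration that assumes each BibTeX author field joins names with " and " (the usual BibTeX convention, simplified here to a plain string split) and that the basic Pajek layout of a *Vertices section followed by an *Edges section is sufficient. The function names and the sample author fields are hypothetical.

```python
from itertools import combinations


def split_authors(author_field):
    """Split a BibTeX author field whose names are joined by ' and '."""
    return [name.strip() for name in author_field.split(" and ") if name.strip()]


def coauthor_multi_edges(works):
    """Return one (a, b) edge per work for every pair of its authors."""
    edges = []
    for authors in works:
        # sorted(set(...)) drops repeated names and fixes the order within a pair
        edges.extend(combinations(sorted(set(authors)), 2))
    return edges


def to_pajek(works, edges):
    """Render the network as text in a basic Pajek layout (assumed sufficient)."""
    names = sorted({name for authors in works for name in authors})
    index = {name: i for i, name in enumerate(names, start=1)}
    lines = [f"*Vertices {len(names)}"]
    lines += [f'{i} "{name}"' for name, i in index.items()]
    lines.append("*Edges")
    lines += [f"{index[a]} {index[b]} 1" for a, b in edges]
    return "\n".join(lines) + "\n"


# Hypothetical author fields, not taken from geombib.bib
fields = [
    "A. Author and B. Author",
    "A. Author and B. Author and C. Author",
    "D. Author",
]
works = [split_authors(field) for field in fields]
edges = coauthor_multi_edges(works)
pajek_text = to_pajek(works, edges)
print(pajek_text)
```

In this sketch every common work contributes its own edge, so a pair of authors with two common works is joined by two parallel edges. An author without coauthors still appears as an isolated vertex.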

The resulting network has 9072 vertices (authors) and 22577 edges (common papers or books). As a simple network, in which multiple edges between a pair of authors are replaced with a single edge, it has 13567 edges.
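The reduction from a network with multiple edges to a simple network can be illustrated with a short sketch. It assumes edges are given as pairs of vertex numbers and that the value of each resulting edge is the number of parallel edges it replaces. The function name and the sample edge list are hypothetical.

```python
from collections import Counter


def simplify(multi_edges):
    """Collapse parallel edges; each value counts the edges that were merged."""
    weights = Counter()
    for a, b in multi_edges:
        if a == b:
            continue  # ignore loops
        # an undirected edge is the same pair regardless of endpoint order
        weights[tuple(sorted((a, b)))] += 1
    return dict(weights)


# Hypothetical multi-edge list: the pair (1, 2) occurs three times
multi_edges = [(1, 2), (2, 1), (1, 3), (2, 3), (1, 2)]
simple = simplify(multi_edges)
print(len(multi_edges), "edges reduce to", len(simple), "simple edges")
```

The sum of the values in the simple network equals the number of edges in the original network, which is how the two edge counts quoted above relate to each other.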

The problem with this network is that, because author names are not written in a standardized way, it contains several vertices corresponding to the same author. For example:

R.S. Drysdale, Robert L. Drysdale, Robert L. Scot Drysdale,
R.L. Drysdale, S. Drysdale, R. Drysdale, and R.L.S. Drysdale;

or:

Pankaj K. Agarwal, P. Agarwal, Pankaj Agarwal, and
P.K. Agarwal

These variants are easy to guess, but 'insider' information is needed to know that Otfried Schwarzkopf and Otfried Cheong are the same person. Also, the database makes no provision for distinguishing two persons with the same name. We manually produced the name equivalence partition and then shrank the network (in Pajek) according to it.
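Shrinking a network according to a name equivalence partition can be sketched as below. This is an illustration of the idea, not of Pajek's own procedure: it assumes the partition is stored as a mapping from each name variant to one chosen representative, that values of merged edges are summed, and that edges whose endpoints fall in the same class are dropped. The choice of representative, the coauthor "X. Coauthor", and all edge values are hypothetical.

```python
from collections import Counter


def shrink(weights, canonical):
    """Merge vertices of the same class; sum edge values and drop loops."""
    merged = Counter()
    for (a, b), value in weights.items():
        # names missing from the mapping form their own class
        ca, cb = canonical.get(a, a), canonical.get(b, b)
        if ca == cb:
            continue  # both endpoints are the same person after merging
        merged[tuple(sorted((ca, cb)))] += value
    return dict(merged)


# Name variants from the examples above; the representative is an arbitrary choice
canonical = {
    "P. Agarwal": "Pankaj K. Agarwal",
    "Pankaj Agarwal": "Pankaj K. Agarwal",
    "P.K. Agarwal": "Pankaj K. Agarwal",
    "Otfried Schwarzkopf": "Otfried Cheong",
}

# Hypothetical edges and values, for illustration only
weights = {
    ("P. Agarwal", "X. Coauthor"): 1,
    ("P.K. Agarwal", "X. Coauthor"): 2,
    ("Otfried Cheong", "Otfried Schwarzkopf"): 1,
}
reduced = shrink(weights, canonical)
print(reduced)
```

The mapping can only merge vertices. It cannot separate two different persons who share a name, which is the limitation noted above.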

The reduced simple network contains 7343 vertices and 11898 edges. It is a sparse network: its average degree is 2m/n = 3.24, where n is the number of vertices and m is the number of edges.
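The average degree can be checked directly from the two counts. The density computed alongside it is not stated in the original description and is included only to illustrate how sparse the network is, using the standard definition for a simple undirected network.

```python
n = 7343   # vertices in the reduced simple network
m = 11898  # edges in the reduced simple network

# every edge contributes to the degree of two vertices
average_degree = 2 * m / n

# share of all possible vertex pairs that are actually linked
density = 2 * m / (n * (n - 1))

print(f"average degree = {average_degree:.2f}")
print(f"density = {density:.6f}")
```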

Usage

data(geom)

Details

Use data(package="netdata") to get a full list of networks.

Licenses and Citation

If the source of the data set does not specify otherwise, this data set is protected by the Creative Commons License http://creativecommons.org/licenses/by-nc-nd/2.5/.

When publishing results obtained using this data set, the original authors should be cited. In addition, this package should be cited as:

Mark S. Handcock, David Hunter, Carter T. Butts, Steven M. Goodreau, and Martina Morris (2003): statnet: An R package for the Statistical Modeling of Social Networks
http://www.csde.washington.edu/statnet
and the source should be cited as:

Vladimir Batagelj and Andrej Mrvar (2006): Pajek datasets
http://vlado.fmf.uni-lj.si/pub/networks/data/.

Source

http://vlado.fmf.uni-lj.si/pub/networks/data/collab/geom.htm

References

See the link under Source above.

See Also

network, sna


[Package netdata version 0.5-1 Index]