-
Notifications
You must be signed in to change notification settings - Fork 13
Network dataset extraction library – part of the KONECT project by Jérôme Kunegis, University of Namur
License
kunegis/konect-extr
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
KONECT Extraction ================= This collection of network extraction tools is part of the KONECT project, i.e., the Koblenz Network Collection, a network analysis project by Jérôme Kunegis. http://konect.cc/ This package contains code for generating the network datasets used in KONECT. In each subdirectory of extr/ you can find code to extract one group of datasets, usually from one source, and generate a TSV file from it. If you are looking for the code for a specific network, you can find the name of the subirectory in the field "extr" of the meta.* file, which comes with the TSV files you can download from our network pages. Please note that not every network has the extraction code publically available. This is only the code we can make available. Also, many datasets have been extracted so long ago that we don't have the code anymore. Also, some datasets have been contributed by people that didn't give us their code. (If you contribute a dataset to KONECT, the best is to give us a self-contained directory that uses Stu to generate the dataset in the correct format.) Keep in mind that most code in here WAS EXECUTED JUST ONCE. The code is of corresponding quality. We provide the code for the purpose of full disclosure -- not in order to deliver a full-features extraction library. Some directories here also contain code for analysing the datasets, which has been used in a few papers. Usage ===== To build the datasets, execute "make" or "stu" inside each subdirectory of extr/, depending on whether a 'Makefile' or a 'main.stu' is present. The code downloads the datasets from their online sources and converts them to the KONECT format. Note that many directories have additional requirements. Read the 'Makefile' or 'main.stu' (or rarely, 'README') for more information. Requirements ============ You may need to install the following additional software packages, depending on the dataset: - Stu: https://github.com/kunegis/stu/ - Perl 5 - Some other tools; depending on the dataset Extraction Tools ================ The directories sh/ and c/ contain general-purpose functions for transforming data into the KONECT format. They are used by various directories under extr/, and you may also use them to extract your own software. Contributing ============ See CONTRIBUTING. If you use and like KONECT, please consider contributing to it by making a donation to: https://www.paypal.com/donate/?hosted_button_id=Q9JY2FB3AFHR6 License ======= The extraction code and library is distributed under the terms of the GNU General Public License version 3, which you can find provided in the file 'COPYING' in this directory. Stu === We're converting the directories to Stu, a replacement for Make, also written by Jérôme Kunegis. Stu can be installed from https://github.com/kunegis/stu/ Stu targets: default target: generate all data. @deploy: copy over to konect/. These copy the relevant files into konect/dat/, but usually don't create the symlinks in uni/. Code Quality ============ Most code in the directories is broken, because it was last executed just once, and that was a long time ago in most cases. Many things have changed in the time. If you want to execute any of this code, you'll most likely need to hack it. In particular: - The URL from which KONECT gets the data is not available anymore. - Scripts have been moved to sh/, but uses of it have not been updated - Extraction directories are now all under extr/, but code has not been changed. - Some code is really old and was written long before I learned proper practices for writing shell scripts, etc. Also, some directories use Stu and older ones Make. If you need help, please politely ask Jérôme Kunegis <[email protected]> and he may update the code to work. (Up to websites that have disappeared; Jérôme can't do anything about those ;)
About
Network dataset extraction library – part of the KONECT project by Jérôme Kunegis, University of Namur
Topics
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published