Add SfB and ASB #38

Open
nichtich opened this issue Aug 23, 2022 · 3 comments

Comments

@nichtich
Member

See the bachelor thesis https://nbn-resolving.org/urn:nbn:de:bsz:14-qucosa2-211701 as a starting point.

@guitarster
Contributor

Both SfB and ASB use a wiki to maintain their classification:

So converting them to JSKOS would mean a lot of copy-and-paste work, which is how the author of the bachelor thesis did it (see page 50, last paragraph).
Or is there a feasible way to scrape the content of the wiki with a script (e.g. written in Python)?
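
However the wiki content ends up being collected, the conversion target is the same. As a rough sketch of that last step, assuming each class has already been reduced to a notation, a label, and an optional parent notation, the records could be mapped to minimal JSKOS concepts as below. The namespace `http://example.org/asb/` and the sample rows are placeholders, not real ASB data or agreed URIs.

```python
import json

# Hypothetical namespace; real scheme and concept URIs would still have to be decided.
SCHEME_URI = "http://example.org/asb/"


def to_jskos(notation, label, parent=None, lang="de"):
    """Build a minimal JSKOS concept from one scraped class."""
    concept = {
        "uri": SCHEME_URI + notation,
        "notation": [notation],
        "prefLabel": {lang: label},
        "inScheme": [{"uri": SCHEME_URI}],
    }
    if parent is not None:
        # Link to the parent class via its (hypothetical) URI.
        concept["broader"] = [{"uri": SCHEME_URI + parent}]
    return concept


# Made-up rows (notation, label, parent notation), for illustration only.
rows = [
    ("X", "Beispielklasse", None),
    ("Xa", "Beispielunterklasse", "X"),
]

concepts = [to_jskos(notation, label, parent) for notation, label, parent in rows]

# One JSON object per line (newline-delimited JSON).
ndjson = "\n".join(json.dumps(c, ensure_ascii=False) for c in concepts)
print(ndjson)
```

Notations containing spaces or special characters would need escaping before being appended to a URI; that is left out here.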

@guitarster
Contributor

https://www.crummy.com/software/BeautifulSoup/bs4/doc/ seems to be an option. Will try it out with ASB some time.

@nichtich
Member Author

nichtich commented Sep 27, 2022

MediaWiki has an API, e.g. https://www.sfb-online.de/wiki/api.php. In particular, the parse method returns the content of a page: https://www.sfb-online.de/wiki/api.php?action=help&modules=parse

example query:

Either use Wikitext, e.g.

curl -s 'https://www.sfb-online.de/wiki/api.php?action=parse&page=BID&prop=wikitext&format=json' | jq -r '.parse.wikitext[]'
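
The same request could be made from a Python script using only the standard library. This is a sketch, not tested against the live wiki: the function names are made up for illustration, and the response layout (text under the `"*"` key) is assumed from the default `format=json` output that the jq filter above unpacks.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

API_URL = "https://www.sfb-online.de/wiki/api.php"  # endpoint from the comment above


def parse_url(page, prop="wikitext"):
    """Build the action=parse query URL for one wiki page."""
    query = {"action": "parse", "page": page, "prop": prop, "format": "json"}
    return API_URL + "?" + urlencode(query)


def extract_wikitext(response):
    """Pull the wikitext string out of a decoded action=parse response.

    In the default JSON format the text sits under the "*" key, which is
    what the jq filter '.parse.wikitext[]' iterates over.
    """
    return response["parse"]["wikitext"]["*"]


def fetch_wikitext(page):
    """Download one page and return its wikitext (needs network access)."""
    with urlopen(parse_url(page)) as reply:
        return extract_wikitext(json.load(reply))


# Usage (requires network access):
#   print(fetch_wikitext("BID"))
```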

Or request HTML and process it with BeautifulSoup.
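
For the HTML route, a sketch of the idea follows. To keep it dependency-free it uses Python's built-in `html.parser` as a stand-in for BeautifulSoup; with BeautifulSoup the equivalent would be roughly `BeautifulSoup(html, "html.parser").find_all("a")`. It assumes the API is called with `prop=text`, that the HTML arrives under `parse.text["*"]`, and that the interesting entries are links; the actual page structure of the SfB and ASB wikis has not been checked, and the sample response is made up.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect (href, link text) pairs from an HTML fragment."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None  # href of the <a> element currently open, if any
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        # Only keep text that appears inside a link with an href.
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def extract_links(response):
    """Return the links found in a decoded action=parse&prop=text response."""
    collector = LinkCollector()
    collector.feed(response["parse"]["text"]["*"])
    collector.close()
    return collector.links


# Made-up response in the assumed shape, for illustration only.
sample = {
    "parse": {
        "title": "Example",
        "text": {"*": '<ul><li><a href="/wiki/Xa">Xa Example class</a></li></ul>'},
    }
}
print(extract_links(sample))
```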

@guitarster guitarster removed their assignment Jun 27, 2023