Skip to content

This project is used to crawl data from Chinese website and tokenise data afterwards.

Notifications You must be signed in to change notification settings

manliu1225/Crawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Crawler

crawl data

crawl_app_news.py can be used as a template.

tokenize data

use Baidu API to get pos and tokenization.

json to cols

About

This project is used to crawl data from Chinese website and tokenise data afterwards.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages