Required Package

$ pip install -r requirements.txt

General Administration Information

Server

Server running as @NBSRESEARCHSVR.

The nature of the raw data, the source of the data, and the time period.

Nature - 公司业绩说明会, 公司高层与网民的交流信息
File Format - json and in the MySql database
Source - http://ircs.p5w.net/
Time period - Crawled at 2 March 2014. Data from 2009-02-27 to 2014-02-28.

The program codes used to crawl the data and where the raw data are stored on the server.

Crawler code: /data/danyang/projects/qna/src/crawler/js_response_crawler
Data crawled: /data/danyang/projects/qna/data
To run the code:

$ cd /data/danyang/projects/qna/src/crawler/js_response_crawler
$ python main.py

The program codes used to create the final database and indicate where to find the database.

Database Data Import code: /data/danyang/projects/qna/src/json_to_db
To run the code:

$ cd /data/danyang/projects/qna/src
$ python shell.py

Locate the database:

$ mysql -u website -p
mysql> use qna;
mysql> SELECT * FROM STOCK LIMIT 20;

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

readme.md

readme.md

Required Package

General Administration Information

Server

The nature of the raw data, the source of the data, and the time period.

The program codes used to crawl the data and where the raw data are stored on the server.

The program codes used to create the final database and indicate where to find the database.

Files

readme.md

Latest commit

History

readme.md

File metadata and controls

Required Package

General Administration Information

Server

The nature of the raw data, the source of the data, and the time period.

The program codes used to crawl the data and where the raw data are stored on the server.

The program codes used to create the final database and indicate where to find the database.