EXTRACTING INFORMATION FROM PDF INVOICES

Effortlessly extract information trapped in PDF invoices. We are using Adobe PDF Services Extract API for extraction and outputting important data in a CSV format.

Brief Description

main folder contains all the logical operations.
ProductHandler.js file creates or initialize the Products.json file.
index.js file Extracts the pdf data as zip files and stores in the ExtractedZip folder.
unzip. js file unzips the extracted zipped files and stores them in the ExtractedUnzip folder.
jsonHandler.js file process the extracted data and store it in Products.json.
csvHandler.js file converts Products.json to CSV.
Products.json file contains the extracted data in JSON format.
ExtractedProduct.csv file contains the final output.

Installation

Run in your local machine terminal

Clone the Repository :

git clone https://github.com/Anand-shreya/AdobeHackathon_pdfExtractor.git

Go to folder :

cd AdobeHackathon_pdfExtractor

Install the node Packages :

npm install

Usage

To Extract data from Invoices, place all the invoices in the resources folder.
Update the pdfservices-api-credentials.json file with your Adobe PDF Services API credentials.
Run the script to create or update (if it already exists) the products.json file.

npm start

Run the script to extract JSON data.

npm run createJson

Run the script to convert JSON data to a CSV file.

npm run createCsv

API Reference

Documentation of Adobe Acrobat Services APIs link.
To generate credentials link.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
config		config
main		main
resources		resources
ExtractedProducts.csv		ExtractedProducts.csv
LICENSE.md		LICENSE.md
Products.json		Products.json
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
pdfservices-api-credentials.json		pdfservices-api-credentials.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EXTRACTING INFORMATION FROM PDF INVOICES

Brief Description

Installation

Usage

API Reference

About

Releases

Packages

Languages

License

Anand-shreya/AdobeHackathon_pdfExtractor

Folders and files

Latest commit

History

Repository files navigation

EXTRACTING INFORMATION FROM PDF INVOICES

Brief Description

Installation

Usage

API Reference

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages