This repository provides an API for converting PDF documents into structured JSON data. The API is build using FastAPI web framework for building APIs with Python. The conversion process utilizes industry-standard libraries for PDF parsing and JSON serialization, ensuring accurate extraction of content from PDF files.
- PDF to JSON conversion
- File upload functionality
- Displaying structured JSON output
- backend language - Python,
- backend framework - FastAPI,
- frontend - HTML, CSS, JavaScript,
- Docker
- Clone the repository
git clone https://github.com/justkr/pdf_to_structured_json.git
- Build and run the Docker container
docker build -t pdf_reader_api .
docker run -p 8000:80 pdf_reader_api
- Access the API at
http://localhost:8000
Press CTRL+C to quit API
This FastAPI-based API provides functionality for uploading PDF files and extracting structured text for them.
-
- Method: 'POST'
- Endpoint: '/'
- Description: Upload a PDF file for text extraction
-
- Method: 'GET'
- Endpoint: '/'
- Description: Display extracted text from PDF in structured JSON form