Installation¶
Requirements¶
- Python 3.12 or higher
Install from PyPI¶
Install from Source¶
For development or to get the latest changes:
git clone https://github.com/artefactory-uk/document-extraction-tools.git
cd document-extraction-tools
uv sync
Verify Installation¶
Dependencies¶
The library has minimal core dependencies:
pydantic>=2.0.0- Data validation and settings managementPyYAML>=6.0.3- YAML configuration supportnumpy>=2.4.1- Numerical operationspillow>=12.1.0- Image processing
Additional dependencies may be required depending on your specific extractors and converters (e.g., PDF libraries, OCR engines, LLM clients).