# Multimodal Meaning Kernel Extraction Pipeline
# Required Python dependencies

# Image processing
Pillow>=10.0.0

# OCR (Optical Character Recognition)
pytesseract>=0.3.10

# PDF processing
pdf2image>=1.16.3

# Optional: Enhanced computer vision
# opencv-python>=4.8.0
# numpy>=1.24.0

# Optional: Machine learning for diagram classification
# scikit-learn>=1.3.0
# torch>=2.0.0
# torchvision>=0.15.0

# Development and testing
# pytest>=7.4.0
# black>=23.0.0
# flake8>=6.0.0