Public
Like
Jpegconversionfrtb
Val Town is a collaborative website to build and scale JavaScript apps.
Deploy APIs, crons, & store data – all from the browser, and deployed in milliseconds.
Viewing readonly version of main branch: v50View latest version
A robust web application that allows users to upload JPEG and HEIF images and convert them to HTML using advanced OCR (Optical Character Recognition) with mathematical formula preservation. Now with enhanced HEIF support and improved error handling.
- Enhanced HEIF Support: Robust handling of HEIF/HEIC files with automatic conversion to JPEG
- Drag and Drop Interface: Easy file upload with visual feedback for JPEG and HEIF formats
- Multiple File Upload: Process up to 60 files simultaneously with batch processing
- Advanced OCR: Text recognition using Tesseract.js with mathematical formula detection
- Mathematical Formula Recognition: Specialized detection and preservation of mathematical expressions
- LaTeX Math Rendering: Mathematical formulas preserved using MathJax in HTML output
- Real-time Progress Tracking: Detailed progress monitoring for each file including conversion status
- Error Recovery: Comprehensive error handling with detailed error messages
- HTML Output Generation: Clean, styled HTML with mathematical notation properly rendered
- Improved Library Loading: Better handling of heic2any library initialization
- Enhanced File Detection: More robust HEIF file format detection using both MIME types and file extensions
- Higher Quality Conversion: Optimized JPEG conversion settings for better OCR accuracy
- Better Error Handling: Detailed error messages for conversion failures
- Progress Tracking: Real-time status updates during HEIF conversion process
- Enhanced Character Recognition: Expanded character whitelist for better mathematical symbol detection
- Improved Engine Configuration: Optimized Tesseract.js settings for mathematical content
- Better Progress Reporting: More granular progress updates during OCR processing
- Library Availability Checks: Robust checking for library availability before processing
- HEIF File Indicators: Clear badges showing which files were converted from HEIF
- Enhanced Statistics: Detailed processing summary including conversion counts
- Better Visual Design: Improved styling with file type indicators and processing notes
- Conversion Tracking: Clear indication of which files underwent HEIF conversion
├── backend/
│ └── index.ts # Enhanced API server with robust HEIF support and improved error handling
├── frontend/
│ ├── index.html # Updated HTML template with improved library loading
│ ├── index.tsx # Enhanced React application with better HEIF processing
│ └── style.css # Custom styles with math formula styling
└── README.md
- Automatic Format Detection: Robust detection of HEIF/HEIC files using both MIME types and file extensions
- Improved Library Loading: Better handling of heic2any library with retry mechanisms
- High-Quality Conversion: Optimized JPEG conversion settings (95% quality) for better OCR accuracy
- Conversion Progress Tracking: Real-time status updates during HEIF to JPEG conversion
- Error Recovery: Graceful handling of conversion failures with detailed error messages
- Preview Generation: Automatic preview generation for HEIF files after conversion
- Enhanced Pattern Detection: Improved detection of mathematical expressions and symbols
- Extended Symbol Support: Support for Greek letters, mathematical operators, and special symbols
- LaTeX-Style Preservation: Mathematical formulas preserved in LaTeX format for HTML output
- MathJax Integration: Proper mathematical display using MathJax rendering engine
- Formula Counting: Automatic counting and display of detected mathematical formulas
- Enhanced Tesseract Configuration: Optimized settings for mathematical content recognition
- Extended Character Whitelist: Comprehensive character set including mathematical symbols
- Progress Monitoring: Detailed progress tracking with percentage completion
- Error Handling: Comprehensive error handling with specific error messages
- Library Availability Checks: Robust checking for required libraries before processing
- Batch Processing: Efficient processing of multiple files with individual status tracking
- Visit the application URL
- Drag and drop JPEG/HEIC files or click to select files (multiple files supported)
- Click "Start OCR Processing" to begin text and mathematical formula extraction
- Monitor the progress of each file being processed with specialized math recognition
- Once complete, click "Download HTML" to get a formatted HTML file with preserved mathematical formulas
- Drag & Drop Interface: Easy file upload with visual feedback for JPEG and HEIC formats
- HEIC Support: Automatic conversion of HEIC files to JPEG for processing
- Multiple File Support: Process multiple images at once
- Mathematical Formula Recognition: Specialized OCR for mathematical expressions and formulas
- LaTeX Math Rendering: Mathematical formulas preserved using MathJax in HTML output
- Real-time Progress: See processing status and progress for each file
- OCR Confidence Scores: View confidence levels for text recognition
- Preview Images: See thumbnails of uploaded images (HEIC converted to JPEG for preview)
- Formatted HTML Output: Clean, styled HTML with mathematical notation properly rendered
- Error Handling: Clear error messages for failed processing
- File Validation: Automatic filtering to accept JPEG and HEIC files
- Backend: Hono.js for API routes
- Frontend: React with TypeScript
- OCR: Tesseract.js for general text recognition
- Math OCR: Specialized mathematical formula recognition
- Math Rendering: MathJax for LaTeX formula display in HTML
- Image Processing: HEIC to JPEG conversion support
- Styling: TailwindCSS