Supported File Types
- Microsoft Office: Word (.docx), Excel (.xlsx), PowerPoint (.pptx)
- PDF documents (.pdf)
- Images (with EXIF metadata and OCR support)
- Audio files (with EXIF metadata)
- HTML files (with special handling for various sites)
- Text files (.txt, .md)
- Various other formats (CSV, JSON, XML, etc.)
First Conversion Note
The first conversion might take a few moments as the converter engine initializes. Subsequent conversions will be much faster.
Technical Implementation
Architecture Overview
This application uses a modern web architecture combining several key technologies:
1. Core Technologies
- Pyodide: A WebAssembly port of Python, enabling Python execution in the browser
- Web Workers: For non-blocking file processing
- Marked.js: For markdown rendering
2. File Processing Pipeline
Browser → Web Worker → Pyodide → MarkItDown → Markdown Output
3. Data Flow
- File is dropped or selected
- For simple text files: Direct browser processing
- For complex files:
- File is sent to Web Worker
- Worker initializes Python environment
- MarkItDown library processes the file
- Result is sent back to main thread
- Markdown is rendered in the preview panel
4. Performance Considerations
- Web Worker prevents UI blocking during conversion
- Pyodide environment is cached after first load
- Simple files bypass the Python pipeline for faster processing
Privacy & Security
- All processing happens client-side
- No server uploads required
- Data never leaves the user's browser