CodeHarvester
A powerful web-based tool that recursively processes folders to extract and combine text files into a single output, with advanced visualization and analysis features.
Available on: codeharvester.pages.dev
✨ Key Features
📁 File Processing
- Recursive folder processing - Automatically processes all text files in selected folder and subfolders
- Smart file detection - Intelligently identifies text files based on extensions and MIME types
- Multiple encoding support - Handles UTF-8, Windows-1252, and ISO-8859-1 encodings
- Large file handling - Safely skips files larger than 10MB to prevent browser freezing
- Pause/Resume functionality - Control processing with pause and resume capabilities
🎨 User Interface
- Modern responsive design - Clean, professional interface that works on all devices
- Dark/Light theme toggle - Switch between themes with automatic persistence
- Progress tracking - Real-time progress bar showing processing status
- Tabbed interface - Organized views for output, errors, and file preview
🌳 Directory Visualization
- Interactive directory tree - Visual representation of folder structure
- Expandable/Collapsible nodes - Navigate through directory hierarchy
- File status indicators - Visual indicators showing processed (✅), skipped (⏭️), or error (❌) files
- File size display - Shows individual file sizes in the tree
- Copy tree structure - Export directory tree as formatted text
📊 Analytics & Statistics
- Processing statistics - Track files processed, errors, total size, and skipped files
- File type analysis - Visual chart showing distribution of file types
- Error tracking - Detailed error log with timestamps and descriptions
- Real-time updates - Statistics update during processing
📝 Output Options
- Multiple formats:
- Tagged format -
<file path> and <file content> tags (default)
- Markdown format - Clean markdown with code blocks
- JSON format - Structured data with metadata
- File filtering - Filter output by filename or extension
- Download support - Save results as .txt or .json files
- Clipboard integration - One-click copy to clipboard
🔍 File Preview
- In-app file preview - View individual file contents without processing
- Syntax preservation - Maintains original formatting and structure
- Quick access - Click any processed file in the directory tree to preview
🚀 How to Use
Getting Started
- Select Folder: Click the "📂 Select Folder" button to choose your target directory
- Monitor Progress: Watch the real-time progress bar and statistics
- Control Processing: Use pause/resume functionality if needed
- Review Results: Examine the directory tree, statistics, and any errors
Viewing Results
- Combined Output Tab: View all processed files in your chosen format
- Errors Tab: Check any files that couldn't be processed
- File Preview Tab: Preview individual files by clicking them in the directory tree
Export Options
- Download: Save the combined output as a file
- Copy: Copy results to clipboard for pasting elsewhere
- Filter: Use the filter box to find specific files in the output
Advanced Features
- Change Format: Switch between tagged, markdown, and JSON output formats
- Theme Toggle: Switch between light and dark themes
- Keyboard Shortcuts:
Ctrl/Cmd + O: Open folder selector
Ctrl/Cmd + S: Download output
Ctrl/Cmd + Shift + C: Copy to clipboard
🛠️ Technical Details
Browser Support
File Type Support
Supports 80+ file extensions including:
- Web Technologies:
.html, .css, .js, .ts, .jsx, .tsx, .vue, .svelte
- Programming Languages:
.py, .java, .cpp, .cs, .go, .rs, .php, .rb
- Configuration:
.json, .yaml, .toml, .ini, .env, .gitignore
- Documentation:
.md, .txt, .rst
- And many more...
Security & Privacy
- 100% Client-Side Processing - No files are uploaded to any server
- Local Storage Only - All data remains on your device
- No Data Collection - No tracking or analytics
🎯 Use Cases
- Code Documentation - Combine entire codebases for documentation or AI analysis
- Project Analysis - Get overview of project structure and file distribution
- Code Review - Export code for sharing or review purposes
- Backup Preparation - Create structured exports of text-based projects
- Migration Planning - Analyze folder structures before migrations
🔧 Advanced Configuration
Custom File Filtering
The app intelligently detects text files but you can also:
- Use the filter input to focus on specific file types
- View file type distribution in the analytics chart
- Monitor processing statistics in real-time
Error Handling
- Encoding Detection: Automatically tries multiple text encodings
- Error Logging: Detailed error messages with file paths and timestamps
- Graceful Degradation: Continues processing even when individual files fail
📋 Requirements
- Modern web browser with File System Access API support
- JavaScript enabled
- Sufficient memory for processing large folder structures
Perfect for developers, researchers, and anyone who needs to efficiently process and analyze folder structures!