Back to Projects

Excel Translate - AI-Powered Document Translation System

Vite
ExcelJS
OpenAI SDK
React
TailwindCSS
Typescript
AES-256 Encryption
Render

Overview

Full-Stack AI Translation Platform | Full-Stack Developer (Solo)

Excel Translate is an AI-powered translation system that converts Vietnamese business and legal documents to English in under 2 minutes per sheet—achieving an 80% reduction in both time and cost compared to traditional translation services. Built as a solo full-stack project, the platform leverages OpenAI API to deliver context-aware translations while maintaining 100% Excel formatting integrity, including formulas, colors, merged cells, and multi-sheet structures.

The platform's core innovation is its Custom Dictionary (Word Bank) feature, which solves the critical inconsistency problem that plagues generic translation tools. Organizations can define their own specialized terminology once, ensuring that "Giám đốc" always translates to "Director" (not randomly switching between "Manager" or "CEO") across hundreds of documents. This guarantees professional-quality, standardized translations that meet compliance standards for audit and accounting firms.

This project demonstrates my ability to architect AI-powered solutions, implement complex file processing systems, and deliver production-ready applications with enterprise-grade security for handling sensitive financial data.

Excel Translate - AI-Powered Document Translation System screenshot 1
Excel Translate - AI-Powered Document Translation System screenshot 2
Excel Translate - AI-Powered Document Translation System screenshot 3
Excel Translate - AI-Powered Document Translation System screenshot 4
+2

Key Features

  • AI-powered Vietnamese to English translation
  • Custom Dictionary (Word Bank) for terminology consistency
  • 100% Excel formatting preservation (formulas, colors, styles, merged cells)
  • Drag-and-drop file upload interface
  • Real-time translation progress tracking
  • Multi-sheet Excel support
  • Enterprise-grade security with auto-deletion
  • Support for files up to 100 pages
  • ~2 minutes processing time per sheet

Challenges

Professional translation services for business documents present several critical challenges that make them impractical for organizations handling large volumes of financial reports, audit documents, and legal paperwork:

Cost & Time Constraints: Traditional human translation services are expensive and slow, with a single 50-page financial report taking days and costing hundreds of - dollars - These prohibitive costs and turnaround times create bottlenecks in international business operations and regulatory compliance

Inconsistent Terminology: Generic translation tools lack business context and produce inconsistent results—"Giám đốc" randomly becomes "Director," "Manager," or "CEO" across documents - For audit and accounting firms, this terminology inconsistency is unacceptable and can lead to confusion, errors, and compliance issues

Format Destruction: Most translation tools only handle plain text, destroying critical Excel formatting including formulas, color-coding, merged cells, and multi-sheet structures - Users must spend hours manually reconstructing formatting after translation, negating any time savings

Security & Confidentiality: Financial and legal documents contain highly sensitive information that cannot be exposed to public translation APIs - Audit firms require guarantees that confidential data won't be stored, accessed, or compromised by third-party services

Lack of Specialized Knowledge: General-purpose AI translation lacks understanding of accounting, audit, and legal terminology where context is critical - Vietnamese business documents use specific phrasing that requires industry knowledge to translate accurately

Solutions

AI-Powered Translation Engine: Context-Aware Translation: Implemented advanced prompting strategies using Google Gemini AI that distinguish between similar terms based on context (e.g., "Giám đốc điều hành" vs "Giám đốc dự án") - Batch Processing: Designed an efficient pipeline that translates entire multi-sheet Excel files in approximately 2 minutes per sheet - Real-Time Progress Tracking: Built a WebSocket-based system providing live progress updates with percentage completion and time estimates

Custom Dictionary (Word Bank) - Core Innovation: User-Defined Terminology: Organizations create their own translation dictionaries ensuring terms like "Hội đồng quản trị" always translate to "Board of Directors" with 100% consistency - Priority Translation Logic: AI checks the custom dictionary first for exact matches, then uses context-aware translation for non-dictionary terms - Bulk Management: Built Excel import/export functionality and intuitive search interface for managing hundreds of terms efficiently

Format Preservation Technology: 100% Formatting Integrity: Developed a sophisticated processing system that preserves all Excel formulas, styles, colors, merged cells, and multi-sheet structures - Five-Stage Pipeline: Parse Excel structure → Extract text with metadata → AI translation with context → Apply translations → Reconstruct file with original formatting

User Experience & Interface: Drag-and-Drop Upload: Implemented modern file upload supporting .xlsx, .xls, and .csv formats up to 100 pages - Real-Time Feedback: Built progress monitoring with status updates, percentage complete, and instant download upon completion - Dictionary Management: Designed table-based interface for easy terminology management with search, filter, and bulk operations

Security & Privacy Architecture: End-to-End Encryption: All uploaded files are encrypted client-side using industry-standard algorithms with automatic deletion after 24 hours - Zero-Knowledge Design: Server cannot access document contents; no logging or storage of document data - Access Control: Implemented authentication system with role-based permissions for organizational account management

Performance & Efficiency

  • ⚡ 2-Minute Translation: Reduced translation time from hours/days to approximately 2 minutes per Excel sheet
  • 💰 80% Cost Reduction: Eliminated expensive human translation services, saving organizations thousands of dollars annually
  • 🎯 100% Format Retention: Achieved perfect preservation of Excel formatting, formulas, and structure
  • ✅ Consistent Terminology: Guaranteed 100% terminology consistency through the Custom Dictionary system

Business Impact

  • Scalability: Organizations can translate hundreds of documents per month instead of just a few
  • Quality Control: Complete terminology control ensures professional, consistent translations meeting compliance standards
  • Cost Efficiency: Eliminated per-word translation fees with fast turnaround enabling real-time translation during audit cycles

Project Details