Audiobook Reader App: Standalone TTS for Mac/Windows/Linux
I started experimenting with audiobook generation using AI voice models as a response to the terrifying results of trying to estimate more or less how many books i will be able to read over the course of my life.
If you enjoy reading or do work that requires it, you might have already done this thought experiment. If you haven’t, just count how many books (or how many words if you want to be precise) you read in the last year, or whichever year made you feel good about the amount you read. Then look up the average life expectations for your cohort, and see how much reading you have left.
The first time I thought this through, I could already come up with more texts I wanted to read at some point than what I could expect to have time for. Audiobooks tip the scale, but there are not that many good ones, even in English, not to mention other languages.
I looked into text to speech and AI voice models, expecting to find a free tool like Audiobook Reader, which will allowed me to efficiently convert text files to mp3 without reading them out first. To my surprise, I did not. At the time of writing this, the mainstream TTS landscape consists mostly of a number of very well developed tools that read text in real-time, many of them integrated into operating systems or browsers as accessibility features. I found only a few tools that generate audio files, and I was disappointed with the voice quality, lack of mp3 conversion, lack of interrupt handling, and general slow performance. Most of them work around real-time reading speed, which makes them useless for long texts, since these can take 10 hours or more to read.
So I decided to make my own solution and see if I can make it fast and stable enough to be useful.
Features
- Around 10x real-time conversion speed: The backend automatically detects NPU/GPU/CPU configs and uses appropriate hardware optimization for speed and parallel processing
- 54 voices in 9 languages: Female, male and androgynous sounding voices in all languages
- Voice blending: Mix voices to create new ones
- Format support: EPUB, PDF, TXT, Markdown, RST → MP3, WAV, M4A, M4B
- Progress visualization: Visual progress bars with time estimates and conversion speed
- Character voices (beta, in development): Different voices per characters or have it done automatically
- Pause/resume: Interrupt and continue conversions
- Batch queue: Add multiple books, process sequentially or in parallel
Please note that I don’t have the resources to support and test all the distributions at the same level. CLI is always the best if you are not afraid of typing, and where I roll out new features first. Mac is tested regularly but inherently more complex. Win and Linux are in beta, QA help is welcome.
Download
Latest Release: GitHub Releases
Available Platforms
- macOS: Apple Silicon (M1/M2/M3) and Intel builds
- Windows: x64 installer and portable versions
- Linux: AppImage, DEB, and RPM packages for Ubuntu 20.04+, Debian, Fedora
File sizes: Approximately 80-120MB per platform (includes bundled Python runtime)
System Requirements
- macOS: 10.15 (Catalina) or later
- Windows: Windows 10 or later (64-bit)
- Linux: Ubuntu 20.04+, Debian 11+, Fedora 35+, or equivalent
- Disk space: 500MB minimum (includes voice models after first run)
- RAM: 4GB minimum, 8GB recommended for large books
Features
- Drag-and-drop interface: Select files visually, no terminal required
- 54 voices in 9 languages with real-time preview
- Voice blending: Mix voices via sliders with live playback
- Format support: EPUB, PDF, TXT, Markdown, RST → MP3, WAV, M4A, M4B
- Progress visualization: Visual progress bars with time estimates
- Character voices (beta): Assign voices to characters in plays and dialogues
- Hardware optimization: Automatic NPU/GPU/CPU detection for approximately 10x real-time speed
- Pause/resume: Interrupt and continue long conversions
- Batch queue: Add multiple books, process sequentially or in parallel
- FFmpeg integration: Download helper popup if not installed
Getting Started
First Launch
- Download the appropriate installer for your system
- macOS: Right-click and select “Open” (security prompt for unsigned app)
- Windows: Run installer or extract portable version
- Linux: Make AppImage executable (
chmod +x) or install package
First Conversion
- App will prompt to download Kokoro voice models (~310MB) and FFmpeg if needed
- Drag and drop a text file into the window
- Select output location and voice
- Click “Convert” - progress shows in real-time
📸 Screenshot: Audiobook Reader app main window showing drag-and-drop area and voice selection dropdown
Keyboard Shortcuts
- Ctrl/Cmd + O: Open file
- Ctrl/Cmd + S: Save output location
- Space: Pause/Resume conversion
- Ctrl/Cmd + Q: Quit app
- Ctrl/Cmd + ,: Open settings
Troubleshooting
macOS “App is damaged” error:
Run in Terminal: xattr -cr /Applications/AudiobookReader.app
Windows Defender warning: App is unsigned. Click “More info” → “Run anyway”
Linux permission issues:
Ensure AppImage is executable: chmod +x AudiobookReader.AppImage
Consider Supporting
I created this alone, and it will be free and open source forever. For issues and feature requests please use GitHub. If you find it useful, please consider supporting me so I can improve this tool and create more.
Copyright Notice
IMPORTANT: This software is a tool for converting text to audio. Users are solely responsible for:
- Ensuring they have the legal right to convert any text to audio
- Obtaining necessary permissions for copyrighted materials
- Complying with all applicable copyright laws and licensing terms
- Understanding that creating audiobooks from copyrighted text without authorization may constitute copyright infringement
Recommended Use Cases:
✅ Your own original content ✅ Public domain works ✅ Content you have explicit permission to convert ✅ Educational materials you legally own ✅ Open-source or Creative Commons licensed texts (per their terms)
I do not condone or support copyright infringement. By using this software, you agree to use it only for content you have the legal right to convert.