diff --git a/readme.md b/readme.md index 8e1ff5b..28f4ab3 100644 --- a/readme.md +++ b/readme.md @@ -35,13 +35,11 @@ The [examples directory](./examples/B0819W19WD) contains a preview export of the It works by logging into your [Kindle web reader](https://read.amazon.com) account using [Playwright](https://playwright.dev), exporting each page of a book as a PNG image, and then using a vLLM (`gpt-4o` or `gpt-4o-mini`) to transcribe the text from each page to text. Once we have the raw book contents and metadata, then it's easy to convert it to PDF, EPUB, etc. 🔥 -The accuracy has been very close to perfect in my testing, with the only discrepancies being occasional whitespace issues. -
- We start from the Kindle web reader library. + The automated script starts from the Kindle web reader's library page and selects the book we want to export. | @@ -57,7 +55,7 @@ The accuracy has been very close to perfect in my testing, with the only discrep |
- We use Playwright to take a screenshot of each page of the selected book. + Playwright exports a PNG screenshot for each page, bypassing Kindle's DRM. | @@ -65,7 +63,7 @@ The accuracy has been very close to perfect in my testing, with the only discrep |
- Then we convert each page screenshot into text using one of OpenAI's vLLMs. + Then we convert each page's screenshot into text using one of OpenAI's vLLMs (`gpt-4o` or `gpt-4o-mini`). | **Mantell Sector, North Nekhebet, Resurgam, Delta Pavonis system, 2551** @@ -95,6 +93,8 @@ Sylveste stood on the edge of the excavation and wondered if any of his labours +The accuracy has been very close to perfect in my testing, with the only discrepancies being occasional whitespace issues. + > [!NOTE] > _(Exporting audio books with AI-generated voice narration is coming soon! Please star the repo if you're interested in this feature.)_ ### Why is this necessary? |