Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

setup instructions for Windows #54

Closed
wants to merge 4 commits into from
Closed

Conversation

freecho
Copy link

@freecho freecho commented Jan 1, 2024

Please check the relevant content in the Windows section of the readme.md.

@VikParuchuri
Copy link
Owner

Thanks for this! This doesn't seem to include some dependencies:

  • ocrmypdf
  • tesseract + language packs
  • ghostscript

Did you find how to install these?

@freecho
Copy link
Author

freecho commented Jan 3, 2024

Thanks,I will continue to study these issues when free time.

@GdeJoode
Copy link

GdeJoode commented Jan 15, 2024

Hi, I found the tesseract 5 binary on https://github.com/UB-Mannheim/tesseract/wiki. I am just now installing marker and am waiting for a nvidia graphics card to use CUDA. So have not tested it yet.

or winget install -e --id UB-Mannheim.TesseractOCR (found through the OCRmyPDF page)

@CBIhalsen
Copy link

Reference in new issue

HI, have you found the perfect solution for windows?

@freecho
Copy link
Author

freecho commented Feb 4, 2024

I have updated the README.md, and following the steps, both my friend and I have successfully run the project.

@umarbutler
Copy link

I encountered two issues following the steps detailed in your instructions:

  1. I encountered the error ImportError: failed to find libmagic. Check your installation which I resolved by running (in poetry shell) pip install python-magic-bin==0.4.14 as described in this StackOverflow answer.
  2. I encountered the error AssertionError: Torch not compiled with CUDA enabled which I resolved by running (in poetry shell) pip uninstall torch and then pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118 (note that this will install torch complied for CUDA 11.8) as described by PyTorch's official documentation.
  3. ModuleNotFoundError: No module named 'pkg_resources' which I resolved by running pip install setuptools.

After running these fixes, I was able to get marker working in my Windows environment.

@umarbutler
Copy link

I have created a new PR that addresses the issues raised in my comment above.

@VikParuchuri
Copy link
Owner

Thanks for looking into this! The new version has no required system dependencies, so Windows install should be much simpler - #116

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants