Cannot download pretrained model (GCSFS failed) #61
Comments
@jwasswa2023 : Thanks for reporting this! I'm taking a look. |
@jwasswa2023: It seems to be a transient issue with Google Cloud Platform, who hosts the bucket in which the files are located. Could you try it again and tell me if you still have an issue? And if so, are you running your Python code from a terminal where your user is logged in to GCP? Thank you. |
Hello Julien,
2023-06-16 01:48:32 | WARNING | google.auth._default | No project ID could be determined. Consider running
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
|
Hi all! Having the same problem here. Is there a solution? |
Hi @azkalot1, from what we have gathered so far, this is often either:
Can you Also how did you install |
installed from pip
|
So it seems that you do have access through Two other questions:
import datamol as dm
mapper = dm.fs.get_mapper("gs://molfeat-store-prod/")
mapper.fs.ls("gs://molfeat-store-prod/")
|
gives
2 - no, instance from Saturn Cloud (AWS instance) |
Everything seems to indicate that it should be working in theory. I will investigate during the weekend. In the meantime, you might want to try temporarily removing any previous GCP credentials you have and check again. |
I think the issue is different and the exception is a bit misleading.
will give
This is the actual error; it is not about loading from the cloud. This happens on molfeat 0.9.2. When I install 0.8.9
works! |
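For anyone trying to find the underlying error behind a generic download failure, here is a minimal sketch of walking Python's exception chain with the standard library only. `load_model` is a hypothetical stand-in, not a molfeat function, and the inner `ValueError` is a placeholder for whatever the real failure is.

```python
from typing import List


def exception_chain(exc: BaseException) -> List[BaseException]:
    """Return [exc, its cause, the cause's cause, ...], outermost first."""
    chain = []
    seen = set()
    current = exc
    while current is not None and id(current) not in seen:
        chain.append(current)
        seen.add(id(current))
        # Prefer an explicit `raise ... from ...`; otherwise use the implicit context.
        nxt = current.__cause__
        if nxt is None and not current.__suppress_context__:
            nxt = current.__context__
        current = nxt
    return chain


def load_model():
    # Hypothetical loader that wraps every failure in one generic error.
    try:
        raise ValueError("placeholder for the real failure")
    except Exception as err:
        raise RuntimeError("could not download the model") from err


try:
    load_model()
except RuntimeError as err:
    # The last entry printed is the root cause.
    for link in exception_chain(err):
        print(f"{type(link).__name__}: {link}")
```

The last entry in the chain is usually the one worth reporting in an issue like this.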
I haven’t had the time to look yet, sorry. |
@azkalot1 Could you share your Pydantic version? Besides an overly broad except statement (here?), it seems to me that we are lacking a lower bound for the pydantic version in pyproject.toml, which causes issues when not installing with a conda-like dependency manager. |
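A quick way to collect the relevant versions for a report is sketched below, using only the standard library. The package list is an assumption based on the libraries named in this thread; adjust it as needed.

```python
from importlib import metadata


def installed_versions(packages):
    """Map each distribution name to its installed version, or None if absent."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


# Package names are assumptions taken from this thread.
for name, version in installed_versions(["molfeat", "pydantic", "gcsfs", "datamol"]).items():
    print(f"{name}: {version or 'not installed'}")
```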
Has anyone found a solution for this bug? |
same issues here |
Any update to this? Tried to run molfeat on colab but still having this issue. Would appreciate if there's any advice around using molfeat on colab. |
I will investigate this further. It seems to be something around colab notebooks and authentication in gcsfs. |
So this is really an issue with Google Colab and gcsfs. In a Colab notebook, authentication does not seem to work (even though the bucket accepts anonymous requests by default). You therefore need to be authenticated to access the bucket. The following works for me:
import google.auth
from google.colab import auth
auth.authenticate_user()
credentials, project_id = google.auth.default()
then
from molfeat.trans.pretrained import PretrainedHFTransformer
featurizer = PretrainedHFTransformer(kind="ChemBERTa-77M-MLM", pooling="bert", preload=True)
Does this address the issue for everyone else? I will try to find a definitive solution where Colab users do not need to run the authentication step. |
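If the same notebook also needs to run outside Colab, the authentication step can be guarded so it is skipped elsewhere. This is a minimal sketch, not part of molfeat: `authenticate_if_colab` is a hypothetical helper name, and it assumes `google.colab` is importable only inside a Colab runtime.

```python
def authenticate_if_colab() -> bool:
    """Run Colab's interactive login when available; return whether it ran."""
    try:
        # Assumption: this import succeeds only inside a Colab runtime.
        from google.colab import auth
    except ImportError:
        return False
    auth.authenticate_user()
    return True


if authenticate_if_colab():
    print("Authenticated through Colab")
else:
    print("Not running in Colab; skipping authentication")
```

Call it once before constructing the featurizer; outside Colab it does nothing.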
Hi, I'm having issues: WARNING:google.auth.compute_engine._metadata:Compute Engine Metadata server unavailable on attempt 1 of 3. Reason: timed out |
Is there an existing issue for this?
Bug description
Hello here, thank you for your efforts.
I have been using Molfeat but failed to run through a tutorial for fine-tuning a pre-trained model. When I try to define the featurizer and load a transformer model using the code below, I get an error:
featurizer = PretrainedHFTransformer(kind="ChemBERTa-77M-MLM", pooling="bert", preload=True)
This is the error below.
How to reproduce the bug
No response
Error messages and logs
Environment
Current environment
Additional context
No response