Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature Request] 通过 semantic scholar 更新元数据 #161

Closed
northword opened this issue Mar 11, 2024 · 4 comments
Closed

[Feature Request] 通过 semantic scholar 更新元数据 #161

northword opened this issue Mar 11, 2024 · 4 comments
Assignees
Labels
enhancement New feature or request

Comments

@northword
Copy link
Owner

Describe the feature

您好,很感谢您开发的这个插件!现在很大的问题在于绝大部分刊登在arxiv上文章都不会更新自己发表在其它会议或期刊上的新信息,最多只留下一个comment。目前CS领域比较主流的是从 semantic scholar 上获取更新的信息,不知道您可否考虑加入进来?利用 semantic scholar api 可以查询发表在 arxiv 上的预印本是否已经发表论文(虽然没有囊括所有,但胜在简单可靠,且更新相对及时)

例子:

https://api.semanticscholar.org/graph/v1/paper/ArXiv:1705.07874?fields=publicationVenue,externalIds,journal,publicationTypes

Additional context

Originally posted by @invisprints in #57 (comment)

@northword northword added the enhancement New feature or request label Mar 11, 2024
@northword northword self-assigned this Mar 11, 2024
@github-project-automation github-project-automation bot moved this to 🆕 Needs Triage in @northword's Zotero Mar 11, 2024
@northword northword moved this from 🆕 Needs Triage to 📋 Planning stage in @northword's Zotero Mar 11, 2024
@northword
Copy link
Owner Author

northword commented Mar 16, 2024

https://github.com/northword/zotero-format-metadata/releases/tag/v1.14.0

插件将首先从 arxiv 获取是否有正式发表的 doi 信息,如果没有,则向 semantic scholar 请求数据。

为了保证不被速率限制,你可能需要在插件设置中填写 semantic scholar token。

由于没有大量样本做测试,在遇到问题时烦请继续报告一个 issue。

@github-project-automation github-project-automation bot moved this from 📋 Planning stage to ✅ Done in @northword's Zotero Mar 16, 2024
@invisprints
Copy link

感谢作者!不过目前还是无法顺利更新,就拿上面的文章为例 https://arxiv.org/abs/2107.07511
用 Zotero Connect 从浏览器抓取后,DOI、仓库、存档ID、网址、其它等字段都有元数据信息:

仓库: arXiv
存档ID: arXiv:1705.07874
DOI: 10.48550/arXiv.1705.07874
网址: http://arxiv.org/abs/1705.07874
其它: arXiv:1705.07874 [cs, stat]

Zotero 控制台的信息如下

HTTP GET https://api.semanticscholar.org/graph/v1/paper/DOI:10.48550/arXiv.1705.07874?fields=publicationTypes,title,authors,abstract,externalIds,url,venue,publicationVenue,publicationDate,journal failed with status code 404: {"error":"Paper with id DOI:10.48550/arXiv.1705.07874 not found"

可以看出 Linter 优先抓取 DOI 的信息,而且格式似乎也没对,DOI 类文献不包含 DOI 这个关键字,SS API 说明
不过大部分 Arxiv 类预印本通过此类方式也无法更新,还是按照 https://api.semanticscholar.org/arXiv:1705.10311 获取成功率最高

@northword
Copy link
Owner Author

可以看出 Linter 优先抓取 DOI 的信息,而且格式似乎也没对,DOI 类文献不包含 DOI 这个关键字,SS API 说明

根据文档,带上 DOI 也是正确的;

image

image

你所给出的这篇应该是预印本 DOI 在 ss 中没有记录,我应该可以调整一下顺序,优先使用 arxiv id 进行请求。

今天晚点改个版本再试试。

@northword northword reopened this Mar 18, 2024
@github-project-automation github-project-automation bot moved this from ✅ Done to 🆕 Needs Triage in @northword's Zotero Mar 18, 2024
@github-project-automation github-project-automation bot moved this from 🆕 Needs Triage to ✅ Done in @northword's Zotero Mar 18, 2024
Copy link

🚀 This ticket has been resolved in v1.15.1. See Release 1.15.1 for release notes.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
Archived in project
Development

No branches or pull requests

2 participants