Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

https://www.econ.sdu.edu.cn/zxzx/tzgg.htm 类似这种带分类链接的能智能提取吗 #15

Open
ieliwb opened this issue Apr 3, 2021 · 1 comment
Assignees
Labels
enhancement New feature or request

Comments

@ieliwb
Copy link

ieliwb commented Apr 3, 2021

https://www.econ.sdu.edu.cn/zxzx/tzgg.htm
这种网站,由于有2个链接,导致结果为空,大佬可以更新下吗

@ieliwb ieliwb added the enhancement New feature or request label Apr 3, 2021
@ieliwb
Copy link
Author

ieliwb commented Apr 7, 2021

可以加一个自定义规则吗,有些网站提取不到,可以用规则,类似:

result = extractor.extract(html, noise_node_list=['//div[@Class="comment-list"]'])

谢谢

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants