Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Option "do not scrape duplicates" #48

Open
maksbdev opened this issue Apr 24, 2024 · 2 comments
Open

Option "do not scrape duplicates" #48

maksbdev opened this issue Apr 24, 2024 · 2 comments
Labels
enhancement New feature or request good first issue Good for newcomers

Comments

@maksbdev
Copy link

Often, when scraping several similar queries (or one query in nearby locations), duplicates may appear in search results, resulting in duplicate entries in the output file and requiring additional time to complete the task. Is it possible to implement a parameter that prevents scraping information about an organization if it has already been scraped earlier?

@gosom
Copy link
Owner

gosom commented Apr 25, 2024

hi @maksbdev ,

this sounds like a good idea.

I will try to include in the next release.

thanks for your feedback

@gosom gosom self-assigned this Apr 25, 2024
@gosom gosom added enhancement New feature or request good first issue Good for newcomers labels Apr 25, 2024
@gosom gosom removed their assignment Apr 25, 2024
@ruanbsroche
Copy link

i have made one script in py to do this

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request good first issue Good for newcomers
Projects
None yet
Development

No branches or pull requests

3 participants