Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix: skip tests for sources without test accounts #379

Merged
merged 5 commits into from
Mar 5, 2024

Conversation

AstrakhantsevaAA
Copy link
Collaborator

Tell us what you do here

  • implementing verified source (please link a relevant issue labeled as verified source)
  • fixing a bug (please link a relevant bug report)
  • improving, documenting, or customizing an existing source (please link an issue or describe below)
  • anything else (please link an issue or describe below)

Relevant issue

issue #

More PR info

@AstrakhantsevaAA AstrakhantsevaAA mentioned this pull request Mar 5, 2024
4 tasks
@AstrakhantsevaAA AstrakhantsevaAA changed the title Fix: tests vs Fix: skip tests for sources without test accounts Mar 5, 2024
@AstrakhantsevaAA AstrakhantsevaAA merged commit e7c3226 into master Mar 5, 2024
14 checks passed
@AstrakhantsevaAA AstrakhantsevaAA deleted the fix/tests-vs branch March 5, 2024 10:52
rudolfix pushed a commit that referenced this pull request Mar 6, 2024
* skip facebook tests

* skip matomo tests

* skip personio tests

* replace postgres to duckdb

* return kafka, skip strapi
rudolfix added a commit that referenced this pull request Mar 22, 2024
* handles end_value and row_order in sql_database

* drops dlt extract dependencies

* test(filesystem): enable testing (#348)

* enable testing

* bump dlt version

* pushes correct gdrive path

* fixes formatting, removez az logs

---------

Co-authored-by: Marcin Rudolf <[email protected]>

* Update pull_request_template.md (#373)

* fixes kinesis shard test (#390)

* Fix: skip tests for sources without test accounts (#379)

* skip facebook tests

* skip matomo tests

* skip personio tests

* replace postgres to duckdb

* return kafka, skip strapi

* Scrapy source using scrapy (#332)

Close queue

Add requirements.txt

Remove redundant config option

Add revised README

Make api simpler

* Add batching of results

Add logging and batch size configuration


* Add pytest-mock and scrapy

Close queue when exiting

Check if queue close is called

Log number of batches

Fix linting issues

Fix linting issues

Mark scrapy source

Fix linting issue

Format code

Yield!

* Adjust tests

* Add pytest-twisted

* Add twisted to scrapy dependencies

* Add twisted to dev dependencies

* Add review comments

* Add more checks and do not exit when queue is empty

* Create QueueClosedError and handle in listener to exit loop

* Simplify code

* Stop crawling if queue is closed

* Fix linting issues

* Fix linting issues

* Adjust tests and disable telnet server for scrapy

* Remove pytest-twisted

* Refactor scrapy item pipeline

* Eliminate custom spider

* Use pytest.mark.forked to run tests for ALL_DESTINATIONS

* Add pytest-forked

* Update lockfile

* Use scrapy signals

* Hide batching and retrieving logic inside queue

* Add more types

* Extend default scrapy settings

* Extract pipeline and scrapy runners

* Simplify helpers code

* Cleanup code

* Add start_urls_file configuration option

* Sync scrapy log level with dlt log level

* Expose simple scraping pipeline runner

* Adjust config file

* Connect signals in ScrapyRunner.init

* Register source and do cleanups

* Better scrapy setting passing and minor cleanups

* Remove reduntant code comments

* Call engine_stopped callback in finally block

* Add more docstrings related to runners

* Adjust batch size

* Fix queue batching bugs

* Pass crawler instance to item_scraped callback

* Add advanced example to pipeline code

* Access settings override for scrapy

* Rewrite tests

* Small readme update for bing wembaster

* Adjust queue read timeout

* Extract test utils for scraping source

* Add stream generator to queue to handle generator exit exception

* Extract singal registering and tearing down as context manager

* Adjust and cleanup example pipeline source file

* Cleanup scraping helpers

* Adjust tests for scraping pipeline

* Add callback access to scraping resource

* Update readme

* Cleanup code

* Import ParamSpec from typing extensions

* Fix linting issues

* Fix linting issues

* Set encoding when opening the file with urls

* Adjust typing for scraping testing utils

* Use proper Union syntax

* Adjust mock patch module path for scraping tests

* Use latest dlt version

* Adjust mock patch module path for scraping tests

* Adjust tests and mark ones to skip

* Cleanup tests and utils for scraping source

* Re-use spy on queue.close calls

* Use append write_disposition by default for scraping source

* Update test skip reason

* Stop crawler manually

* Return self from __call__

* Check if crawler.stop is actually called

* Check if crawling has already been stopping

* Test to verify resource name generation and override

* Adjust resource name selection

* Add more docstrings and update readme

* Update readme

* Add scrapy configuration in example pipeline

* Shutdown twisted reactor after module tests

* Use simple run_pipeline

* Close the queue after timeout

* Rewrite a comment and use break instead of return in while loop

* Update comments

* Mock queue with alternative implementation

* Adjust mock patch path

* Add logging when scrapy stops and re-arrange code actions

* Stop crawler in on_engine_stopped

* Call on_engine_stopped from on_item_scraped if the queue is closed

* Skip test

* rename template buttons (#340)

* Rename new-verified-source.md to build-new-verified-source.md

* Rename source-request.md to request-new-source.md

* Update request-new-source.md

* fixes references

---------

Co-authored-by: Marcin Rudolf <[email protected]>

* bumps dlt to 0.4.6

* fixes linter

* fixes the empty primary key disabling incremental dedup

* bumps to 0.4.7a0, does not overwrite [] as primary key

* bumps dlt to 0.4.7

---------

Co-authored-by: Ilya Gurov <[email protected]>
Co-authored-by: Anton Burnashev <[email protected]>
Co-authored-by: Alena Astrakhantseva <[email protected]>
Co-authored-by: Sultan Iman <[email protected]>
Co-authored-by: adrianbr <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants