v1.2.0: major performance improvements and batch processing for PCIe and USB
Changelog:
- feat: rewrote the CPU search algorithm to be considerably faster (a cURL request went from 1s to ~340ms)
- fix: minor performance and idiomatic-code improvements to the PCIe/USB infrastructure
- feat: added batch processing to both PCIe and USB endpoints