Remove the FIFO thread #7568

sakertooth · 2024-10-31T01:13:53Z

This PR removes AudioEngine::fifoWriter, which is a thread that facilities the multiple buffering mechanism we have currently for our audio rendering. The way the mechanism works (to my understanding):

The FIFO thread, when started, splits the buffer size into chunks of 256 sample frames and generates them as fast as it can.
The FIFO thread stops generating audio when the number of chunks generated equal original_buffer_size / 256 and waits for someone to request a chunk.
The main audio thread requests chunks from the FIFO and uses that for playback.
While the main audio thread is requesting and receiving chunks, the FIFO thread continues rendering more data.

Pros to having this thread:

Automation changes happen more regularly - Most processing in LMMS treat its parameters as having one value. That same value is used throughout the buffer. As a result, without the FIFO thread and the chunking of the full audio buffer, larger buffer sizes reduce the amount of automation that occurs through time.
1. Sample-exactness can fix this for most parameters, which makes it so that parameters are treated as having values per sample in the buffer, rather than just one single value. There are complaints of having sample-exactness for most of our native parameters will cause major drops in performance, which is why removing this thread may not be that straightforward. Note: In the case of VST2 parameters, I believe sample-exactness cannot be applied, as sample-exact automation was added in VST3. This is to show that inevitably, not all the parameters can be made sample-exact, and most professional DAWs deal with this same issue and also have problems involving automation timing accuracy.
2. I fixed this problem but was still able to remove the FIFO thread by doing the chunking on the callback thread.

Cons to having this thread:

Export bugs - This was the main reason why I wanted to remove this thread. The communication between the main audio thread, the export thread, and the FIFO thread is complex and has lead to deadlocks when exporting like in Constant freezing at 0% during export #7320. Trying to fix the FIFO thread and remove the deadlock does not feel like a good use of time considering that it really shouldn't be there in the first place if things were more correctly implemented in the beginning.
Higher buffer sizes != more performance - For reasons I am currently unaware of, higher buffer sizes do not necessarily imply more performance when using the FIFO thread (at least in this instance). I discovered this when I was investigating the infamous "automating VST knobs" bottleneck. The video I attached demonstrates a project automating a VST knob with an LFO controller with this branch merged in my build. In the video, you can see how having a buffer size of 1024 samples keeps the CPU meter at acceptable levels, while in contrast a buffer size of 256 samples has a bad impact on performance.
1. This improvement in performance was only seen when this thread was removed and the main audio thread was directly rendering the audio buffers. Current master exhibits bad performance regardless of what buffer size you select in the settings when automating VST knobs, while here the situation is less of a problem if a higher buffer size is selected.
2. Note: I checked out master and moved into a new branch in the video for some reason, but merged this branch and nothing else. I was planning to do a real fix, but @DomClark seems to be working on it already, and I quickly came up with the idea to see if this PR would help performance, so I merged it in and it did.

The solution now is to keep the chunking of the buffer size and do it on the audio callback thread, and still remove the FIFO thread.

output.mp4

sakertooth · 2024-12-17T15:22:22Z

Changed the PR to still remove the FIFO thread but keep the chunking of the buffer size (to avoid problems with non sample accurate automation but to also fix the problems this thread has caused).

…opped frames

Removed this functionality by accident

Rossmaxx · 2025-01-16T06:58:59Z

This shoulda had higher priority. @LMMS/testers

Rossmaxx · 2025-01-16T06:59:31Z

Wait the tester role is only for peki?

Rossmaxx

The fatloss on this is insane. I am not approving for now because I am doubtful if something might have gotten lost somewhere. I will approve after i get around to testing.

Also, don't understand what is now happening with the chunking mechanism. Is the audio thread processing in chunks of 256 and then outputting to the playback/export at the user defined buffer size? Or is there something I missed.

src/core/AudioEngine.cpp

src/core/audio/AudioDevice.cpp

bratpeki · 2025-01-16T11:19:40Z

@LMMS/testers

Reporting!

I'm busy with managing #7477, #7444 and #7366.

Is this making the code more readable without reducing performance? Or is it a performance boosting PR? In any case, great stuff, I'll look at it when I'm done with these three PRs!

Rossmaxx · 2025-01-16T11:55:11Z

This is supposed to solve the performance degradation on master compared to 1.2 and the rare case with deadlocks making a mess, along with the buffer size and automation performance, among other stuff. @sakertooth correct me if I'm wrong. If what i guess is right, this PR has the potential to straighten up a lot of the performance bugs.

bratpeki · 2025-01-16T12:32:48Z

Wait the tester role is only for peki?

For the time being, yeah. I think I get rights to review PRs and whatnot, so it's probably not wise to give it away to anyone wanting to test, but I'm not entirely sure about if that's how it behaves, just speculation.

sakertooth · 2025-01-16T15:38:27Z

This is supposed to solve the performance degradation on master compared to 1.2 and the rare case with deadlocks making a mess, along with the buffer size and automation performance, among other stuff. @sakertooth correct me if I'm wrong. If what i guess is right, this PR has the potential to straighten up a lot of the performance bugs.

It's mostly to fix deadlock bugs involving this thread waiting forever to close. It's less of a performance benefit (though that could be possible) but more so to simplify the thread communication within the audio pipeline.

CorruptVoidSoul · 2025-01-16T20:35:45Z

So I tested this PR.
Exporting works fine, no hearable differences.
Playback is still pretty much the same as on master, I made it play the performance intensive part of a song, my phone is still in pain, no noticeable gain or loss there.
I had two random crashes but I expect this to be a me problem because well, it's a phone and this mmpz is uhh... a bit tough sometimes.
Still on the "me problem" side, usually Equalizer takes a minute to load for the first time, in this PR it takes 30 seconds so it's better, not gonna complain.
Now back on playback, the playhead will desync itself from the actual sound after a huge lag spike, but that's a general performance problem, it doesn't have much to do with this PR I think.

Final feedback is this seems good, did Peki test that too ? There could be differences with a computer that can handle heavy workload.

sakertooth · 2025-01-17T21:56:41Z

Final feedback is this seems good, did Peki test that too ? There could be differences with a computer that can handle heavy workload.

I don't think Peki has tested this yet, but they're probably busy with other PRs so it's fine. Thank you for testing this for me 👍

sakertooth added 9 commits December 16, 2024 23:50

Remove FIFO thread

140f99c

Add AudioEngine::renderNextBufferChunked and use it in SDL audio device

a85b145

Use new chunking function in JACK audio device

4a3ca48

Use new chunking function in OSS audio device

e193860

Use new chunking function in PortAudio device

608a7d1

Use new chunking function in PulseAudio device

0a956c6

Use new chunking function in ALSA device

850a842

Use new chunking function in sndio device

25802da

Use new chunking function in soundio device

663ca7e

sakertooth force-pushed the revamp-buffers branch from acfbfc4 to 663ca7e Compare December 17, 2024 15:21

sakertooth added 2 commits December 17, 2024 10:35

Remove unused getNextBuffer function

17a1d34

Minor changes

24a2a28

sakertooth marked this pull request as ready for review December 17, 2024 16:10

sakertooth added needs code review A functional code review is currently required for this PR needs testing This pull request needs more testing labels Dec 18, 2024

sakertooth added 4 commits December 18, 2024 07:20

Make renderNextBufferChunked persist buffers across calls to avoid dr…

2dbe458

…opped frames

Make some style changes in AudioSoundIo

30db2ef

Remove redundancy in AudioOss

0a0122e

Check for result from write call again

b2f3579

sakertooth mentioned this pull request Dec 18, 2024

Constant freezing at 0% during export #7320

Open

1 task

sakertooth added 4 commits December 18, 2024 08:30

Consider if the audio device has stopped for JACK devices

bf89032

Removed this functionality by accident

Consider if the audio device has stopped

e4fc133

Cast bytes to std::size_t

ecf9523

Merge remote-tracking branch 'upstream/master' into revamp-buffers

e433ffd

sakertooth linked an issue Dec 21, 2024 that may be closed by this pull request

Constant freezing at 0% during export #7320

Open

1 task

sakertooth added 4 commits December 24, 2024 11:05

Avoid copying of rendered buffer in renderNextBufferChunked

a03bd17

Remove processNextBuffer function and call writeBuffer directly

885cb32

Restore functionality to render audio within AudioDummy

3a72faa

Add stopped variable in AudioDummy

c5a16a7

Rossmaxx reviewed Jan 16, 2025

View reviewed changes

src/core/AudioEngine.cpp Outdated Show resolved Hide resolved

src/core/AudioEngine.cpp Outdated Show resolved Hide resolved

src/core/audio/AudioDevice.cpp Show resolved Hide resolved

sakertooth added 2 commits January 16, 2025 11:34

Use outputBufferRead instead of copying to a separate static buffer

75dab12

Inline startProcessing and stopProcessing

0615733

sakertooth removed the needs testing This pull request needs more testing label Jan 17, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove the FIFO thread #7568

Remove the FIFO thread #7568

sakertooth commented Oct 31, 2024 •

edited

Loading

sakertooth commented Dec 17, 2024

Rossmaxx commented Jan 16, 2025

Rossmaxx commented Jan 16, 2025

Rossmaxx left a comment

bratpeki commented Jan 16, 2025 •

edited

Loading

Rossmaxx commented Jan 16, 2025

bratpeki commented Jan 16, 2025

sakertooth commented Jan 16, 2025

CorruptVoidSoul commented Jan 16, 2025

sakertooth commented Jan 17, 2025

Remove the FIFO thread #7568

Are you sure you want to change the base?

Remove the FIFO thread #7568

Conversation

sakertooth commented Oct 31, 2024 • edited Loading

sakertooth commented Dec 17, 2024

Rossmaxx commented Jan 16, 2025

Rossmaxx commented Jan 16, 2025

Rossmaxx left a comment

Choose a reason for hiding this comment

bratpeki commented Jan 16, 2025 • edited Loading

Rossmaxx commented Jan 16, 2025

bratpeki commented Jan 16, 2025

sakertooth commented Jan 16, 2025

CorruptVoidSoul commented Jan 16, 2025

sakertooth commented Jan 17, 2025

sakertooth commented Oct 31, 2024 •

edited

Loading

bratpeki commented Jan 16, 2025 •

edited

Loading