The pool adjusts to the amount of required threads.
The implementation is as simple as possible, but should be enough
for future extensions.
This brings down decoder thread creation/destruction even more.
Also fix OpenALSystem::release() to close channels properly on shutdown,
otherwise it would deadlock with the pool, because it waits until all
threads have died off, which did not necessarily happen.