fal’s inference engine bindings take a torch module and apply all relevant dynamic compilation and
quantization techniques to make it faster out of the box, without exposing any of that complexity to the user.
This API is currently experimental and may change in the future.
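
To give a rough sense of the kind of transformation described above, here is a minimal sketch of one such technique, dynamic int8 quantization, written with stock PyTorch only. It is not fal's API and makes no claim about what fal does internally: the helper name `quantize_linear_layers` and the toy model are hypothetical and exist purely for illustration.

```python
import torch
from torch import nn
from torch.ao.quantization import quantize_dynamic


def quantize_linear_layers(module: nn.Module) -> nn.Module:
    """Hypothetical helper (not part of fal): return a copy of `module`
    whose nn.Linear layers use int8 dynamically quantized weights."""
    module.eval()  # dynamic quantization is meant for inference
    # inplace defaults to False, so the original module keeps its float layers.
    return quantize_dynamic(module, {nn.Linear}, dtype=torch.qint8)


# Toy model standing in for a real network (an assumption for this sketch).
model = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 4))
fast_model = quantize_linear_layers(model)

with torch.no_grad():
    out = fast_model(torch.randn(2, 16))
```

The caller passes a module in and gets a module back with the same call signature, which is the "no leaked complexity" shape the paragraph describes; whether any speedup materializes depends on the model and the hardware.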