From video input to audio output, via object detection (YOLOv8, ONNX format), an LLM (ChatGPT, via API), and text-to-speech (fastspeech2-en-ljspeech). Input can come from a webcam, movie files, or YouTube videos ...
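The flow described above (frame, detected objects, LLM text, speech audio) can be sketched roughly as follows. This is only a minimal illustration, not the project's actual code: `build_prompt` and `run_pipeline` are hypothetical names, and the `detect`, `describe`, and `speak` callables are placeholders for the YOLOv8 ONNX model, the ChatGPT API call, and the fastspeech2-en-ljspeech model respectively.

```python
from collections import Counter
from typing import Any, Callable, Iterable, List


def build_prompt(labels: Iterable[str]) -> str:
    """Turn detected class labels into a short text prompt for the LLM.

    Hypothetical helper: the prompt wording is an assumption.
    """
    counts = Counter(labels)
    if not counts:
        return "No objects were detected in the frame."
    # Sort by label so the prompt is deterministic for a given set of detections.
    parts = [f"{n} x {name}" for name, n in sorted(counts.items())]
    return "Describe a scene containing: " + ", ".join(parts) + "."


def run_pipeline(
    frame: Any,
    detect: Callable[[Any], List[str]],   # stand-in for the YOLOv8 ONNX detector
    describe: Callable[[str], str],       # stand-in for the ChatGPT API call
    speak: Callable[[str], Any],          # stand-in for the text-to-speech model
) -> Any:
    """Run one frame through detection, description, and speech synthesis."""
    labels = detect(frame)
    text = describe(build_prompt(labels))
    return speak(text)


if __name__ == "__main__":
    # Dummy stages so the sketch runs without any model or API key.
    audio = run_pipeline(
        frame=None,
        detect=lambda _frame: ["person", "dog", "person"],
        describe=lambda prompt: f"(LLM reply to: {prompt})",
        speak=lambda text: text.encode("utf-8"),
    )
    print(audio)
```

Passing the three stages in as callables keeps the sketch independent of any particular model or API client, so each stage could be swapped for the real component.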