Open Source Python Multimedia Software - Page 4

Python Multimedia Software

View 4876 business solutions

Browse free open source Python Multimedia Software and projects below. Use the toggles on the left to filter open source Python Multimedia Software by OS, license, language, programming language, and project status.

  • Enterprise Job Scheduling Software Icon
    Enterprise Job Scheduling Software

    Unify Enterprise Job Scheduling for Scale, Visibility, and Control

    Managing your sprawling data center and cloud with disparate native schedulers creates chaos. Achieve unparalleled control and efficiency over your entire IT environment with JAMS job orchestration tools. JAMS provides the singular, centralized platform required to overcome the complexities of disparate native schedulers. Automate, secure, and govern all your workloads, eliminating fragmented control, compliance risks, and operational bottlenecks. JAMS streamlines operations and ensures audit-ready history, transforming your enterprise automation with confidence and precision.
    Learn More
  • deskbird is the most intuitive desk booking app for your hybrid office. Icon
    deskbird is the most intuitive desk booking app for your hybrid office.

    With deskbird, creating an efficient workplace has never been easier.

    For companies in need of a people-centric workplace management solution so employees can see who is in the office, schedule their office and work-from-home days, and book resources for office days.
    Learn More
  • 1
    SoulSync

    SoulSync

    Automated Music Discovery and Collection Manager

    SoulSync is an intelligent music discovery and automation platform designed to bridge streaming services with self-hosted media libraries, enabling users to automatically grow and maintain curated music collections. The system continuously monitors selected artists and detects new releases, then generates personalized playlists such as Release Radar and Discovery Weekly using its built-in recommendation logic. It can automatically download missing tracks from multiple sources including Soulseek, YouTube, and Beatport, then verify file accuracy through AcoustID fingerprinting to ensure the correct audio was obtained. The platform also enriches tracks with lyrics, album art, and MusicBrainz metadata while organizing files into clean, customizable folder structures. Once processed, the library can be synchronized with media servers like Plex, Jellyfin, or Navidrome so collections stay up to date without manual intervention.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 2
    KAIR

    KAIR

    Image Restoration Toolbox (PyTorch). Training and testing codes

    Image restoration toolbox (PyTorch). Training and testing codes for DPIR, USRNet, DnCNN, FFDNet, SRMD, DPSR, BSR/GAN, SwinIR.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 3
    Toonily Downloader

    Toonily Downloader

    A python tool for downloading manga from Toonily

    Toonily Downloader is a Python-based scraping and downloading tool designed specifically for manga and manhwa hosted on Toonily, enabling users to fetch entire series efficiently while preserving original image quality and structure. It provides both a command-line interface and a graphical user interface, making it accessible for both technical and non-technical users. The software supports downloading full series or selected chapters by parsing Toonily URLs and organizing content into clean, chapter-based directories. It uses concurrent downloading techniques to significantly speed up the process and includes robust error handling to recover from interruptions or failed downloads. Additionally, the tool allows users to convert downloaded chapters into high-quality PDF files without re-encoding images, ensuring fidelity to the original content.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 4
    IOPaint

    IOPaint

    Image inpainting tool powered by SOTA AI Model

    IOPaint is a powerful open-source image editing tool focused on inpainting, outpainting, object removal, and general image manipulation driven by state-of-the-art AI models, delivering these capabilities through both local and hosted workflows. Designed to be fully self-hosted and flexible, IOPaint supports a variety of underlying generators and inpaint models — from LaMa erase networks to Stable Diffusion-based replace/object generation — giving users multiple ways to refine or reconstruct images by removing unwanted elements or expanding artwork beyond its original boundaries. Its feature set includes erasing people, watermarks, or defects, adding or replacing objects, applying text-aware edits, and extending images outward (outpainting) to fill contours or expand compositions.
    Downloads: 13 This Week
    Last Update:
    See Project
  • Intelligent Retail Management Icon
    Intelligent Retail Management

    Retail space, product categories, planograms, automatic ordering, and shelf labels management

    Quant offers a wide range of solutions for retail. Within one integrated software system, it allows you to efficiently combine the management of retail space, shelf labels and marketing materials with task management, reporting and automatic replenishment.
    Learn More
  • 5
    YT Channel Downloader

    YT Channel Downloader

    Download YouTube channels, playlists, videos, audios with easy login.

    YT Channel Downloader is a powerful and user-friendly application that enables you to selectively or entirely download YouTube channels, playlists or individual videos and audios with ease. The app supports high-quality video and audio downloads, including 4K resolution and best available audio tracks. With a simple YouTube login process, you can download private or age-restricted content using your session cookies. The application offers extensive customization options, allowing you to choose your preferred video resolution, audio quality, and file format. Additionally, the intuitive interface ensures that both novice and advanced users can navigate and utilize the tool effortlessly. Whether you're archiving your favorite channels or downloading videos for offline viewing, YT Channel Downloader provides a seamless and efficient experience.
    Leader badge
    Downloads: 335 This Week
    Last Update:
    See Project
  • 6
    Convert text to International Morse Code. Input is ASCII text. Output can be: - . -..- - on the console, raw 8bit PCM suitable for piping to /dev/audio, .wav files or even (mp3|ogg). Good for headlines on your MP3 player or code practice.
    Leader badge
    Downloads: 69 This Week
    Last Update:
    See Project
  • 7
    Caesium - Image Compressor
    !! THIS PROJECT HAS BEEN MOVED!! https://github.com/Lymphatus/caesium-image-compressor Caesium reduces the size of your picture up to 90%, preserving the original visual quality. Allows you to save a lot of space and easily upload your pictures on the web in a moment. The software is user-friendly with a simple and clear interface.
    Downloads: 62 This Week
    Last Update:
    See Project
  • 8
    VidCutter

    VidCutter

    the simplest & fastest video cutter & joiner

    A modern, simple to use, constantly evolving and hella fast MEDIA CUTTER + JOINER w/ frame-accurate SmartCut technology + Qt5, libmpv, FFmpeg and MediaInfo powering the backend.
    Leader badge
    Downloads: 223 This Week
    Last Update:
    See Project
  • 9
    DeepSpeech

    DeepSpeech

    Open source embedded speech-to-text engine

    DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following the instructions in the usage docs. If you want to use the pre-trained English model for performing speech-to-text, you can download it (along with other important inference material) from the DeepSpeech releases page.
    Downloads: 12 This Week
    Last Update:
    See Project
  • Taking the Paper Out of Work Icon
    Taking the Paper Out of Work

    For organizations that need powerful ECM and document automation software

    The Square 9 AI-powered intelligent document processing platform takes the paper out of work and makes it easier to get things done with digital workflows.
    Learn More
  • 10
    Spotify Music Downloader

    Spotify Music Downloader

    Spotify Music Downloader

    Download music from Spotify and other music sources.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 11
    Curlew Multimedia Converter

    Curlew Multimedia Converter

    Easy to use Multimedia Converter for Linux

    Leader badge
    Downloads: 66 This Week
    Last Update:
    See Project
  • 12
    GNU Solfege
    GNU Solfege is *free* ear training software written in Python 3.4 using the Gtk+ 3 toolkit. The program is designed to be easily extended with lesson files (data files), so the user can create new exercises.
    Leader badge
    Downloads: 43 This Week
    Last Update:
    See Project
  • 13
    The Quake Army Knife (QuArK)
    A powerful editor for video games based on or similar to id Software's series of Quake games. Currently QuArK supports 41 distinct games, 5 generic game engines, and a countless number of expansions packs, addons, and mods. It integrates a map editor, model editor, archive editors, texture management, and much more.
    Downloads: 49 This Week
    Last Update:
    See Project
  • 14
    Crazy Eddies GUI System (CEGUI)

    Crazy Eddies GUI System (CEGUI)

    A fast, powerful and adaptable GUI solution

    Crazy Eddie's GUI (CEGUI) system is a graphical user interface C++ library. It was designed particularly for the needs of videogames, but the library is usable for non-game tasks, such as any other type of applications (rendering/visualisation/virtual reality) and tools. It is designed for user flexibility in look-and-feel, as well as being adaptable to the user's choice in tools and operating systems. Established in 2003, CEGUI sees continual, active development and remains one of the most powerful and most popular options for developers requiring an adaptable and efficient GUI solution. CEGUI is fully supported by the developers that created - and continue to develop - the library, and who are accessible via the project's dedicated internet forums and IRC channel. For more information, more screenshots and for news, check out out our homepage: http://cegui.org.uk The repository is now at bitbucket (including our issue/bug tracker): https://bitbucket.org/cegui
    Downloads: 44 This Week
    Last Update:
    See Project
  • 15
    DALL-E 2 - Pytorch

    DALL-E 2 - Pytorch

    Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis

    Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch. The main novelty seems to be an extra layer of indirection with the prior network (whether it is an autoregressive transformer or a diffusion network), which predicts an image embedding based on the text embedding from CLIP. Specifically, this repository will only build out the diffusion prior network, as it is the best performing variant (but which incidentally involves a causal transformer as the denoising network) To train DALLE-2 is a 3 step process, with the training of CLIP being the most important. To train CLIP, you can either use x-clip package, or join the LAION discord, where a lot of replication efforts are already underway. Then, you will need to train the decoder, which learns to generate images based on the image embedding coming from the trained CLIP.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 16
    Google Photos Sync

    Google Photos Sync

    Google Photos and Albums backup with Google Photos Library API

    Google Photos Sync is a backup tool for your Google Photos cloud storage. Google Photos Sync downloads all photos and videos the user has uploaded to Google Photos. It also organizes the media in the local file system using album information. Additional Google Photos 'Creations' such as animations, panoramas, movies, effects and collages are also backed up. This software is read only and never modifies your cloud library in any way, so there is no risk of damaging your data. There are a number of long standing issues with the Google Photos API that mean it is not possible to make a true backup of your media.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 17
    Mirrorcast

    Mirrorcast

    Open Source Alternative to Chromecast, Mirror Desktop and Play media r

    The idea is to replicate what Chromecast can do in regards to screen mirroring and streaming media to a remote display. Google chromes screen mirroring feature works well when used with a receiver such as Chromecast but this is a proprietary solution and audio does not work for desktop mirroring on some operating systems. At the moment, there is only a client for Debian/Ubuntu Operating systems and a server/receiver application for Raspberry pi. Mirrorcast aims to be a low latency screen mirroring solution with high-quality video and audio at 25-30fps, the later is why we will not use something like VNC. Mirrorcast uses up about the same amount of system resources as google chromes cast feature. The delay is less than 1 second on most networks. To achieve this we will use existing FOSS software such as ffmpeg, mpv, and omxplayer.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 18
    SpeechRecognition

    SpeechRecognition

    Speech recognition module for Python

    Library for performing speech recognition, with support for several engines and APIs, online and offline. Recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file. Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). Listening to a microphone in the background, various other useful recognizer features. The easiest way to install this is using pip install SpeechRecognition. The first software requirement is Python 2.6, 2.7, or Python 3.3+. This is required to use the library. PyAudio is required if and only if you want to use microphone input (Microphone). PyAudio version 0.2.11+ is required, as earlier versions have known memory management bugs when recording from microphones in certain situations. To hack on this library, first make sure you have all the requirements listed in the "Requirements" section.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 19
    VidCutter

    VidCutter

    A modern yet simple multi-platform video cutter and joiner

    A modern, simple to use, constantly evolving and hella fast MEDIA CUTTER + JOINER w/ frame-accurate SmartCut technology, chapter support, media stream selection for audio + subtitle channels and blackdetect video filter support to automatically detect scene changes or skip commercials in digital TV recordings. Chapter support allows scene chapter names to be included in final media metadata. NOTE: results will only work in media players that support chapters. Flatpak release includes the latest stable versions of FFmpeg, libmpv, MediaInfo, and PyQt5 running on the KDE platform runtime.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 20
    CamDesk

    CamDesk

    The Desktop Webcam Widget

    CamDesk is a free, open source, desktop webcam widget, that was created as home surveillance application. Although others have used it for demonstrations even with CamStudio, and QuickTime Player for screen casting.
    Downloads: 63 This Week
    Last Update:
    See Project
  • 21
    Clarity AI Upscaler

    Clarity AI Upscaler

    AI Image Upscaler & Enhancer

    Clarity AI Upscaler is an open-source AI image enhancement tool designed to increase the resolution and visual quality of images using modern generative techniques. The system uses deep learning models based on diffusion and other image generation methods to reconstruct high-resolution versions of low-resolution images while preserving important visual details. Unlike traditional interpolation-based upscaling algorithms, the system generates additional visual information that improves perceived clarity and sharpness. The project is intended as a free and open alternative to commercial AI upscaling tools, allowing developers and digital artists to run the technology locally or integrate it into their own workflows. The repository includes a full application environment with scripts, configuration files, and model support that allow users to run the upscaler as a standalone tool or integrate it into other pipelines.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 22
    Robust Video Matting (RVM)

    Robust Video Matting (RVM)

    Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX

    We introduce a robust, real-time, high-resolution human video matting method that achieves new state-of-the-art performance. Our method is much lighter than previous approaches and can process 4K at 76 FPS and HD at 104 FPS on an Nvidia GTX 1080Ti GPU. Unlike most existing methods that perform video matting frame-by-frame as independent images, our method uses a recurrent architecture to exploit temporal information in videos and achieves significant improvements in temporal coherence and matting quality. Furthermore, we propose a novel training strategy that enforces our network on both matting and segmentation objectives. This significantly improves our model's robustness. Our method does not require any auxiliary inputs such as a trimap or a pre-captured background image, so it can be widely applied to existing human matting applications. RVM is specifically designed for robust human video matting.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 23
    pytube

    pytube

    A lightweight, dependency-free Python library

    Pytube is a lightweight, dependency-free Python library that enables downloading YouTube videos and audio streams with minimal setup. It supports video resolution selection, progressive or adaptive streams, and caption downloads. Pytube is ideal for automation scripts, archiving tools, and media applications that need to interface with YouTube content programmatically.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 24
    Radio Tray is a streaming player for listening to online radios. Any bug or feature request should be reported in https://bitbucket.org/carlmig/radio-tray/issues
    Downloads: 35 This Week
    Last Update:
    See Project
  • 25
    Importer library to import assets from different common 3D file formats such as Collada, Blend, Obj, X, 3DS, LWO, MD5, MD2, MD3, MDL, MS3D and a lot of other formats. The data is stored in an own in-memory data-format, which can be easily processed. www.open3mod.com/ is a 3D model viewer and exporter based on Assimp that is also Open Source.
    Downloads: 34 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB