update lists

This commit is contained in:
2025-07-18 22:22:32 +02:00
parent 55bed3b4a1
commit 5916c5c074
3078 changed files with 331679 additions and 357255 deletions

View File

@@ -1,14 +1,14 @@
 Awesome Generative AI !Awesome (https://awesome.re/badge-flat.svg) (https://awesome.re)
 Awesome Generative AI !Awesome (https://awesome.re/badge-flat.svg) (https://awesome.re)
▐ A curated list of modern Generative Artificial Intelligence projects and services.
Generative Artificial Intelligence is a technology that creates original content such as images, sounds, and texts by using machine learning algorithms that are trained on large amounts of data. Unlike other forms of AI, it is capable 
of creating unique and previously unseen outputs such as photorealistic images, digital art, music, and writing. These outputs often have their own unique style and can even be hard to distinguish from human-created works. Generative AI
has a wide range of applications in fields such as of art, entertainment, marketing, academia, and computer science.
Generative Artificial Intelligence is a technology that creates original content such as images, sounds, and texts by using machine learning algorithms that are trained on large amounts of data. Unlike other forms of AI, it is capable of creating
unique and previously unseen outputs such as photorealistic images, digital art, music, and writing. These outputs often have their own unique style and can even be hard to distinguish from human-created works. Generative AI has a wide range of 
applications in fields such as of art, entertainment, marketing, academia, and computer science.
Contributions to this list are welcome. Before submitting your suggestions, please review the Contribution Guidelines (CONTRIBUTING.md) to ensure your entries meet the criteria. Add links through pull requests 
(https://github.com/steven2358/awesome-generative-ai/pulls) or create an issue (https://github.com/steven2358/awesome-generative-ai/issues) to start a discussion. More projects can be found in the Discoveries List (DISCOVERIES.md), 
where we showcase a wide range of up-and-coming Generative AI projects.
(https://github.com/steven2358/awesome-generative-ai/pulls) or create an issue (https://github.com/steven2358/awesome-generative-ai/issues) to start a discussion. More projects can be found in the Discoveries List (DISCOVERIES.md), where we 
showcase a wide range of up-and-coming Generative AI projects.
Contents
@@ -25,30 +25,31 @@
Recommended reading
- How Large Language Models Will Transform Science, Society, and AI (https://hai.stanford.edu/news/how-large-language-models-will-transform-science-society-and-ai) - Article summarizing the capabilities and limitations of the GPT-3 
model, and its potential impact on society. By Alex Tamkin and Deep Ganguli, February 5, 2021.
- Generative AI: A Creative New World (https://www.sequoiacap.com/article/generative-ai-a-creative-new-world/) - A comprehensive examination of the generative AI industry, offering a historical perspective and in-depth analysis of the 
industry ecosystem. By Sonya Huang, Pat Grady and GPT-3, September 19, 2022.
- A Coming-Out Party for Generative A.I., Silicon Valley's New Craze (https://www.nytimes.com/2022/10/21/technology/generative-ai.html) - Article about the rise of generative AI, particularly the success of the Stable Diffusion image 
generator, and the associated controversies. New York Times, October 21, 2022.
- AI's New Creative Streak Sparks a Silicon Valley Gold Rush (https://www.wired.com/story/ais-new-creative-streak-sparks-a-silicon-valley-gold-rush/) - Article about the growing hype and investment in generative AI startups, with 
various industries exploring its potential applications. Wired, October 27, 2022.
- ChatGPT Heralds an Intellectual Revolution (https://www.wsj.com/articles/artificial-intelligence-generative-ai-chatgpt-kissinger-84512912) - An op-ed by Henry Kissinger, Eric Schmidt and Daniel Huttenlocher. Wall Street Journal, 
February 24, 2023.
- How Large Language Models Will Transform Science, Society, and AI (https://hai.stanford.edu/news/how-large-language-models-will-transform-science-society-and-ai) - Article summarizing the capabilities and limitations of the GPT-3 model, and its
potential impact on society. By Alex Tamkin and Deep Ganguli, February 5, 2021.
- Generative AI: A Creative New World (https://www.sequoiacap.com/article/generative-ai-a-creative-new-world/) - A comprehensive examination of the generative AI industry, offering a historical perspective and in-depth analysis of the industry 
ecosystem. By Sonya Huang, Pat Grady and GPT-3, September 19, 2022.
- A Coming-Out Party for Generative A.I., Silicon Valley's New Craze (https://www.nytimes.com/2022/10/21/technology/generative-ai.html) - Article about the rise of generative AI, particularly the success of the Stable Diffusion image generator, 
and the associated controversies. New York Times, October 21, 2022.
- AI's New Creative Streak Sparks a Silicon Valley Gold Rush (https://www.wired.com/story/ais-new-creative-streak-sparks-a-silicon-valley-gold-rush/) - Article about the growing hype and investment in generative AI startups, with various 
industries exploring its potential applications. Wired, October 27, 2022.
- ChatGPT Heralds an Intellectual Revolution (https://www.wsj.com/articles/artificial-intelligence-generative-ai-chatgpt-kissinger-84512912) - An op-ed by Henry Kissinger, Eric Schmidt and Daniel Huttenlocher. Wall Street Journal, February 24, 
2023.
Milestones
- OpenAI API (https://openai.com/blog/openai-api/) - Announcement of the OpenAI API for text-to-text general-purpose AI models based on GPT-3. OpenAI blog, June 11, 2020.
- GitHub Copilot (https://github.blog/2021-06-29-introducing-github-copilot-ai-pair-programmer/) - Announcement of Copilot, a new AI pair programmer that helps you write better code. GitHub blog, June 29, 2021.
- DALL·E 2 (https://openai.com/blog/dall-e-2/) - Announcement of the release of DALL·E 2, an advanced image generation system with improved resolution, expanded image creation capabilities, and various safety mitigations. OpenAI blog, 
April 6, 2022.
- Stable Diffusion Public Release (https://stability.ai/blog/stable-diffusion-public-release) - Announcement of the public release of Stable Diffusion, an AI-based image generation model trained on a broad internet scrape and licensed 
under a Creative ML OpenRAIL-M license. Stable Diffusion blog, 22 August, 2022.
- ChatGPT (https://openai.com/blog/chatgpt/) - Announcement of ChatGPT, a conversational model trained to answer follow-up questions, admit mistakes, challenge incorrect premises, and reject inappropriate requests. OpenAI blog, November
30, 2022.
- Bing Search (https://blogs.microsoft.com/blog/2023/02/07/reinventing-search-with-a-new-ai-powered-microsoft-bing-and-edge-your-copilot-for-the-web/) - Microsoft announces a new version of its search engine Bing, powered by a 
next-generation OpenAI model. Microsoft blog, February 7, 2023.
- DALL·E 2 (https://openai.com/blog/dall-e-2/) - Announcement of the release of DALL·E 2, an advanced image generation system with improved resolution, expanded image creation capabilities, and various safety mitigations. OpenAI blog, April 6, 
2022.
- Stable Diffusion Public Release (https://stability.ai/blog/stable-diffusion-public-release) - Announcement of the public release of Stable Diffusion, an AI-based image generation model trained on a broad internet scrape and licensed under a 
Creative ML OpenRAIL-M license. Stable Diffusion blog, 22 August, 2022.
- ChatGPT (https://openai.com/blog/chatgpt/) - Announcement of ChatGPT, a conversational model trained to answer follow-up questions, admit mistakes, challenge incorrect premises, and reject inappropriate requests. OpenAI blog, November 30, 2022.
- Bing Search (https://blogs.microsoft.com/blog/2023/02/07/reinventing-search-with-a-new-ai-powered-microsoft-bing-and-edge-your-copilot-for-the-web/) - Microsoft announces a new version of its search engine Bing, powered by a next-generation 
OpenAI model. Microsoft blog, February 7, 2023.
- LLaMA (https://ai.facebook.com/blog/large-language-model-llama-meta-ai/) - Llama LLM, a foundational, 65-billion-parameter large language model by Meta. Meta, February 23rd, 2023. #opensource
- GPT-4 (https://openai.com/research/gpt-4) - Announcement of GPT-4, a large multimodal model. OpenAI blog, March 14, 2023.
- DALL·E 3 (https://openai.com/index/dall-e-3/) - Announcement of DALL·E 3 image generator. OpenAI blog, September 20, 2023.
- Sora (https://openai.com/research/video-generation-models-as-world-simulators) - Presentation of Sora, a large video generation model. OpenAI, February 15, 2024.
Text
@@ -60,27 +61,29 @@
- OPT (https://huggingface.co/facebook/opt-350m) - Open Pretrained Transformers (OPT) by Facebook is a suite of decoder-only pre-trained transformers. Announcement 
(https://ai.facebook.com/blog/democratizing-access-to-large-scale-language-models-with-opt-175b/). OPT-175B text generation (https://opt.alpa.ai/) hosted by Alpa.
- Bloom (https://huggingface.co/docs/transformers/model_doc/bloom) - BLOOM by Hugging Face is a model similar to GPT-3 that has been trained on 46 different languages and 13 programming languages. #opensource
- LLaMA (https://ai.facebook.com/blog/large-language-model-llama-meta-ai/) - A foundational, 65-billion-parameter large language model by Meta. #opensource
- Llama 2 (https://ai.meta.com/llama/) - The next generation of Meta's open source large language model. #opensource
- Llama (https://www.llama.com/) - Meta's open source large language model. #opensource
- Claude (https://claude.ai/) - Talk to Claude, an AI assistant from Anthropic.
- Vicuna-13B (https://lmsys.org/blog/2023-03-30-vicuna/) - An open-source chatbot trained by fine-tuning LLaMA on user-shared conversations collected from ShareGPT. #opensource
- Mixtral 8x7B (https://mistral.ai/news/mixtral-of-experts/) - A high-quality sparse mixture of experts model with open weights. #opensource
- Mistral (https://mistral.ai/en/models) - Cutting-edge open-weight LLMs by Mistral AI. #opensource
- Grok (https://grok.x.ai/) - An LLM by xAI with open source (https://github.com/xai-org/grok-1) and open weights. #opensource
- Qwen (https://qwenlm.github.io/) - A series of LLMs independently developed by Alibaba Cloud. #opensource (https://github.com/QwenLM/Qwen)
Chatbots
- ChatGPT (https://chat.openai.com/chat) - ChatGPT by OpenAI is a large language model that interacts in a conversational way.
- ChatGPT (https://chatgpt.com/) - ChatGPT by OpenAI is a large language model that interacts in a conversational way.
- Copilot (https://copilot.microsoft.com/) - An everyday AI companion by Microsoft.
- Gemini (https://gemini.google.com/) - A family of multimodal large language model developed by Google Deepmind.
- Meta AI (https://www.meta.ai/) - Meta AI assistant to get things done, create AI-generated images, get answers. Built on Llama LLM.
- DeepSeek (https://www.deepseek.com/) - Cutting-edge LLMs for enterprise, consumer, and scientific applications. #opensource
- Character.AI (https://character.ai/) - Character.AI lets you create characters and chat to them.
- ChatPDF (https://www.chatpdf.com/) - Chat with any PDF.
- ChatSonic (https://writesonic.com/chat) - An AI-powered assistant that enables text and image creation.
- Pi (https://pi.ai) - A personalized AI platform available as a digital assistant.
- Qwen (https://chat.qwenlm.ai/) - Qwen chatbot with image generation, document processing, web search integration, video understanding, etc.
- Le Chat (https://chat.mistral.ai/) - Chat with Mistral AI's cutting-edge language models.
Custom interfaces
- LibreChat (https://librechat.ai/) - LibreChat is a free and open-source chat interface for assistant AIs. Source code (https://github.com/danny-avila/LibreChat).
- Chatbot UI (https://www.chatbotui.com/) - An open source ChatGPT UI. Source code (https://github.com/mckaywrigley/chatbot-ui).
- LibreChat (https://librechat.ai/) - LibreChat is a free and open-source chat interface for assistant AIs. #opensource (https://github.com/danny-avila/LibreChat).
- Chatbot UI (https://www.chatbotui.com/) - An open source ChatGPT UI. #opensource (https://github.com/mckaywrigley/chatbot-ui).
Search engines
@@ -88,6 +91,7 @@
- Metaphor (https://metaphor.systems/) - Language model powered search.
- Phind (https://phind.com/) - AI-based search engine.
- You.com (https://you.com/) - A search engine built on AI that provides users with a customized search experience while keeping their data 100% private.
- Komo (https://komo.ai/) - An AI-powered search engine.
Local search engines
@@ -103,8 +107,8 @@
- HyperWrite (https://hyperwriteai.com/) - HyperWrite helps you write with confidence and get your work done faster from idea to final draft.
- Moonbeam (https://www.gomoonbeam.com/) - Better blogs in a fraction of the time.
- copy.ai (https://www.copy.ai/) - Write better marketing copy and content with AI.
- ChatSonic (https://writesonic.com/chat) - An AI-powered assistant that enables text and image creation.
- Anyword (https://anyword.com/) - Anyword's AI writing assistant generates effective copy for anyone.
- Contenda (https://contenda.co/) - Create the content your audience wants, from content you've already made.
- Hypotenuse AI (https://www.hypotenuse.ai/) - Turn a few keywords into original, insightful articles, product descriptions and social media copy.
- Lavender (https://www.lavender.ai/) - Lavender email assistant helps you get more replies in less time.
- Lex (https://lex.page/) - A word processor with artificial intelligence baked in, so you can write faster.
@@ -130,19 +134,23 @@
Productivity
- ChatPDF (https://www.chatpdf.com/) - Chat with any PDF.
- Mem (https://mem.ai/) - Mem is the world's first AI-powered workspace that's personalized to you. Amplify your creativity, automate the mundane, and stay organized automatically.
- Taskade (https://www.taskade.com/) - Outline tasks, notes, generated structured lists and mind maps with Taskade AI.
- Notion AI (https://www.notion.so/product/ai) - Write better, more efficient notes and docs.
- Nekton AI (https://nekton.ai) - Automate your workflows with AI. Describe your workflows step by step in plain language.
- Rewind (https://www.rewind.ai/) - Rewind is a personalized AI powered by everything you've seen, said, or heard..
- Rewind (https://www.rewind.ai/) - Rewind is a personalized AI powered by everything you've seen, said, or heard.
- NotebookLM (https://notebooklm.google/) - A research and note-taking online tool to interact with documents, powered by Google Gemini.
Meeting assistants
- Otter.ai (https://otter.ai/) - A meeting assistant that records audio, writes notes, automatically captures slides, and generates summaries.
- Cogram (https://www.cogram.com/) - Cogram takes automatic notes in virtual meetings and identifies action items.
- Sybill (https://www.sybill.ai/) - Sybill generates summaries of sales calls, including next steps, pain points and areas of interest, by combining transcript and emotion-based insights.
- Loopin AI (https://www.loopinhq.com/) - Loopin is a collaborative meeting workspace that not only enables you to record, transcribe & summaries meetings using AI, but also enables you to auto-organise meeting notes on top of your 
calendar.
- Loopin AI (https://www.loopinhq.com/) - Loopin is a collaborative meeting workspace that not only enables you to record, transcribe & summaries meetings using AI, but also enables you to auto-organise meeting notes on top of your calendar.
- Fireflies.ai (https://fireflies.ai/) - Fireflies.ai helps your team transcribe, summarize, search, and analyze voice conversations.
- Read AI (https://www.read.ai/) - An AI copilot for wherever you work, making your meetings, emails, and messages more productive with summaries, content discovery, and recommendations.
- Fireflies.ai (https://fireflies.ai) - Transcribe, summarize, search, and analyze all your team conversations.
Academia
@@ -153,6 +161,14 @@
(https://github.com/paperswithcode/galai).
- Consensus (https://consensus.app/search/) - Consensus is a search engine that uses AI to find answers in scientific research.
- Synthical (https://synthical.com) - AI-powered collaborative research environment.
- scite (https://scite.ai/) - A platform for discovering and evaluating scientific articles.
- SciSpace (https://typeset.io/) - An AI research assistant for understanding scientific literature.
- STORM (https://storm.genie.stanford.edu/) - An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations. #opensource (https://github.com/stanford-oval/storm/)
Leaderboards
- Chatbot Arena (https://lmarena.ai/) - An open platform for crowdsourced AI benchmarking, hosted by researchers at UC Berkeley SkyLab and LMArena.
- Artificial Analysis (https://artificialanalysis.ai/) - Artificial Analysis provides objective benchmarks & information to help choose AI models and hosting providers.
- imgsys (https://imgsys.org/rankings) - A generative image model arena by fal.ai.
Other text generators
@@ -166,7 +182,7 @@
- GitHub Copilot (https://github.com/features/copilot) - GitHub Copilot uses the OpenAI Codex to suggest code and entire functions in real-time, right from your editor.
- OpenAI Codex (https://platform.openai.com/docs/guides/code/) - An AI system by OpenAI that translates natural language to code.
- Ghostwriter (https://blog.replit.com/ai) - An AI-powered pair programmer by replit.
- Amazon CodeWhisperer (https://aws.amazon.com/codewhisperer/) - Build applications faster with the ML-powered coding companion.
- Amazon Q (https://aws.amazon.com/q/) - The AWS generative AIpowered assistant that helps answer questions, write code, and automate tasks.
- tabnine (https://www.tabnine.com/) - Code faster with whole-line & full-function code completions.
- Stenography (https://stenography.dev/) - Automatic code documentation.
- Mintlify (https://mintlify.com/) - AI powered documentation writer.
@@ -176,9 +192,10 @@
- PR-Agent (https://github.com/Codium-ai/pr-agent) - AI-powered tool for automated PR analysis, feedback, suggestions and more.
- MutableAI (https://mutable.ai/) - AI Accelerated Software Development.
- TurboPilot (https://github.com/ravenscroftj/turbopilot) - A self-hosted copilot clone which uses the library behind llama.cpp to run the 6 billion parameter Salesforce Codegen model in 4 GB of RAM.
- GPT-Code UI (https://github.com/ricklamers/gpt-code-ui) - An open source implementation of OpenAI's ChatGPT Code interpreter.
- GPT-Code UI (https://github.com/ricklamers/gpt-code-ui) - An open source implementation of OpenAI's ChatGPT Code interpreter. #opensource
- MetaGPT (https://github.com/geekan/MetaGPT) - The Multi-Agent Framework: Given one line Requirement, return PRD, Design, Tasks, Repo
- Open Interpreter (https://github.com/KillianLucas/open-interpreter) - OpenAI's Code Interpreter in your terminal, running locally.
- Continue (https://www.continue.dev/) - Open-source AI code assistant. Connect any model and any context to create custom autocomplete and chat experiences inside the IDE. #opensource (https://github.com/continuedev/continue)
Developer tools
@@ -186,12 +203,35 @@
- Haystack (https://haystack.deepset.ai/) - A framework for building NLP applications (e.g. agents, semantic search, question-answering) with language models.
- LangChain (https://langchain.com/) - A framework for developing applications powered by language models.
- gpt4all (https://github.com/nomic-ai/gpt4all) - A chatbot trained on a massive collection of clean assistant data including code, stories and dialogue.
- LLM App (https://github.com/pathwaycom/llm-app) - Open-source Python library to build real-time LLM-enabled data pipeline.
- LMQL (https://lmql.ai/) - LMQL is a query language for large language models.
- LlamaIndex (https://www.llamaindex.ai/) - A data framework for building LLM applications over external data.
- Phoenix (https://phoenix.arize.com/) - Open-source tool for ML observability that runs in your notebook environment, by Arize. Monitor and fine tune LLM, CV and tabular models.
- Cursor (https://www.cursor.so/) - Cursor is the IDE of the future, built for pair-programming with Powerful AI.
- SymbolicAI (https://github.com/Xpitfire/symbolicai) - A neuro-symbolic framework for building applications with LLMs at the core.
- Vanna.ai (https://vanna.ai/) - An open-source Python RAG framework for SQL generation and related functionality. #opensource (https://github.com/vanna-ai/vanna)
- Portkey (https://portkey.ai/) - A full-stack LLMOps platform for LLM monitoring, caching, and management.
- agenta (https://github.com/agenta-ai/agenta) - An open-source end-to-end LLMOps platform for prompt engineering, evaluation, and deployment. #opensource
- Together AI (https://www.together.ai/) - Train, fine-tune-and run inference on AI models blazing fast, at low cost, and at production scale.
- Gitingest (https://gitingest.com/) - Turn any Git repository into a simple text digest of its codebase so it can be fed into any LLM. #opensource (https://github.com/cyclotruc/gitingest)
- Repomix (https://repomix.com/) - Pack your codebase into AI-friendly formats. #opensource (https://github.com/yamadashy/repomix)
- llama.cpp (https://github.com/ggml-org/llama.cpp) - Inference of Meta's LLaMA model (and others) in pure C/C++. #opensource
- bitnet.cpp (https://github.com/microsoft/BitNet) - Official inference framework for 1-bit LLMs, by Microsoft. #opensource (https://github.com/microsoft/BitNet)
- OpenRouter (https://openrouter.ai/) - A unified interface for LLMs. #opensource (https://github.com/OpenRouterTeam)
- Ludwig (https://github.com/ludwig-ai/ludwig) - A low-code framework for building custom AI models like LLMs and other deep neural networks. #opensource (https://github.com/ludwig-ai/ludwig)
Playgrounds
- OpenAI Playground (https://platform.openai.com/playground) - Explore resources, tutorials, API docs, and dynamic examples.
- Google AI Studio (https://aistudio.google.com/) - A web-based tool to prototype with Gemini and experimental models.
- GitHub Models (https://github.com/marketplace/models) - Find and experiment with AI models to develop a generative AI application.
Local LLM Deployment
- Ollama (https://github.com/ollama/ollama) - Get up and running with large language models locally.
- Open WebUI (https://github.com/open-webui/open-webui) - An extensible, feature-rich, and user-friendly self-hosted AI platform designed to operate entirely offline. #opensource
- Jan (https://jan.ai/) - Run LLMs like Mistral or Llama2 locally and offline on your computer, or connect to remote AI APIs. #opensource (https://github.com/janhq/jan)
- Msty (https://msty.app/) - A straightworward and powerful interface for local and online AI models.
- PyGPT (https://pygpt.net/) - Personal desktop AI assistant with chat, vision, agents, image generation, tools and commands, voice control and more. #opensource
- LLM (https://llm.datasette.io/) - A CLI utility and Python library for interacting with Large Language Models, remote and local. #opensource (https://github.com/simonw/llm)
Agents
@@ -223,8 +263,8 @@
- Stable Diffusion (https://huggingface.co/CompVis/stable-diffusion-v1-4) - Stable Diffusion by Stability AI is a state of the art text-to-image model that generates images from text. #opensource
- Midjourney (https://www.midjourney.com/) - Midjourney is an independent research lab exploring new mediums of thought and expanding the imaginative powers of the human species.
- Imagen (https://imagen.research.google/) - Imagen by Google is a text-to-image diffusion model with an unprecedented degree of photorealism and a deep level of language understanding.
- Make-A-Scene (https://ai.facebook.com/blog/greater-creative-control-for-ai-image-generation/) - Make-A-Scene by Meta is a multimodal generative AI method puts creative control in the hands of people who use it by allowing them to 
describe and illustrate their vision through both text descriptions and freeform sketches.
- Make-A-Scene (https://ai.facebook.com/blog/greater-creative-control-for-ai-image-generation/) - Make-A-Scene by Meta is a multimodal generative AI method puts creative control in the hands of people who use it by allowing them to describe and 
illustrate their vision through both text descriptions and freeform sketches.
- DragGAN (https://github.com/XingangPan/DragGAN) - Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold.
Services
@@ -232,8 +272,7 @@
- Craiyon (https://www.craiyon.com/) - Craiyon, formerly DALL-E mini, is an AI model that can draw images from any text prompt.
- DreamStudio (https://beta.dreamstudio.ai/) - DreamStudio is an easy-to-use interface for creating images using the Stable Diffusion image generation model.
- Artbreeder (https://www.artbreeder.com/) - Artbreeder is new type of creative tool that empowers users creativity by making it easier to collaborate and explore.
- GauGAN2 (http://gaugan.org/gaugan2/) - GauGAN2 is a robust tool for creating photorealistic art using a combination of words and drawings since it integrates segmentation mapping, inpainting, and text-to-image production in a single 
model.
- GauGAN2 (http://gaugan.org/gaugan2/) - GauGAN2 is a robust tool for creating photorealistic art using a combination of words and drawings since it integrates segmentation mapping, inpainting, and text-to-image production in a single model.
- Magic Eraser (https://www.magiceraser.io/) - Remove unwanted things from images in seconds.
- Imagine by Magic Studio (https://magicstudio.com/imagine) - A tool by Magic Studio that let's you express yourself by just describing what's on your mind.
- Alpaca (https://www.getalpaca.io/) - Stable Diffusion Photoshop plugin.
@@ -249,9 +288,12 @@
- Lensa (https://prisma-ai.com/lensa) - An all-in-one image editing app that includes the generation of personalized avatars using Stable Diffusion.
- RunDiffusion (https://rundiffusion.com/) - Cloud-based workspace for creating AI-generated art.
- Ideogram (https://ideogram.ai/) - A text-to-image platform to make creative expression more accessible.
- Bing Image Creator (https://www.bing.com/images/create) - DALLE·3 based text-to-image generator with safety features.
- KREA (https://www.krea.ai/) - Generate high quality visuals with an AI that knows about your styles, concepts, or products.
- Nightcafe (https://creator.nightcafe.studio/) - NightCafe Creator is an AI Art Generator app with multiple methods of AI art generation.
- Leonardo AI (https://leonardo.ai/) - Create production-quality visual assets for your projects with unprecedented quality, speed, and style.
- Recraft (https://www.recraft.ai/) - An AI tool that lets creators easily generate and iterate original images, vector art, illustrations, icons, and 3D graphics.
- Reve Image (https://reve.art/) - A model trained from the ground up to excel at prompt adherence, aesthetics, and typography.
Graphic design
@@ -281,17 +323,25 @@
Video
- RunwayML (https://runwayml.com/) - Magical AI tools, realtime collaboration, precision editing, and more. Your next-generation content creation suite.
- Runway (https://runwayml.com/) - Magical AI tools, realtime collaboration, precision editing, and more. Your next-generation content creation suite.
- Synthesia (https://www.synthesia.io/) - Create videos from plain text in minutes.
- Rephrase AI (https://www.rephrase.ai/) - Rephrase's technology enables hyper-personalized video creation at scale that drive engagement and business efficiencies.
- Hour One (https://hourone.ai/) - Turn text into video, featuring virtual presenters, automatically.
- D-ID (https://www.d-id.com/) - Create and interact with talking avatars at the touch of a button.
- Colossyan (https://www.colossyan.com/) - Learning & Development focused video creator. Use AI avatars to create educational videos in multiple languages.
- Fliki (https://fliki.ai/) - Create text to video and text to speech content with ai powered voices in minutes.
- Pictory (https://pictory.ai/) - Pictory's powerful AI enables you to create and edit professional quality videos using text.
- Pika (https://pika.art/) - An idea-to-video platform that brings your creativity to motion.
- HeyGen (https://app.heygen.com/) - Turn scripts into talking videos with customizable AI avatars in minutes.
- Sora (https://openai.com/sora) - An AI model that can create realistic and imaginative scenes from text instructions.
- Luma Dream Machine (https://lumalabs.ai/dream-machine) - An AI model that makes high quality, realistic videos fast from text and images.
- Infinity AI (https://infinity.ai/) - Infinity is a video foundation model that allows you to craft your characters and then bring them to life.
- KLING AI (https://klingai.com/) - Tools for creating imaginative images and videos.
- Hailuo AI (https://hailuoai.video/) - AI-powered text-to-video generator.
Avatars
- D-ID (https://www.d-id.com/) - Create and interact with talking avatars at the touch of a button.
- HeyGen (https://app.heygen.com/) - Turn scripts into talking videos with customizable AI avatars in minutes.
- RenderNet (https://rendernet.ai/) - RenderNet AI is a tool for generating images and videos, providing control over character design, composition, and style.
Animation
@@ -299,18 +349,22 @@
Audio
Speech
Text-to-speech
- Eleven Labs (https://beta.elevenlabs.io/) - AI voice generator.
- Resemble AI (https://www.resemble.ai/) - AI voice generator and voice cloning for text to speech.
- WellSaid (https://wellsaidlabs.com/) - Convert text to voice in real time.
- Play.ht (https://play.ht/) - AI Voice Generator. Generate realistic Text to Speech voice over online with AI. Convert text to audio.
- Coqui (https://coqui.ai/) - Generative AI for Voice.
- podcast.ai (https://podcast.ai/) - A podcast that is entirely generated by artificial intelligence, powered by Play.ht text-to-voice AI.
- VALL-E X (https://vallex-demo.github.io/) - A cross-lingual neural codec language model for cross-lingual speech synthesis.
- TorToiSe (https://github.com/neonbjb/tortoise-tts) - A multi-voice text-to-speech system trained with an emphasis on quality. #opensource
- Bark (https://github.com/suno-ai/bark) - A transformer-based text-to-audio model. #opensource
- Coque (https://coqui.ai/) - Coqui is dedicated to open speech technology and to serving as the hub where speech researchers, developers, and practitioners congregate.
Speech-to-text
- Whisper (https://openai.com/index/whisper/) - Robust speech recognition via large-scale weak supervision. #opensource (https://github.com/openai/whisper)
- Wispr Flow (https://wisprflow.ai/) - Flow makes writing quick with seamless voice dictation for any application on your computer.
- Vibe Transcribe (https://thewh1teagle.github.io/vibe/) - All-in-one solution for effortless audio and video transcription. #opensource (https://github.com/thewh1teagle/vibe)
- whisper.cpp (https://github.com/ggerganov/whisper.cpp) - Port of OpenAI's Whisper model in C/C++. #opensource
Music
@@ -345,7 +399,9 @@
- ChatGPT prompt engineering for developers (https://www.deeplearning.ai/short-courses/chatgpt-prompt-engineering-for-developers/) - A short course by Isa Fulford (OpenAI) and Andrew Ng (DeepLearning.AI).
- OpenAI Cookbook (https://github.com/openai/openai-cookbook) - Examples and guides for using the OpenAI API.
- OpenAI Prompt Engineering Guide (https://platform.openai.com/docs/guides/prompt-engineering) - Strategies and tactics for getting better results from large language models.
- PromptPerfect - Tool for prompt engineering.
- PromptPerfect (https://promptperfect.jina.ai/) - Tool for prompt engineering.
- Anthropic courses (https://github.com/anthropics/courses) - Anthropic's educational courses.
- Build a Large Language Model (From Scratch) (https://www.manning.com/books/build-a-large-language-model-from-scratch) - A guide to building your own working LLM, by Sebastian Raschka.
More lists
@@ -368,3 +424,5 @@
- FlowGPT (https://flowgpt.com/) - Amplify your workflow with the best prompts.
- ChatGPT Prompts for Data Science (https://github.com/travistangvh/ChatGPT-Data-Science-Prompts) - A repository of useful data science prompts for ChatGPT.
- Awesome ChatGPT (https://github.com/sindresorhus/awesome-chatgpt) - Another awesome list for ChatGPT.
generativeai Github: https://github.com/steven2358/awesome-generative-ai