AI Disruption
Subscribe
Sign in
Home
Podcast
Chat
Chip
Meta
Paper
Qwen
Agent
Robot
Google
OpenAI
AI Code
AI Video
AI Weekly
Elon Musk
AI Writing
AI Painting
AI Business
🎈 Guest Posts
AI Open Source
Machine Learning
Chinese Outbound
Foundation Model
Archive
About
AI Painting
Latest
Top
Discussions
Mogao=Seedream 3.0?Topping the Charts for Days, Mysterious Text-to-Image Model Exposed
Discover Mogao: ByteDance's Seedream 3.0 tops AI image generation charts, surpassing GPT-4o & Midjourney.
Apr 16
Â
•
Â
Meng Li
1
Share this post
AI Disruption
Mogao=Seedream 3.0?Topping the Charts for Days, Mysterious Text-to-Image Model Exposed
Copy link
Facebook
Email
Notes
More
VAST Open-Sources Two New Projects: 3D Component Editing & Auto-Rigging Framework
VAST open-sources HoloPart (3D part editing) & UniRig (auto-rigging). Boost 3D workflows with AI-powered segmentation & animation tools.
Apr 13
Â
•
Â
Meng Li
1
Share this post
AI Disruption
VAST Open-Sources Two New Projects: 3D Component Editing & Auto-Rigging Framework
Copy link
Facebook
Email
Notes
More
Midjourney Releases V7: The New AI Art Powerhouse | Demos & Flaws
Midjourney V7: Next-Gen AI Art with Realistic Details & Draft Mode | See Stunning Demos & Key Flaws
Apr 4
Â
•
Â
Meng Li
6
Share this post
AI Disruption
Midjourney Releases V7: The New AI Art Powerhouse | Demos & Flaws
Copy link
Facebook
Email
Notes
More
Manus Ignites MCP, Enabling Claude to Automate 3D Modeling with a Single Sentence
Claude automates Blender with MCP, turning 2D images into 3D models in minutes. Explore seamless 3D modeling and interactive web creation!
Mar 15
Â
•
Â
Meng Li
2
Share this post
AI Disruption
Manus Ignites MCP, Enabling Claude to Automate 3D Modeling with a Single Sentence
Copy link
Facebook
Email
Notes
More
Google Gemini's Native Image Output Launches First, Erasing OpenAI's One-Year Lead
Explore Gemini's game-changing text-to-image model with unmatched controllability. Discover tips for seamless edits, local modifications, and AI-driven…
Mar 14
Â
•
Â
Meng Li
2
Share this post
AI Disruption
Google Gemini's Native Image Output Launches First, Erasing OpenAI's One-Year Lead
Copy link
Facebook
Email
Notes
More
Llama Boosts Multimodal Performance by 30% with Diffusion's Attention Distribution
Boost Llama-3.2's multimodal performance by 30% with Stable Diffusion’s attention distribution. Achieve high accuracy with minimal data and training…
Feb 17
Â
•
Â
Meng Li
3
Share this post
AI Disruption
Llama Boosts Multimodal Performance by 30% with Diffusion's Attention Distribution
Copy link
Facebook
Email
Notes
More
MakeAnything Unlocks Multi-task Process Generation with Diffusion Transformer
MakeAnything combines Diffusion Transformer and asymmetric LoRA to unlock cross-domain, high-quality multi-task process generation, achieving…
Feb 16
Â
•
Â
Meng Li
2
Share this post
AI Disruption
MakeAnything Unlocks Multi-task Process Generation with Diffusion Transformer
Copy link
Facebook
Email
Notes
More
OpenAI Tests Sora Image Generator, Codename "Papaya" – Is DALL-E 4 Coming Soon?
OpenAI is testing Sora's image generation, codenamed "Papaya," with a new toggle for video and image creation. Could DALL-E 4 be coming soon?
Feb 9
Â
•
Â
Meng Li
Share this post
AI Disruption
OpenAI Tests Sora Image Generator, Codename "Papaya" – Is DALL-E 4 Coming Soon?
Copy link
Facebook
Email
Notes
More
DeepSeek Releases Image-Video Large Model Janus-Pro-7B, Competing with OpenAI DALL-E 3
DeepSeek’s multimodal tech revolutionizes AI, with Janus-Pro surpassing DALL-E 3 in performance. Explore efficient, open-source models for advanced…
Jan 28
Â
•
Â
Meng Li
2
Share this post
AI Disruption
DeepSeek Releases Image-Video Large Model Janus-Pro-7B, Competing with OpenAI DALL-E 3
Copy link
Facebook
Email
Notes
More
MetaMorph: Unified Visual Understanding and Generation by LeCun
Explore MetaMorph, a multimodal AI model merging visual understanding and generation. Discover insights from LeCun, Xie, Liu, and others.
Dec 21, 2024
Â
•
Â
Meng Li
1
Share this post
AI Disruption
MetaMorph: Unified Visual Understanding and Generation by LeCun
Copy link
Facebook
Email
Notes
More
Gemini 2.0: One-Sentence Image Editing Tool that Has Netizens Envious
Gemini 2.0 revolutionizes image editing with one-click commands. Transform objects, combine images, and create imaginative edits using just your voice.
Dec 16, 2024
Â
•
Â
Meng Li
2
Share this post
AI Disruption
Gemini 2.0: One-Sentence Image Editing Tool that Has Netizens Envious
Copy link
Facebook
Email
Notes
More
PaliGemma 2: Google's Multi-Scale Lightweight Vision-Language Model
Discover PaliGemma 2: Google's lightweight, multi-scale vision-language model, ideal for image-text tasks, content creation, and AI development…
Dec 8, 2024
Â
•
Â
Meng Li
2
Share this post
AI Disruption
PaliGemma 2: Google's Multi-Scale Lightweight Vision-Language Model
Copy link
Facebook
Email
Notes
More
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts