VAST-AI-Research/TripoSF
SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling
Language: Python
#3d_generation #3d_reconstruction #flexicubes #image_to_3d
Stars: 237 Issues: 2 Forks: 6
https://github.com/VAST-AI-Research/TripoSF
lum3on/comfyui_HiDream-Sampler
ComfyUI Wrapper for HiDream
Language: Python
#ai_art #comfy_nodes #comfyui #custom_node #diffusers #image_generation
Stars: 222 Issues: 32 Forks: 18
https://github.com/lum3on/comfyui_HiDream-Sampler
River-Zhang/ICEdit
Repository for paper "In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer"
Language: Python
#diffusion #diffusion_models #diffusion_transformer #dit #editing_image #image_editing #in_context
Stars: 136 Issues: 1 Forks: 3
https://github.com/River-Zhang/ICEdit
[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt released! Only 4GB VRAM is enough to run! - River-...
Tencent/HunyuanCustom
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation
Language: Python
#audio_driven #diffusion_models #image_to_video #image_to_video_generation #video_editing #video_generation
Stars: 360 Issues: 4 Forks: 14
https://github.com/Tencent/HunyuanCustom
JAMESYJL/ShapeLLM-Omni
A Native Multimodal LLM for 3D Generation and Understanding
Language: Python
#3d_captioning #3d_editing #image_to_3d #llm #text_to_3d
Stars: 223 Issues: 3 Forks: 6
https://github.com/JAMESYJL/ShapeLLM-Omni
[NeurIPS 2025 Spotlight] A Native Multimodal LLM for 3D Generation and Understanding - JAMESYJL/ShapeLLM-Omni
Tencent-Hunyuan/Hunyuan3D-2.1
From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
Language: Python
#3d #3d_aigc #3d_generation #hunyuan3d #image_to_3d #shape #shape_generation #text_to_3d #texture_genertaion
Stars: 427 Issues: 13 Forks: 28
https://github.com/Tencent-Hunyuan/Hunyuan3D-2.1
SkyworkAI/Matrix-3D
Generate large-scale explorable 3D scenes with high-quality panorama videos from a single image or text prompt.
Language: Python
#3d_generation #3d_reconstruction #3d_scene_generation #aigc #aigc3d #genie #genie3 #graphics #image_to_3d #image_to_video #panorama_synthesis #scene_generation #text_to_3d #text_to_video #video_generation #world_models
Stars: 284 Issues: 7 Forks: 14
https://github.com/SkyworkAI/Matrix-3D
ENDESGA/PEP
Prediction-Encoded Pixels - a tiny yet powerful pixel art compression method
Language: C
#c #compression #image_compression #pixel_art #single_header
Stars: 241 Issues: 1 Forks: 3
https://github.com/ENDESGA/PEP
Tencent-Hunyuan/HunyuanWorld-Voyager
Voyager is an interactive RGBD video generation model conditioned on camera trajectory, and supports real-time 3D reconstruction.
Language: Python
#3d #3d_generation #aigc #hunyuan3d #image_to_3d #image_to_video #scene_generation #world_model #world_models
Stars: 323 Issues: 1 Forks: 20
https://github.com/Tencent-Hunyuan/HunyuanWorld-Voyager
Tencent-Hunyuan/HunyuanImage-2.1
HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation
Language: Python
#aigc #diffusion_models #diffusion_transformer #image_generation #text_to_image
Stars: 255 Issues: 7 Forks: 16
https://github.com/Tencent-Hunyuan/HunyuanImage-2.1
Tencent-Hunyuan/Hunyuan3D-Omni
Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets
Language: Python
#3d #3d_aigc #3d_generation #hunyuan3d #image_to_3d #multimodal #shape
Stars: 181 Issues: 0 Forks: 10
https://github.com/Tencent-Hunyuan/Hunyuan3D-Omni
Tencent-Hunyuan/HunyuanImage-3.0
HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation
Language: Python
#image_generation #native_multimodal_model
Stars: 234 Issues: 2 Forks: 7
https://github.com/Tencent-Hunyuan/HunyuanImage-3.0
lightly-ai/lightly-studio
Curate, Annotate, and Manage Your Data in LightlyStudio.
Language: Python
#computer_vision #image_labeling #mlops
Stars: 395 Issues: 6 Forks: 7
https://github.com/lightly-ai/lightly-studio
Tencent-Hunyuan/HunyuanVideo-1.5
HunyuanVideo-1.5: A leading lightweight video generation model
Language: Python
#image_to_video #text_to_video #video_generation
Stars: 360 Issues: 5 Forks: 17
https://github.com/Tencent-Hunyuan/HunyuanVideo-1.5
Hugo-Dz/spritefusion-pixel-snapper
A tool to snap pixels to a perfect grid. Designed to fix messy and inconsistent pixel art generated by AI.
Language: Rust
#game_development #gamedev #image_processing #pixel_art
Stars: 445 Issues: 3 Forks: 16
https://github.com/Hugo-Dz/spritefusion-pixel-snapper
Robbyant/lingbot-world
Advancing Open-source World Models
Language: Python
#aigc #image_to_video #lingbot_world #video_generation #world_models
Stars: 971 Issues: 11 Forks: 35
https://github.com/Robbyant/lingbot-world
PKU-YuanGroup/Helios
Helios: Real Real-Time Long Video Generation Model
Language: Python
#acceleration #diffusion #diffusion_model #diffusion_models #efficient_tuning #high__quality #image_to_video #image2video #interactive #long_context #long_video_generation #real_time #text_to_video #text2video #video_generation #video_generator #video_to_video #video2video #world_model #world_models
Stars: 712 Issues: 5 Forks: 46
https://github.com/PKU-YuanGroup/Helios
wuyoscar/gpt_image_2_skill
GPT Image 2 prompt gallery, image prompt library, agentic skill, and CLI for OpenAI image generation/editing
Language: Python
#agent_skills #ai_image_prompts #claude_code_skill #cli #codex_skill #gpt_image #gpt_image_2 #gpt_image_2_prompts #image_editing #image_generation #image_prompt #openai #prompt_library #prompt_templates #research_figures #text_to_image
Stars: 721 Issues: 0 Forks: 64
https://github.com/wuyoscar/gpt_image_2_skill
bytedance/Lance
A 3B-active-parameter native unified multimodal model for image and video understanding, generation, and editing.
Language: Python
#image_editing #image_generation #image_understanding #unified_multimodal_models #video_generation #video_understanding
Stars: 696 Issues: 10 Forks: 38
https://github.com/bytedance/Lance
boona13/image-extender
Seamlessly extend any image in any direction with AI. Open-source web app powered by Gemini via OpenRouter, with Poisson-blended seams and best-of-3 variant picker.
Language: TypeScript
#ai #gemini #image_generation #nano_banana #nextjs #openrouter #outpainting #poisson_blending #tailwindcss #typescript
Stars: 547 Issues: 0 Forks: 61
https://github.com/boona13/image-extender