GitHub repos

VAST-AI-Research/TripoSF
SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling
Language: Python
#3d_generation #3d_reconstruction #flexicubes #image_to_3d
Stars: 237 Issues: 2 Forks: 6
https://github.com/VAST-AI-Research/TripoSF

lum3on/comfyui_HiDream-Sampler
ComfyUI Wrapper for HiDream
Language: Python
#ai_art #comfy_nodes #comfyui #custom_node #diffusers #image_generation
Stars: 222 Issues: 32 Forks: 18
https://github.com/lum3on/comfyui_HiDream-Sampler

River-Zhang/ICEdit
Repository for paper "In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer"
Language: Python
#diffusion #diffusion_models #diffusion_transformer #dit #editing_image #image_editing #in_context
Stars: 136 Issues: 1 Forks: 3
https://github.com/River-Zhang/ICEdit

[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt released! Only 4GB VRAM is enough to run! - River-...

Tencent/HunyuanCustom
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation
Language: Python
#audio_driven #diffusion_models #image_to_video #image_to_video_generation #video_editing #video_generation
Stars: 360 Issues: 4 Forks: 14
https://github.com/Tencent/HunyuanCustom

JAMESYJL/ShapeLLM-Omni
A Native Multimodal LLM for 3D Generation and Understanding
Language: Python
#3d_captioning #3d_editing #image_to_3d #llm #text_to_3d
Stars: 223 Issues: 3 Forks: 6
https://github.com/JAMESYJL/ShapeLLM-Omni

GitHub - JAMESYJL/ShapeLLM-Omni: [NeurIPS 2025 Spotlight] A Native Multimodal LLM for 3D Generation and Understanding

Tencent-Hunyuan/Hunyuan3D-2.1
From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
Language: Python
#3d #3d_aigc #3d_generation #hunyuan3d #image_to_3d #shape #shape_generation #text_to_3d #texture_genertaion
Stars: 427 Issues: 13 Forks: 28
https://github.com/Tencent-Hunyuan/Hunyuan3D-2.1

SkyworkAI/Matrix-3D
Generate large-scale explorable 3D scenes with high-quality panorama videos from a single image or text prompt.
Language: Python
#3d_generation #3d_reconstruction #3d_scene_generation #aigc #aigc3d #genie #genie3 #graphics #image_to_3d #image_to_video #panorama_synthesis #scene_generation #text_to_3d #text_to_video #video_generation #world_models
Stars: 284 Issues: 7 Forks: 14
https://github.com/SkyworkAI/Matrix-3D

ENDESGA/PEP
Prediction-Encoded Pixels - a tiny yet powerful pixel art compression method
Language: C
#c #compression #image_compression #pixel_art #single_header
Stars: 241 Issues: 1 Forks: 3
https://github.com/ENDESGA/PEP

Tencent-Hunyuan/HunyuanWorld-Voyager
Voyager is an interactive RGBD video generation model conditioned on camera trajectory, and supports real-time 3D reconstruction.
Language: Python
#3d #3d_generation #aigc #hunyuan3d #image_to_3d #image_to_video #scene_generation #world_model #world_models
Stars: 323 Issues: 1 Forks: 20
https://github.com/Tencent-Hunyuan/HunyuanWorld-Voyager

Tencent-Hunyuan/HunyuanImage-2.1
HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation
Language: Python
#aigc #diffusion_models #diffusion_transformer #image_generation #text_to_image
Stars: 255 Issues: 7 Forks: 16
https://github.com/Tencent-Hunyuan/HunyuanImage-2.1

Tencent-Hunyuan/Hunyuan3D-Omni
Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets
Language: Python
#3d #3d_aigc #3d_generation #hunyuan3d #image_to_3d #multimodal #shape
Stars: 181 Issues: 0 Forks: 10
https://github.com/Tencent-Hunyuan/Hunyuan3D-Omni

Tencent-Hunyuan/HunyuanImage-3.0
HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation
Language: Python
#image_generation #native_multimodal_model
Stars: 234 Issues: 2 Forks: 7
https://github.com/Tencent-Hunyuan/HunyuanImage-3.0

lightly-ai/lightly-studio
Curate, Annotate, and Manage Your Data in LightlyStudio.
Language: Python
#computer_vision #image_labeling #mlops
Stars: 395 Issues: 6 Forks: 7
https://github.com/lightly-ai/lightly-studio

Tencent-Hunyuan/HunyuanVideo-1.5
HunyuanVideo-1.5: A leading lightweight video generation model
Language: Python
#image_to_video #text_to_video #video_generation
Stars: 360 Issues: 5 Forks: 17
https://github.com/Tencent-Hunyuan/HunyuanVideo-1.5

Hugo-Dz/spritefusion-pixel-snapper
A tool to snap pixels to a perfect grid. Designed to fix messy and inconsistent pixel art generated by AI.
Language: Rust
#game_development #gamedev #image_processing #pixel_art
Stars: 445 Issues: 3 Forks: 16
https://github.com/Hugo-Dz/spritefusion-pixel-snapper

Robbyant/lingbot-world
Advancing Open-source World Models
Language: Python
#aigc #image_to_video #lingbot_world #video_generation #world_models
Stars: 971 Issues: 11 Forks: 35
https://github.com/Robbyant/lingbot-world

PKU-YuanGroup/Helios
Helios: Real Real-Time Long Video Generation Model
Language: Python
#acceleration #diffusion #diffusion_model #diffusion_models #efficient_tuning #high__quality #image_to_video #image2video #interactive #long_context #long_video_generation #real_time #text_to_video #text2video #video_generation #video_generator #video_to_video #video2video #world_model #world_models
Stars: 712 Issues: 5 Forks: 46
https://github.com/PKU-YuanGroup/Helios

wuyoscar/gpt_image_2_skill
GPT Image 2 prompt gallery, image prompt library, agentic skill, and CLI for OpenAI image generation/editing
Language: Python
#agent_skills #ai_image_prompts #claude_code_skill #cli #codex_skill #gpt_image #gpt_image_2 #gpt_image_2_prompts #image_editing #image_generation #image_prompt #openai #prompt_library #prompt_templates #research_figures #text_to_image
Stars: 721 Issues: 0 Forks: 64
https://github.com/wuyoscar/gpt_image_2_skill

bytedance/Lance
A 3B-active-parameter native unified multimodal model for image and video understanding, generation, and editing.
Language: Python
#image_editing #image_generation #image_understanding #unified_multimodal_models #video_generation #video_understanding
Stars: 696 Issues: 10 Forks: 38
https://github.com/bytedance/Lance

boona13/image-extender
Seamlessly extend any image in any direction with AI. Open-source web app powered by Gemini via OpenRouter, with Poisson-blended seams and best-of-3 variant picker.
Language: TypeScript
#ai #gemini #image_generation #nano_banana #nextjs #openrouter #outpainting #poisson_blending #tailwindcss #typescript
Stars: 547 Issues: 0 Forks: 61
https://github.com/boona13/image-extender
