
High-fidelity 3D mesh generation from single images in about 45 seconds

One-2-3-45 is a single-image-to-3D system designed to address the latency and consistency problems of earlier Score Distillation Sampling (SDS) methods. Developed by researchers at UC San Diego, it is a feed-forward pipeline: a viewpoint-conditioned 2D diffusion model (Zero123) first generates multi-view images of a single object, and a cost volume-based 3D reconstruction module then turns those views into geometry. Because nothing is optimized per object, lifting a 2D image into a full 3D mesh takes approximately 45 seconds, where optimization-based approaches can take hours.
As of 2026, the architecture serves as a reference point for rapid asset creation in XR environments and game development pipelines. Its main strengths are geometric fidelity and multi-view consistency, which avoid the 'Janus problem' (the same face repeated on several sides of an object) common in early 3D generators. It can be deployed on consumer-grade GPUs with at least 24GB of VRAM, which suits local development and private enterprise deployment.
Uses a 3D cost volume to aggregate features from multi-view images for precise geometric localization.
A direct mapping from images to 3D geometry without iterative per-object optimization.
Checks that the generated multi-view images are consistently aligned in 3D space before mesh extraction.
Utilizes Signed Distance Functions to ensure the resulting mesh is watertight and manifold.
Deep integration with viewpoint-conditioned diffusion models for enhanced texture realism.
Internal pre-processing pipeline to isolate the subject for clean 3D extraction.
Post-processing step that optimizes vertex count while preserving visual detail.
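The cost-volume idea in the feature list above can be illustrated with a toy example. This is only a sketch, not the project's implementation: it assumes that per-view features have already been sampled at each voxel, and that they are aggregated by per-channel variance across views (a common choice in multi-view stereo, used here as an assumption). The names variance_cost and build_cost_volume and the feature values are hypothetical.

```python
def variance_cost(view_features):
    """Per-channel variance across views for a single voxel.

    view_features: one feature vector per view, all the same length.
    """
    n_views = len(view_features)
    n_channels = len(view_features[0])
    cost = []
    for c in range(n_channels):
        values = [feat[c] for feat in view_features]
        mean = sum(values) / n_views
        cost.append(sum((v - mean) ** 2 for v in values) / n_views)
    return cost


def build_cost_volume(sampled):
    """Map each voxel index (i, j, k) to its variance-based cost vector."""
    return {voxel: variance_cost(feats) for voxel, feats in sampled.items()}


# Toy data (made up): 2-channel features sampled from 3 views at 2 voxels.
sampled = {
    (0, 0, 0): [[0.9, 0.1], [0.9, 0.1], [0.9, 0.1]],  # views agree
    (1, 0, 0): [[0.9, 0.1], [0.1, 0.8], [0.5, 0.5]],  # views disagree
}
volume = build_cost_volume(sampled)
for voxel, cost in volume.items():
    print(voxel, [round(c, 4) for c in cost])
```

A voxel where all views agree gets a cost near zero, which is the signal a reconstruction network can use to place the surface. A real pipeline would store the volume as a dense or sparse tensor and learn the aggregation rather than use a dictionary.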
Clone the official GitHub repository for One-2-3-45.
Initialize a Conda environment with Python 3.9+.
Install CUDA-optimized PyTorch and relevant dependencies including Tiny-CUDA-NN.
Download the pre-trained weights for the multi-view diffusion model (Zero123).
Download the reconstruction module weights from the official UCSD storage.
Prepare a source image with a transparent background (segmentation may be required).
Configure the inference script parameters, adjusting the 'elevation' and 'azimuth' if necessary.
Execute the inference command to generate the intermediate multi-view images.
Run the marching cubes algorithm integrated into the pipeline to extract the final mesh.
Export the 3D model in .obj or .glb format for use in Blender or Unity.
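To make the 'elevation' and 'azimuth' parameters in the steps above more concrete, here is a minimal sketch of how such angles map to camera positions around an object. The convention is an assumption (z-up, elevation measured from the horizontal plane, azimuth measured around the vertical axis); the official inference script may use a different one. The function names camera_position and ring_of_views are hypothetical and not part of the One-2-3-45 codebase.

```python
import math


def camera_position(elevation_deg, azimuth_deg, radius=1.0):
    """Camera location on a sphere around the origin (assumed z-up convention)."""
    el = math.radians(elevation_deg)
    az = math.radians(azimuth_deg)
    x = radius * math.cos(el) * math.cos(az)
    y = radius * math.cos(el) * math.sin(az)
    z = radius * math.sin(el)
    return (x, y, z)


def ring_of_views(elevation_deg, n_views, radius=1.0):
    """Evenly spaced camera positions at a fixed elevation."""
    step = 360.0 / n_views
    return [camera_position(elevation_deg, i * step, radius) for i in range(n_views)]


# Example: 8 views at 30 degrees elevation, 2 units from the object.
for pos in ring_of_views(30, 8, radius=2.0):
    print(tuple(round(p, 3) for p in pos))
```

Every position lies at the same distance from the object, so changing the elevation only tilts the ring of cameras up or down. This is why a wrong elevation estimate for the input image can skew all generated views at once.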
Verified feedback from other users.
"Highly praised for speed and geometric consistency compared to DreamFusion, though it requires significant VRAM for local execution."