Nvidia's new text-to-3D model shows how fast generative AI is advancing

An image of a 3D origami dog on a skateboard generated by Nvidia LATTE3D

(Image credit: Nvidia)

Nvidia's on quite a roll. After revealing its Blackwell superchip, which is designed for the training of more powerful AI models like GPT, Claude and Gemini, it's teased a text-to-3D AI tool of its own (see our guide to the best graphics cards for consumer options).

The graphics card giant closed GTC week by showcasing LATTE3D, a text-to-3D generative AI model that it described as a "virtual 3D printer". It can turn text prompts into 3D representations of objects and animals within a second.

Nvidia says the 3D shapes generated by LATTE3D can be "easily served up in virtual environments for developing video games, ad campaigns, design projects or virtual training grounds for robotics". We've seen text-to-3D tools before, and commends online suggest some aren't too impressed with the quality of LATTE3Ds results. But the new model represents a big advance, especially in terms of speed.

LATTE3D was developed by Nvidia's Toronto-based AI lab team and was trained using text prompts generated using ChatGPT to improve the model’s ability to handle the various phrases a user might come up with to describe a particular 3D object. While the researchers trained LATTE3D on two specific datasets, animals and everyday objects, the same architecture could be used to to train the AI on other data types. It remains a research project only and is not available to for public use.

The AI creator Bilawal Sidhu wrote on X: "This leap is huge. DreamFusion circa 2022 was slow and low quality, but kicked off this generative 3D revolution. Efforts like ATT3D (Amortized Text-to-3D Object Synthesis) chased speed at the cost of quality. Now with LATTE3D is high quality and processes in less than a second! Meaning you can quickly iterate and populate a 3D world using text or image to 3D."

Thank you for reading 5 articles this month* Join now for unlimited access

Enjoy your first month for just £1 / $1 / €1

*Read 5 free articles per month without a subscription

Join now for unlimited access

Try first month for just £1 / $1 / €1

TOPICS

Joe is a regular freelance journalist and editor at Creative Bloq. He writes news, features and buying guides and keeps track of the best equipment and software for creatives, from video editing programs to monitors and accessories. A veteran news writer and photographer, he now works as a project manager at the London and Buenos Aires-based design, production and branding agency Hermana Creatives. There he manages a team of designers, photographers and video editors who specialise in producing visual content and design assets for the hospitality sector. He also dances Argentine tango.

Recommended reading