Question related to 3D mesh models in general: has any significant work been done on models oriented towards photogrammetry?
Case in point, I have a series of photos (48) that capture a small statue. The photos are high quality; the object was on a rotating platform. Lighting is consistent. The background is solid black.
These normally are ideal variables for photogrammetry but none of the various common applications and websites do a very good job creating a mesh out of it that isn't super low poly and/or full of holes.
I've been casually scanning huggingface for relevant models to try out but haven't really found anything.
COLMAP + CloudCompare with a good CUDA GPU (more VRAM is better) will give reasonable results for large textured objects like buildings. Glass/water/mirror/glossy surfaces will need to be coated to scan; dry-spray Dr. Scholl's foot deodorant seems to work fine for our object scans.
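Roughly, the pipeline I mean looks like this. This is a minimal sketch using the pycolmap bindings with placeholder paths; the dense patch-match step needs a CUDA-enabled COLMAP build, and CloudCompare (or COLMAP's own Poisson mesher) takes the fused cloud from there.

    # Minimal COLMAP pipeline driven from Python via pycolmap (paths are placeholders).
    # Sparse SfM is cheap; the dense patch-match step is the GPU/VRAM-hungry part.
    import os
    import pycolmap

    database = "colmap.db"
    images = "images"        # folder with the 48 statue photos
    sparse = "sparse"
    dense = "dense"
    os.makedirs(sparse, exist_ok=True)
    os.makedirs(dense, exist_ok=True)

    # 1. Features, exhaustive matching, incremental SfM (camera poses + sparse cloud).
    pycolmap.extract_features(database, images)
    pycolmap.match_exhaustive(database)
    maps = pycolmap.incremental_mapping(database, images, sparse)
    maps[0].write(sparse)

    # 2. Dense depth estimation and fusion (requires a CUDA build of COLMAP).
    pycolmap.undistort_images(dense, sparse, images)
    pycolmap.patch_match_stereo(dense)
    pycolmap.stereo_fusion(dense + "/fused.ply", dense)

    # 3. Quick mesh from the fused cloud; clean up / retopologize in CloudCompare.
    pycolmap.poisson_meshing(dense + "/fused.ply", dense + "/mesh.ply")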
There are now more advanced options than Gaussian splatting, and these can achieve normal playback speeds rather than hours of filtering. I'll drop a citation if I recall the recent paper and example code. However, note this style of 3D scene recovery tends to be heavily 3D location dependent.
You can never be sure what someone's real intent is. They might mean "something mesh-like". Personally I usually reply by asking for more info (I always have the XY Problem in mind), but that is time-consuming and some people assume you're being pedantic (I am, however, correct more often than not: people have posed the wrong question or left out critical parts of the context).
Yeah, I am explicitly asking about meshes, which is why I said that and also referenced photogrammetry. Sometimes people know what they're asking for help with.
Thanks for the links. Going to check them out this morning.
Just to be clear I wasn't singling you out. I don't know anything about you.
And furthermore I also often post questions that lack sufficient context.
My point was that it's always okay to ask for clarification, to assume there might be some broader context, or to offer a suggestion that doesn't follow the most literal interpretation of the question as asked.
Yeah some very impressive stuff with splats going on. But I haven't seen much about going from splats to high quality 3D meshes. I've tried one or two with pretty poor results.
Splats to meshes is an isosurface extraction problem, fundamentally. It's one of the great unsolved problems of computer graphics and having a good general algorithm would have massive ripple effects for any problem involving meshes.
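To make that concrete: here's a toy sketch of the naive approach, evaluating a few isotropic Gaussians on a grid and pulling out a level set with marching cubes. Real splats are anisotropic, have per-splat opacity, and are orders of magnitude denser, which is exactly where this falls apart.

    # Toy isosurface extraction: sum of isotropic Gaussians -> density grid -> marching cubes.
    # Only the shape of the problem, not a usable splat-to-mesh converter.
    import numpy as np
    from skimage import measure

    # A handful of fake "splats": (center xyz, sigma, weight)
    splats = [((0.4, 0.5, 0.5), 0.08, 1.0),
              ((0.6, 0.5, 0.5), 0.10, 0.8),
              ((0.5, 0.65, 0.5), 0.06, 0.6)]

    n = 96
    xs = np.linspace(0.0, 1.0, n)
    X, Y, Z = np.meshgrid(xs, xs, xs, indexing="ij")
    density = np.zeros((n, n, n))
    for (cx, cy, cz), sigma, w in splats:
        r2 = (X - cx) ** 2 + (Y - cy) ** 2 + (Z - cz) ** 2
        density += w * np.exp(-r2 / (2.0 * sigma ** 2))

    # Pick an iso-level and contour it; choosing that level robustly is the hard part.
    verts, faces, normals, values = measure.marching_cubes(density, level=0.5)
    print(len(verts), "vertices,", len(faces), "triangles")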
It's a rabbit hole, and I only really understood it when I realized that the minimum time between any GitHub committer's hobby example and an implementation of the 2003 state of the art is ~4 years.
Fingers crossed that Gaussian Splatting makes the rewards high enough that resources get poured on this.
For this exact use case I used instant-ngp[0] recently and was really pleased with the results. There's an article[1] explaining how to prepare your data.
On the geometry side, from a theoretical point of view, you can repair meshes [1] by inferring a signed or unsigned distance field from your existing mesh and then contouring that distance field.
If you like the distance field approach, there is also research work [2] on estimating neural unsigned distance fields directly (somewhat in the same spirit as Gaussian splats).
[1] https://github.com/nzfeng/signed-heat-3d [it works, but it's research code, so it's buggy, not user friendly, and mostly demonstrated on toy problems, because complexity explodes very quickly: on a grid the number of cells grows as n^3, and then they solve a sparse linear system on top (so total complexity bounded by n^6). Tolerating approximations and writing things properly, the practical complexity should be on par with methods like the finite element method in Computational Fluid Dynamics.]
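To illustrate the contouring idea at toy scale (nothing like the quality of [1]), here's a crude unsigned-distance-field version: build a grid of distances to the nearest sample of a hypothetical hole-riddled point cloud, then extract a thin shell around it. A signed field, as in [1], gives you a single clean surface instead of a two-sided shell.

    # Crude unsigned-distance-field contouring (a stand-in for the idea, not for [1]).
    # "partial_scan.npy" is a hypothetical (N, 3) point cloud with holes.
    import numpy as np
    from scipy.spatial import cKDTree
    from skimage import measure

    points = np.load("partial_scan.npy")
    n = 128                                   # grid resolution; cost grows as n^3
    lo, hi = points.min(0) - 0.05, points.max(0) + 0.05
    axes = [np.linspace(lo[i], hi[i], n) for i in range(3)]
    X, Y, Z = np.meshgrid(*axes, indexing="ij")
    grid = np.stack([X, Y, Z], axis=-1).reshape(-1, 3)

    # Unsigned distance from every cell centre to the nearest input point.
    dist, _ = cKDTree(points).query(grid)
    udf = dist.reshape(n, n, n)

    # Contour at ~1.5 cells' worth of distance: a thin shell around the samples.
    spacing = tuple((hi - lo) / (n - 1))
    verts, faces, _, _ = measure.marching_cubes(udf, level=1.5 * max(spacing), spacing=spacing)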
No. For small objects, it is typical to use a turntable to rotate the object; there are a number of commercial and DIY turntables with an automated motion system that can trigger the shutter after a specified degree of rotation.
The OC mentioned "static lighting". If the lighting was static while the platform was spinning, then the lighting on the object would be inconsistent, because the light would fall on it differently in each photo. You would have to fix the lighting to the platform so it spins with the object while taking the pictures to get consistent lighting.
I think you just nailed why I have been having a hard time with my photo set. It's the lighting. Well crap, because I don't have access to the statue or studio again. Thanks for the tip.
You could try generating per-view depth maps, going to a point cloud and meshing from there. (I suspect splats may reduce your accuracy as an intermediate.)
I’m not aware of a fully-baked workflow for that — though it may exist. The first step has gotten really good: the recent single-shot AI models for depth are pretty visually impressive (I don’t know about metric accuracy).
The ones I’m aware of are DUST3R and the newer MAST3R.
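For the middle step, the back-projection itself is only a few lines once you have intrinsics. A sketch assuming a plain pinhole model; fx/fy/cx/cy and the depth file are placeholders, and single-shot depth models usually give relative rather than metric depth, so expect to scale/align per view.

    # Back-project a single depth map to a point cloud (pinhole camera model).
    # "depth.npy" is a hypothetical (H, W) depth map for one view; intrinsics are made up.
    import numpy as np

    depth = np.load("depth.npy")
    fx = fy = 1200.0
    cx, cy = depth.shape[1] / 2.0, depth.shape[0] / 2.0

    v, u = np.mgrid[0:depth.shape[0], 0:depth.shape[1]]   # pixel row/column indices
    z = depth
    x = (u - cx) * z / fx
    y = (v - cy) * z / fy
    points_cam = np.stack([x, y, z], axis=-1).reshape(-1, 3)

    # To merge views you still need each camera's pose (R, t) to move points into a
    # common world frame, then run Poisson or ball-pivoting meshing on the fused cloud.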
Photogrammetry generally assumes a fully static scene. If there are static parts of the scene which the camera can see and also rotating parts, the algorithm may struggle to properly match features between images.
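One workaround, if you're stuck with a setup like that, is to mask everything but the object before feature detection, so the matcher never sees the static background. A rough sketch with OpenCV; the threshold is a guess for a black backdrop. If I remember right, COLMAP can also consume per-image masks directly (ImageReader.mask_path), which is usually easier than a custom pipeline.

    # Keep only features on the (moving) object by masking out the dark static background
    # before detection. Threshold value is a guess for a black backdrop; tune per dataset.
    import cv2

    img = cv2.imread("frame_000.jpg")
    gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)

    # Everything brighter than the backdrop counts as "object".
    _, mask = cv2.threshold(gray, 20, 255, cv2.THRESH_BINARY)
    mask = cv2.morphologyEx(mask, cv2.MORPH_CLOSE,
                            cv2.getStructuringElement(cv2.MORPH_ELLIPSE, (15, 15)))

    sift = cv2.SIFT_create()
    keypoints, descriptors = sift.detectAndCompute(gray, mask)  # mask restricts detection
    print(len(keypoints), "keypoints on the object")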
Kiri engine is pretty easy to use and just released a good update for their 3DGS pipeline, and they have one of the better 3DGS to mesh options.
https://kiri-innovation.github.io/3DGStoMesh2/
>These normally are ideal variables for photogrammetry
Actually no; my friend learned this the hard way during a photogrammetry project. He rented a photo studio, made sure the background was perfectly black, and took the photos, but the photogrammetry program (Meshroom, I think) struggled to reconstruct the mesh. I did some research and learned that it uses features in the background to help position the cameras when building the mesh. So he redid his tests outside with "messy" backgrounds and it worked much, much better.
This was a few years ago so I don't know if things are different now.
I’m not an expert and have only dabbled in photogrammetry, but it seems to me that the crux of the problem is identifying common pixels across images in order to, in effect, triangulate a point in 3D space. It doesn’t sound like something an LLM would be good at.
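Right, the classical core is feature matching plus triangulation rather than anything language-model-shaped. A rough sketch of that two-view step with OpenCV; P1/P2 are placeholder 3x4 projection matrices that would normally come out of calibration or SfM.

    # Two-view triangulation: match pixels between images, then intersect the rays.
    import cv2
    import numpy as np

    img1 = cv2.imread("view_00.jpg", cv2.IMREAD_GRAYSCALE)
    img2 = cv2.imread("view_01.jpg", cv2.IMREAD_GRAYSCALE)

    sift = cv2.SIFT_create()
    kp1, des1 = sift.detectAndCompute(img1, None)
    kp2, des2 = sift.detectAndCompute(img2, None)
    matches = cv2.BFMatcher(cv2.NORM_L2, crossCheck=True).match(des1, des2)

    pts1 = np.float32([kp1[m.queryIdx].pt for m in matches]).T   # 2xN pixel coordinates
    pts2 = np.float32([kp2[m.trainIdx].pt for m in matches]).T

    # Placeholder 3x4 projection matrices; in practice these come from calibration / SfM.
    P1 = np.loadtxt("P1.txt")
    P2 = np.loadtxt("P2.txt")

    pts4d = cv2.triangulatePoints(P1, P2, pts1, pts2)   # homogeneous 4xN
    pts3d = (pts4d[:3] / pts4d[3]).T                     # Nx3 points in world space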