I read a theory that the larger dies are a pain to manufacture and that the binning process is churning out plenty of chips suitable for the 2080 but not enough for the 2080 ti. The EVGA step up page is suggesting that the queue might be months long which would suggest that this isn't a short term shortage but instead might be a longer term thing
2080 and 2080TI SLI review with plenty of games tested over at TPU: https://www.techpowerup.com/reviews/NVIDIA/GeForce_RTX_2080_Ti_SLI_RTX_2080_SLI_NVLink/
It's not a theory really, just common sense. Dies are cut from round wafers and have areas where there are imperfections. The larger the die, the less you get per wafer and the more you have a chance of hitting a bad area. This is why AMD are so sorted with Ryzen, because they literally "glue" them together (only using Infinity Fabric). Intel's are single monolithic, so very expensive to make. So yeah, the bigger the die the more it costs and the more chance you have of it being a dud.
It's a little more complex than that, certain parts of a wafer have higher failure rates than others, i would imagine Intel use the center for bigger, high performance, dies and the outside edges for smaller chips.
It'd make sense, if they weren't using different chips. The TU104 is not just a cut-down TU102, it's a completely different die. Even the GPCs are not the same, TU102 (and TU106) use 12 SMs/GPC, while TU104 uses 8 per GPC.
Doesn't matter if it is an entirely separate die or not, yield always drops massively as die size goes up, so the theory that TSMC is simply struggling to supply Nvidia with sufficient chips for the TI is perfectly plausible (even if impossible to prove as exact yields for a specific product are impossible to come by).
While i don't dispute that smaller die equals higher yields thing that's not how a wafer gets cut up, a single wafer hosts many different sizes depending on detected defects and the flexibility of how it can be segmented, you may get 10x 752mm2, 20x 652mm2, and 30x 552mm2 from a single wafer for example.
The wafer layout is decided prior to fabbing, and will almost always be only hosting one type of die. You cannot 'cut' one die into another, you can only bin a die into different variants. e.g. All of the Ryzen line use one die (Zen), but binned differently to give different core counts (and clock speed ranges). Trying to cram multiple different dies onto a wafer would mean having to switch out the masks midway through the imaging process, that that introduces all sorts of new opportunities for defects to creep in: you need to take a wafer with a live photoetch, remove it from the scanner, place it in a cartridge, move it to storage, swap the mask, reclean the scanner, move the wafer back to the cleaning station, then load back into the scanner, then realign with the existing pattern. Through all this you need to avoid both exposing the wafer (because it has a live photoetch that you still need of sufficient quality to expose), not get any particles onto the wafer during any of the movement stages (and your cleaning must be FAR less aggressive than normal because you can't destroy that unexposed photoetch mask), AND you need a scanner that can align preceisely to an existing pattern. On a single scan mass the alignment is implicit because the wafer and scanner mechanism are not moved relative to each other (just the scan head). Having a bunch of dies of different sizes from the same wafer is extremely rare, and basically nonexistant for production runs (may occur for research and speciality devices like astronomy CCDs).
No you can't 'cut' one die into another, but you can cut different sizes of dies from one wafer depending, as i said, the flexibility of how it can be segmented. Yes the wafer layout is decided prior to fabbing but that doesn't mean a defective 754mm2 die can't lose 209mm2 of defective wafer so you can make a 545mm2 die, or that you can't cut 100mm2 from a defective 545mm2 piece of wafer, depending, as i said, how you've deigned it.
No, you cannot. That's not how IC manufacturing works. At absolute best you can take a die and disable areas of it that do not work through efusing, but you cannot chop it down to any other size than the one it was designed to be. For example, this is why you will find the Zeppelin die with anything from 2 (EPYC 7251) to 8 cores enabled, but the die itself will ALWAYS be cut to identical dimensions.
Yup Ryzen dies are relatively cheap, and even if part of the die in that square was defective they can literally use it to balance the heat spreader on Threadripper. Lisa Su said at launch it was cheap and yields were high. This is why Intel hired Jim Keller lol, man's a genius.
The scaling is horrendous, when it even works; which in half the games they tested, it did not. SLI is dead, long live SLI.