fakestuff @_fakestuff_ - Twitter Profile

26 days ago

PS5の4K描画で約5ミリ秒のパフォーマンスを実現。Activision、CoD向けレンダリング技術「AVBOIT」の解説資料を公開 https://t.co/ePSMk9fCWK 「AVBOIT」は、順序に依存しない透明表現を実現する手法で、資料では原理、実装、性能評価などが解説。なお、同資料は「SIGGRAPH 2025」講演で使われたもの

GameMakersJP's tweet photo. PS5の4K描画で約5ミリ秒のパフォーマンスを実現。Activision、CoD向けレンダリング技術「AVBOIT」の解説資料を公開
https://t.co/ePSMk9fCWK

「AVBOIT」は、順序に依存しない透明表現を実現する手法で、資料では原理、実装、性能評価などが解説。
なお、同資料は「SIGGRAPH 2025」講演で使われたもの https://t.co/3e8yJKjWWK

1

281

78

189

20K

_fakestuff_ retweeted

Sebastian Aaltonen

@SebAaltonen

26 days ago

We were doing memory testing last week on phones... 1. Don't just allocate. Fill. Reserved virtual pages don't consume any RAM. You need to commit the pages. 2. Filling with zero = perfect memory compression. Fill with random data to avoid memory compression. Memory is hard :)

9

240

7

73

18K

_fakestuff_ retweeted

Yining Karl Li @yiningkarlli

about 1 month ago

I'm a little late to the party on this one since it's from January, but I just read this great blog post by Jure Triglav walking through implementing surfel-based global illumination. It's got a bunch of really cool interactive toys/visualizers! https://t.co/T5fsPEVx2N

yiningkarlli's tweet photo. I'm a little late to the party on this one since it's from January, but I just read this great blog post by Jure Triglav walking through implementing surfel-based global illumination. It's got a bunch of really cool interactive toys/visualizers!

https://t.co/T5fsPEVx2N https://t.co/cG3DMIXocv

0

203

36

154

13K

_fakestuff_ retweeted

Sebastian Aaltonen

@SebAaltonen

about 1 month ago

At 6 bytes per splat, 1920x1080 (2Mpix) is just 12MB of read bandwidth. Claybook's SDF ray-tracer (1GB multilevel volume) consumed 8MB of read bandwidth for the distance field in my GDC 2018 benchmark slides. Splats don't have perfect occlusion culling, so there's some overhead of course. I did my math using tiny 16 splat clusters (a 4x4 screen region in the perfect case). So we should have culling granularity close to hardware HiZ. Much better than Nanite's 128 triangle clusters.

3

79

2

30

9K

Who to follow

_fakestuff_ retweeted

Sebastian Aaltonen

@SebAaltonen

about 1 month ago

This is one of the reasons we developed GPU-driven rendering at Ubisoft (for Rainbow Six Siege and Assassin's Creed: Unity). We spoke about it in SIGGRAPH 2015. GPU->CPU roundtrip latency is awful. Have to make all visibility decisions on the GPU side.

15

2K

95

531

91K

_fakestuff_ retweeted

Shlomi Steinberg @SteinbergShlomi

about 1 month ago

https://t.co/0rKJP6lNj1 The recorded EUROGRAPHICS 2026 talk for our wave tracing paper. Very honored that this paper received the Günter Enderle Best Paper Honorable Mention award. Collaboration with Matt Pharr (NVIDIA).

1

95

24

49

7K

fakestuff @_fakestuff_

about 2 months ago

@toreler Hi Tore, I just wondering if the MeshBlend can work with any kind of custom/stylization materials, or currently just standard pbr?

1

0

33

_fakestuff_ retweeted

Jacob Freeman

@GeForce_JacobF

about 2 months ago

Fun fact: DOOM: The Dark Ages uses Ray-Traced Global Illumination. If it had used older baked GI tech instead, the lighting data could have required up to 110GB and taken up to 68 days to bake 👀 From: https://t.co/Xo74fE97Mz

GeForce_JacobF's tweet photo. Fun fact: DOOM: The Dark Ages uses Ray-Traced Global Illumination. If it had used older baked GI tech instead, the lighting data could have required up to 110GB and taken up to 68 days to bake 👀

From: https://t.co/Xo74fE97Mz https://t.co/HnmTcRxvvX

49

1K

98

361

99K

_fakestuff_ retweeted

Sebastian Aaltonen

@SebAaltonen

about 2 months ago

With optimized storage: 4096x4096 BC7/ASTC4x4 texture + mips = 21.3MB Million vertices = 24MB - RGBA16 position (pow2 round up bbox) - RGB10A2 normal - RGB10A2 tangent - RGBA8 color - RG16 UV [-8,+8] = 24 bytes/vertex Million triangles = 3 million indices = 12MB Mesh = 36MB

17

407

22

222

43K

_fakestuff_ retweeted

Łukasz | Wookash Podcast

@wookash_podcast

about 2 months ago

Had a great time talking to @SebAaltonen about his work! Rendering technology, back then & now, Hype Hype works, as well as "No Graphics API" Sebastian, thank you so much for the time and effort! :) https://t.co/LJhzMVbcnh

12

414

46

145

73K

_fakestuff_ retweeted

Markus Schütz @m_schuetz

2 months ago

New Paper🙂 Nanite has shown that small triangles can be rendered fast in compute, we're exploring how fast for large meshes with up to 18.9 billion triangles, without the need to precompute LOD structures. Paper: https://t.co/F9u4xE6Na3 Source: https://t.co/1LgdSHVi7i

m_schuetz's tweet photo. New Paper🙂

Nanite has shown that small triangles can be rendered fast in compute, we're exploring how fast for large meshes with up to 18.9 billion triangles, without the need to precompute LOD structures.

Paper: https://t.co/F9u4xE6Na3

Source: https://t.co/1LgdSHVi7i https://t.co/Vx7S6Sf6Oq

9

875

129

575

69K

_fakestuff_ retweeted

Osvaldo Pinali Doederlein @opinali

2 months ago

Let's check NVidia's RTXDI SDK. Huge update v3.0.0, published Mar 10: a month before the paper. Highlights include "ReSTIR PT resampling functions", that doesn't sound super specific. But this SDK is mostly samples and docs, the runtime is in a separate project RTXDI-Library.

opinali's tweet photo. Let's check NVidia's RTXDI SDK. Huge update v3.0.0, published Mar 10: a month before the paper. Highlights include "ReSTIR PT resampling functions", that doesn't sound super specific.

But this SDK is mostly samples and docs, the runtime is in a separate project RTXDI-Library. https://t.co/sq7vl707VQ

1

10

2

4

1K

_fakestuff_ retweeted

Sebastian Aaltonen

@SebAaltonen

2 months ago

Vulkan just got a new descriptor heap extension. Another step towards the right direction: https://t.co/0QDsD6qYJ2 It makes push constants a struct in memory instead of separate API calls, which is a super nice improvement. Not exactly a GPU pointer to root data, but close.

2

82

4

23

6K

_fakestuff_ retweeted

Sebastian Aaltonen

@SebAaltonen

2 months ago

Super nice that Nvidia open sourced Lyra 2.0. Github repo and weights in HuggingFace. Seems much more usable than Genie 3. Seems to have a real 3d understanding of the environment, and doesn't drift.

5

118

4

60

13K

_fakestuff_ retweeted

3DxDEV

@3DxDEV7

2 months ago

What if your Unreal Engine 5 renders looked like living oil paintings? 🎨 Elena Felici just released a full free course on building a procedural painterly brush-stroke shader in UE5 Free course + project files linked 👇 #UnrealEngine5 #UE5 #StylizedArt #ShaderArt #NPR #3DArt

4

656

75

423

26K

_fakestuff_ retweeted

Alex Goldring

@SoftEngineer

3 months ago

New Shade (WebGPU engine) demo: https://t.co/ZjkK3pGD17 Emphasis is on performance improvements, especially for Macs. The engine is intended for high-end GPUs, but I hope to include as many devices, so it at least runs OK for most people. Features: - Global Illumination via Sparse Volumetric Lightmaps (SH3) - Ambient Occlusion + Bent Normals - Clustered Lighting - Automatic Exposure - HDR display support - Realtime Cascaded Shadow Maps (3 cascades) - HDR bloom with temporal stabilization - TAAU (60% upscale in this demo) - HZB occlusion culling (including shadow views) - Meshlet-based rendering - GPU-driven draw

SoftEngineer's tweet photo. New Shade (WebGPU engine) demo:
https://t.co/ZjkK3pGD17

Emphasis is on performance improvements, especially for Macs.

The engine is intended for high-end GPUs, but I hope to include as many devices, so it at least runs OK for most people.

Features:
- Global Illumination via Sparse Volumetric Lightmaps (SH3)
- Ambient Occlusion + Bent Normals
- Clustered Lighting
- Automatic Exposure
- HDR display support
- Realtime Cascaded Shadow Maps (3 cascades)
- HDR bloom with temporal stabilization
- TAAU (60% upscale in this demo)
- HZB occlusion culling (including shadow views)
- Meshlet-based rendering
- GPU-driven draw

4

111

8

66

8K

_fakestuff_ retweeted

Sebastian Aaltonen

@SebAaltonen

3 months ago

Optimized SSAO (GTAO) shader: 1.10ms -> 0.758ms New version didn't have fp16 optimized inner loop and did noise lookups inside the inner loop. Now I apply noise to line sample offset outside the loop. This is fine since only one sample is used (the max horizon).

1

91

5

36

9K

_fakestuff_ retweeted

Sebastian Aaltonen

@SebAaltonen

3 months ago

Occlusion culling shadow cascades is a clear win for large scenes with lots of depth complexity, especially if the sun angle is low. You can also plot receiver pixels to shadow map with atomics (pack 2x16bit min/max to 32-bit) to know the receiver range -> cheap early out.

3

110

6

72

11K

_fakestuff_ retweeted

Sebastian Aaltonen

@SebAaltonen

3 months ago

Bindless with 64-bit pointers is just 1 trip to memory. Don't need to fetch a buffer descriptor from the descriptor heap (using a 32-bit index). That's how CUDA and Metal operate. Available also in GLSL using Vulkan BDA extension. DX12 has no pointer support.

6

216

21

140

20K

_fakestuff_ retweeted

Sebastian Aaltonen

@SebAaltonen

3 months ago

The enthusiasm in this thread means that people want a WASM based (open source) web renderer. That's what I am building at the moment.

22

553

17

92

32K

fakestuff

@_fakestuff_

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users