Technical Staff Member
Perplexity AI
Room: S01
At Perplexity, we serve production traffic on NVIDIA Hopper and NVIDIA Blackwell GPUs. Our in-house runtime, built on CUTLASS, FlashInfer, NVLink™, and NVSHMEM, serves models ranging from embeddings to large language models.
Powered by NVIDIA GTC Paris