Rajesh Shashi Kumar

Computer Architect at ARM

prof_pic.jpg

Hi, I’m Rajesh. I’m passionate about Computer Architecture and optimizing systems for Machine Learning. At ARM, I focus on scalability and efficiency of datacenter infrastructure for AI workloads.

With over 7 years of experience in the semiconductor industry, I’ve held roles at AMD Research, Qualcomm and Analog Devices, Inc., working across various levels of abstraction in Computer Architecture. I am well-versed in addressing challenges in parallel computer architecture and optimizing the performance of heterogeneous systems.

I earned my graduate degree from University of Wisconsin-Madison, where I worked with Prof. Matt Sinclair at the Heterogeneous Architectures Lab, on mechanisms to promote data reuse on chiplet-based GPU architectures.

news

Aug 16, 2025 I’m reading The Art of Multiprocessor Programming this fall as part of the Software Internals Book Club. Come join us!
Jul 10, 2025 Invited lightning talk at the Arm Global Engineering Conference, Birmingham, UK.

selected publications

  1. MICRO
    CPElide: Efficient Multi-Chiplet GPU Implicit Synchronization
    Preyesh Dalmia, Rajesh Shashi Kumar, and Matthew D. Sinclair
    In 2024 57th IEEE/ACM International Symposium on Microarchitecture (MICRO) , Nov 2024
  2. US Patent
    Cache Synchronization for Chiplet Accelerators
    Preyesh Dalmia, Rajesh Shashi Kumar, and Matthew D. Sinclair
    Sep 2024
    US Patent App. 18/188,209