prateekshukla1108/README.md

Hello!

I'm Prateek: CUDA developer, deep learning engineer, and lover of high-performance computing

I love writing compilers in MLIR and learning how the internals of inference engines and deep learning libraries work

I am currently writing a compiler for distributed training and inference that takes the best of existing inference engines and deep learning libraries

🦾 My Techs

🔎 GitHub Stats

Pinned

  1. 100-daysofcuda Public

    Kernels written for the 100 Days of CUDA challenge

    Cuda 7

  2. q-learning-cuda Public

    CUDA Q-Learning Implementation for a Grid World Environment

    Cuda

  3. seseme Public

    An attempt at running inference on the seseme audio model

    Python

  4. gemm Public

    Achieving 70%+ compute utilization on an RTX 4060

    Cuda