Senior AI Performance Engineer (CUDA / GPU / NVIDIA Stack) Job at Brillfy Technology Inc, United States

  • Brillfy Technology Inc
  • United States

Job Description

 

Job Title: Senior AI Performance Engineer (CUDA / GPU / NVIDIA Stack)

Duration: Min 12+ Months

Location: 100% Remote

This is a hands-on engineering role requiring deep expertise in CUDA, GPU architecture, and performance profiling.

Key Responsibilities

  • Profile and optimize AI/ML workloads across multi-GPU and multi-node systems
  • Identify bottlenecks across compute, memory, networking, and orchestration layers
  • Optimize CUDA kernels (memory coalescing, shared memory usage, occupancy tuning)
  • Improve inference performance using TensorRT, Triton, DeepStream, and NeMo
  • Analyze and improve latency, throughput, GPU utilization, and memory efficiency
  • Work on distributed AI systems using Apache Ray, NCCL, and Kubernetes GPU scheduling
  • Build benchmarking frameworks and performance monitoring systems
  • Collaborate with AI, DevOps, and Infrastructure teams for system-wide optimization

Required Skills

  • Strong hands-on CUDA programming and GPU performance optimization
  • Deep understanding of GPU architecture and memory hierarchy
  • Experience with Nsight, CUDA profiling tools, and performance benchmarking
  • Hands-on experience with NVIDIA ecosystem (Triton, TensorRT, NeMo, DeepStream)
  • Experience with distributed AI systems (multi-GPU, multi-node, NCCL, Ray)
  • Experience working with AI models such as YOLO, GPT, LLaMA, Transformers
  • Strong understanding of AI system performance metrics (latency, throughput, utilization)

Preferred

  • Experience working at NVIDIA or similar GPU/AI infrastructure companies
  • Experience with real-time video / Vision AI systems
  • Experience with large-scale production AI deployments

Interview Process (Mandatory)

  • Candidates will receive a technical handout 1 day before the interview
  • 90-minute deep-dive demo discussion (NOT theoretical)
  • Candidate must explain:
      • Bottleneck identification approach
      • GPU optimization strategies
      • System-level performance improvements

Job Tags

Full time, Remote work
