November 4

AI Inference Engineer - High Tech Engineering Center a.s.

HTEC (formerly Certicon) is a technology company operating since 1996. We focus on comprehensive services in the areas of design, development, diagnostics, and automated software testing for major international clients in the healthcare, telecommunications, automotive, and aviation industries.

Join a team building the software foundation for next-generation AI compute platforms.
You’ll work across the full technology stack – from low-level kernels and hardware-optimized operators to large-scale ML deployment frameworks – collaborating closely with compiler engineers, ML researchers, and hardware specialists.

Your role will be to help shape cutting-edge AI infrastructure, fine-tune software for custom hardware, and expand your expertise in system software and machine learning.

What you’ll do

  • Design, develop, and maintain components for AI compute platforms
  • Implement and optimize key ML operators (e.g., GEMMs, convolutions, BLAS routines)
  • Map computational graphs from ML frameworks to target hardware
  • Collaborate with compiler and hardware teams on core infrastructure
  • Debug and analyze performance issues at the system level
  • Build scalable and reliable software solutions, ensuring quality through testing and automation


What we’re looking for

  • Bachelor’s degree in Computer Science, Electrical Engineering, Mathematics, or related field
  • 3+ years of professional software development experience
  • Strong skills in C/C++ or Python within Linux environments
  • Good understanding of computer architecture, system software, and data structures
  • Experience with specialized hardware (GPUs, FPGAs, AI accelerators) – CUDA or OpenCL a plus
  • Solid grasp of ML fundamentals and motivation to learn new technologies
  • Responsible, proactive team player

Nice to have

  • Experience with inference or training frameworks (Triton, PyTorch, TensorFlow, DeepSpeed, ONNX Runtime, TVM, IREE)
  • Familiarity with distributed systems (MPI, Gloo)
  • Performance optimization and ML operator implementation
  • 2+ years developing software targeting AI hardware
  • Contributions to open-source projects (LLVM, PyTorch, TensorFlow, etc.)

What we offer

  • Flexible working hours and on-site/remote arrangements
  • Private medical, dental, and vision insurance with mental health coverage
  • Training programs and workshops
  • Continuous support for career progression
  • Our benefits: https://htec.com/careers/benefits/#

Benefits

Educational courses and training, Meal vouchers/meal allowance, 5 weeks of vacation, Sick days, Education allowance, Discounted employee loans, Contribution for sports/culture/leisure, Mobile phone, Work mostly from home, Company events, Flexible start/end of working hours, Laptop, Contribution to pension/life insurance, Option of study leave

About the position

Employment type:
Full-time
Contract duration:
Permanent
Employment relationship:
Employment contract
Recommended education:
Secondary school or vocational training with a school-leaving exam (maturita)
Recommended languages:
English (Advanced)