AI Specialist (AI Engineering)

Hong Kong

TLDR

Improve on-device AI performance by compressing and optimizing large language and vision models across diverse hardware architectures.

We are looking for an AI Specialist Engineer to enhance the performance of large language and vision models for on-device inference. Your expertise will be crucial in developing and deploying cutting-edge AI solutions, ensuring optimal efficiency across diverse hardware architectures.

Responsibilities:

Compress and optimize large language and vision models for on-device inference.
Develop pipelines for model distillation and hardware-specific compilation.
Benchmark performance across various NPU/GPU architectures.

Qualifications:

Expertise in model distillation, pruning, and 4-bit/8-bit quantization techniques.
Hands-on experience with TensorRT, ONNX Runtime, and edge deployment.
Strong C++ and Python skills.

Hyphen Connect Limited

Hyphen Connect Limited is your go-to partner in the Web3 talent acquisition landscape, connecting top talent with opportunities across infrastructure, DeFi, NFTs, and gaming. We leverage data-driven insights and extensive resources to facilitate meaningful connections within the vibrant Web3 community, ensuring that local expertise meets global potential.

View company profile

AI Engineer

AI Specialist (AI Engineering)

TLDR

This job is no longer available