JOIN OUR TEAM

Let's build the future

We are looking for founding researchers and engineers to solve hard problems in frontier generative speech and conversation models.

REMOTE
FULL-TIME
FOUNDING ROLE

Founding Audio Researcher

We are looking for a researcher to lead our efforts in novel audio LLMs and full-duplex audio modeling. You will be responsible for designing and training the next generation of our speech-to-speech models, moving beyond simple cascading systems to end-to-end architectures.

The Role

  • Research and train novel architectures for real-time speech-to-speech translation.
  • Explore full-duplex audio models that can handle interruptions, backchanneling, and overlapping speech naturally.
  • Work on voice cloning and style transfer to preserve speaker identity across languages.
  • Read papers and implement state-of-the-art techniques quickly.

Requirements

  • Deep understanding of Transformer architectures, Diffusion models, and Neural Audio generation.
  • Experience training large-scale models from scratch
  • A "product-first" research mindset. You care about the end-user experience, not just benchmarks.

REMOTE
FULL-TIME
FOUNDING ROLE

Founding Applied ML Engineer

We need an engineer who lives at the intersection of model performance and system efficiency. You will own the inference stack, optimizing our models for low-latency real-time usage and on-device deployment.

The Role

  • Optimize large audio and language models for production inference.
  • Implement efficient serving infrastructure using vLLM, TensorRT, or custom CUDA kernels.
  • Work on quantization and distillation for running high-fidelity models on consumer hardware and mobile devices.
  • Handle model fine-tuning pipelines and data engineering.

Requirements

  • Strong engineering background with experience in low-level optimization (CUDA, C++).
  • Experience with inference optimization tools like TensorRT, vLLM, ONNX Runtime.
  • Familiarity with model fine-tuning (LoRA, QLoRA) and efficient training techniques.
  • Deep understanding of GPU architecture and memory management.

About Us

Pinch was founded on the belief that language should not be a barrier to human connection. We are a small, distributed team of researchers and engineers passionate about AI, audio, and design.

We operate remotely but maintain a high-bandwidth culture of collaboration, often traveling to co-work in person. If you're ready to move faster than any research lab you've ever worked at, we'd love to talk.

The Pinch Team
PINCH

Making cross-lingual conversations as natural as native communication.

Built in San Francisco, New York, Valencia, Tallinn, and Tartu

© 2025 PINCH RESEARCH