Software Engineer (Multiple Levels) - Machine Learning Infrastructure, Slack Job at Slack, Seattle, WA

OWI0ejNITlQ0VEZWY2xkNWFydllHZnNpcnc9PQ==
  • Slack
  • Seattle, WA

Job Description

Software Engineer Role at Salesforce

Salesforce is the #1 AI CRM, where humans with agents drive customer success together. Here, ambition meets action. Tech meets trust. And innovation isn't a buzzword — it's a way of life. The world of work as we know it is changing and we're looking for Trailblazers who are passionate about bettering business and the world through AI, driving innovation, and keeping Salesforce's core values at the heart of it all.

Ready to level-up your career at the company leading workforce transformation in the agentic era? Agentforce is the future of AI, and you are the future of Salesforce.

The software engineer role at Salesforce encompasses architecture, design, implementation, and testing to ensure we build products right and release them with high quality. Equally important is advanced prompt engineering — the ability to write precise, structured prompts and cultivate the system context that makes AI outputs reliable, secure, and production-ready.

The AI and ML Infrastructure team is part of Slack's Core Infrastructure organization and is responsible for the foundational systems that enable machine learning and AI across the company. The team designs, builds, and operates reliable, scalable, and high performance platforms that allow product and ML teams to develop, deploy, and operate AI driven capabilities with confidence.

The team owns shared infrastructure, services, and tooling that support the full ML lifecycle, including model training, deployment, inference, and monitoring. As Slack AI continues to grow, the team is evolving from traditional ML deployments toward large scale, highly distributed systems. This work involves deep architectural decisions around scalable model deployment strategies, real time feature serving at very high throughput, GPU accelerated inference at message scale, and responsible training of models on sensitive data with strong privacy and safety requirements.

We are looking for Software Engineers to join the ML Infrastructure focus area and help architect and operate the core systems that power AI at Slack. In this role, you will own foundational infrastructure for large scale model training and inference, and evolve it into a reliable, secure, and self service platform used across the company.

You will work at the intersection of distributed systems, GPU infrastructure, and modern ML stacks, solving complex scalability and reliability challenges. This role blends deep systems engineering with a strong understanding of the ML lifecycle, and plays a critical part in shaping the long term technical foundations of Slack's AI capabilities.

Design, build, and operate systems to train, serve, and deploy machine learning models at scale, with a focus on reliability, performance, and operational simplicity

Evolve GPU backed inference infrastructure to support high throughput, latency sensitive workloads, including large scale model serving

Architect and optimize distributed training and data processing systems using platforms such as Ray, Airflow, Spark, or similar technologies

Build and maintain Kubernetes based platforms and orchestration layers using tools such as KubeRay, vLLM, and internally developed services

Architect solutions that bridge legacy systems with modern technologies while maintaining monolithic application stability

Develop robust monitoring, observability, and alerting for production ML workloads to ensure operational excellence

Partner closely with AI Platform, ML modeling, security, and product engineering teams to design infrastructure that supports evolving AI use cases

Provide technical leadership through design reviews, mentorship, and by setting engineering standards and long term architectural direction for ML infrastructure

Author technical design and architecture documentation, and contribute thought leadership through engineering blog posts

Build and ship high-quality, production-grade software using modern engineering practices, with AI as a core part of your development workflow by pushing the boundaries of AI development tools to deliver secure, optimized, and high-quality code.

Design and orchestrate complex systems where AI agents integrate seamlessly into human workflows, driving efficiency and innovation at scale.

Contribute to building and maintaining the shared system context, an explicit repository of system designs, constraints, and standards that enables AI to operate accurately and reliably.

Critically evaluate code (Human or AI-generated) for correctness, quality, security, and performance

Significant professional experience in software engineering with a strong focus on infrastructure, backend systems, platform engineering, or MLOps

Deep experience building and operating distributed systems, including expert level knowledge of Kubernetes and container based platforms

Hands on experience with modern ML infrastructure and serving stacks such as Ray or KubeRay, vLLM, or similar training and inference orchestration frameworks

Experience working with GPU infrastructure, including performance optimization and operational management at scale

Strong experience with data infrastructure and orchestration technologies such as Airflow, Spark, or similar systems

Experience building and operating cloud native systems on public cloud platforms such as AWS, GCP, or Azure, including infrastructure as code

A demonstrated ability to drive technical direction for complex systems and balance short term delivery with long term architectural goals

Excellent written communication, as well as ability to thrive in an asynchronous and globally distributed infrastructure team.

A related technical degree required

A demonstrated, genuine AI-first approach to engineering. Using AI to move faster, build fluency across the stack, and contribute well beyond your core specialty.

Experience using AI tools (e.g., Claude Code, GitHub Copilot, Codex, Cursor, etc.) in development workflows

Advanced prompt engineering skills and the ability to write precise, structured prompts and cultivate the system context that makes AI outputs reliable, secure, and production-ready.

Job Tags

Temporary work

Similar Jobs

Catapult Health

Nurse Practitioner Bilingual Job at Catapult Health

 ...PRN Nurse Practitioner Bilingual (English and Spanish) The PRN Nurse Practitioner...  ...status, or any other status protected under federal, state, or local law. This employer...  ...in E-Verify and will provide the federal government with your Form I-9 information to confirm... 

FocusGroupPanel

Remote Data Entry & Curation Specialist Job at FocusGroupPanel

 ...A data entry service provider is looking for a Remote Work From Home Data Entry Clerk for an entry-level position. This role offers the flexibility to work when you want, making it an excellent opportunity for anyone seeking side income or additional work from home. No... 

Lee Shettle Eye & Hearing

Licensed Optician Job at Lee Shettle Eye & Hearing

 ...practice with a large patient base, in addition to a large primary care referral base. We are seeking an experienced full-time licensed optician to manage our optical dispensary, in a friendly, personable environment. We are looking for someone who is passionate about... 

Johnson Controls, Inc.

HVAC Control Systems Engineer Job at Johnson Controls, Inc.

HVAC Controls Systems Engineer IIIUnder general direction, responsible for design, configuration, commissioning, start-up and operation of complete building control systems including fire, security, and other low voltage control subsystems (i.e. lighting, nurse call,... 

Allied Universal® Compliance and Investigations

Surveillance Investigator Job at Allied Universal® Compliance and Investigations

 ...Overview Company Overview: Advance Your Career in Insurance Claims with Allied Universal Compliance and Investigation Services. Allied Universal Compliance and Investigation Services is the premier destination for a career in insurance claim investigation. As...