Principal Infrastructure Performance Engineer

permanent
Fully Remote

Only accepting applications from: United States

  • Build a resilient, secure, and efficient cloud based observability platform.
  • Monitor and troubleshoot platform issues, including finding solutions to reduce known issues.
  • Build and scale the observability infrastructure to meet rapidly increasing demand.
  • Develop and improve operational practices and procedures.
  • Improve database monitoring: develop custom prometheus exporters in Go for use cases that go beyond what is possible with SQL exporter. Create Grafana dashboards and alerts for these new metrics.
  • MCP servers for observability: deploy MCP server to integrate our observability stack with our LLM tools.

Experience

  • 8+ years of relevant production-level experience.
  • Experience with VictoriaMetrics.
  • Experience with Sumologic.
  • Experience with tracing tools (e.g. OpenTelemetry, Honeycomb, Tempo).
  • Experience with profiling tools (e.g. Pyroscope)
  • Knowledge of cloud monitoring, logging and cost management tools.
  • Programming/scripting knowledge (Go, Java, or Python) and understanding of JVM concepts.
  • In-depth knowledge of AWS services, hands-on experience in AWS provisioning using terraform.
  • Experience with containerized applications and Kubernetes / EKS. Creating and updating / maintaining Helm charts.
  • Understanding of microservices architecture and debugging/investigation techniques.
  • Strong understanding of systems, networking and troubleshooting techniques.
  • Experience in automated build pipeline, continuous integration and continuous deployment.
  • Ability to operate in an agile, entrepreneurial start-up environment.
  • Experience with running Linux in production.

Salary and Perks

  • Competitive salary and stock option plan.
  • 100% paid coverage of medical, dental and vision insurance.
  • Flexible PTO.
  • Learning stipend for personal growth and development.
  • Paid parental leave.
  • Health & wellness initiatives.

About Upgrade

Upgrade is a fintech unicorn founded in 2017 that helps millions of families across America using a credit line, personal loan, or Rewards Checking or Premier Savings accounts

Upgrade is a fintech unicorn founded in 2017 that helps millions of families across America using a credit line, personal loan, or Rewards Checking or Premier Savings accounts

View all devops and sysadmin jobs

Workster

Remote Jobs for US Residents

We've built a new platform specifically for US residents to find remote work.

Discover Workster

Power Search

Find the jobs that don't get advertised

We've built a tool to help you discover all of the remote jobs that never get advertised.

Discover Power Search