CrawlJobs Logo

Sr Software Engineer - Observability

https://www.t-mobile.com Logo

T-Mobile

Location Icon

Location:
United States, Bellevue

Category Icon
Category:
IT - Software Development

Job Type Icon

Contract Type:
Employment contract

Salary Icon

Salary:

113600.00 - 205000.00 USD / Year

Job Description:

Our team is searching for a Sr Software Engineer focusing on observability and telemetry. You will work with other software, test engineers, SRE engineers, and systems engineers to craft, implement, and deploy capabilities that support our observability initiatives. This role is critical in ensuring system reliability, performance, and availability through effective monitoring, logging, tracing, and alerting solutions. As a Senior Engineer, you will work closely with Site Reliability Engineering (SRE), DevOps, and Application teams to develop robust observability strategies that enable proactive issue resolution and system insights.

Job Responsibility:

  • Design and implement best-in-class observability solutions, including monitoring, logging, distributed tracing, and event correlation
  • Develop and maintain observability tooling, integrating technologies such as Prometheus, Grafana, OpenTelemetry, Splunk, Datadog, New Relic, or similar
  • Define and optimize alerting mechanisms to ensure the right signals are surfaced to teams, reducing noise and improving incident response
  • Analyze system performance, identify bottlenecks, and work with engineering teams to improve application and infrastructure efficiency
  • Automate observability-related processes, such as log analysis, anomaly detection, and self-healing mechanisms
  • Partner with development, operations, and security teams to implement observability best practices across services and applications
  • Support on-call teams by providing insights and visibility into system behavior, assisting in real-time troubleshooting
  • Build and maintain dashboards that provide clear insights into application health, performance, and user experience
  • Contributes to designs to implement new ideas which use new frameworks to improve an existing or new system/process/service
  • Review existing designs and processes to highlight more efficient ways to complete existing workload more effectively through industry perspectives
  • Understands the creation of company IPR
  • Collaborates with technical teams and applies system expertise to deliver technical solutions
  • Continuously learns and teaches others existing and new technologies
  • Contributes to the development of others through mentoring or in house workshops and learning sessions
  • Contributes to new and existing technology options that support business goals
  • Understands current technology that supports business goals. Understands system protocols, how systems operate and data flows. Aware of current technology benefits. Expected to independently develop a full software stack. Understands the building blocks, interactions, dependencies, and tools required to complete software and automation work. Independent study of current technology is expected. Interact with system engineers to define system requirement and/or necessary requirements for automation
  • Writes basic documentation on how technology works. Creates clear documentation for new code and systems used
  • Documenting systems designs, presentations, and business requirements for consumption and consideration at the manager level

Requirements:

  • Bachelor's Degree in Computer Science, Engineering or other technical subject area
  • 4+ years technical engineering experience
  • 5+ years of experience in Observability, SRE, DevOps, or related fields
  • Hands-on experience with observability tools such as Prometheus, Grafana, ELK (Elasticsearch, Logstash, Kibana), Splunk, Datadog, New Relic, OpenTelemetry, or similar
  • Strong knowledge of scripting languages (Python, Bash, etc.) and infrastructure as code (Terraform, Ansible, etc.)
  • Experience with cloud platforms (AWS, Azure, GCP) and containerized environments (Kubernetes, Docker)
  • Deep understanding of distributed tracing, structured logging, and metrics collection
  • Communication
  • Customer Service
  • Analytics
  • Technical Writing
  • At least 18 years of age
  • Legally authorized to work in the United States
What we offer:
  • Competitive base salary and compensation package
  • Annual stock grant
  • Employee stock purchase plan
  • 401(k)
  • Access to free, year-round money coaches
  • Medical, dental and vision insurance
  • Flexible spending account
  • Paid time off
  • Up to 12 paid holidays
  • Paid parental and family leave
  • Family building benefits
  • Back-up care
  • Enhanced family support
  • Childcare subsidy
  • Tuition assistance
  • College coaching
  • Short- and long-term disability
  • Voluntary AD&D coverage
  • Voluntary accident coverage
  • Voluntary life insurance
  • Voluntary disability insurance
  • Voluntary long-term care insurance
  • Mobile service & home internet discounts
  • Pet insurance
  • Access to commuter and transit programs

Additional Information:

Job Posted:
April 05, 2025

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:
Welcome to CrawlJobs.com
Your Global Job Discovery Platform
At CrawlJobs.com, we simplify finding your next career opportunity by bringing job listings directly to you from all corners of the web. Using cutting-edge AI and web-crawling technologies, we gather and curate job offers from various sources across the globe, ensuring you have access to the most up-to-date job listings in one place.