
Remote Site Reliability Engineer

First American Financial Corporation Remote
September 30, 2022
Homestead
Full-time
Company Summary

Join a team that puts its People First. First American Docutech offers a wide range of document technology solutions for mortgage, home equity, and consumer lending, including document generation, eDelivery, eSign, and print fulfillment. Our efficient solutions enable lending professionals to produce accurate and compliant loan packages in all 50 states. Since 1889, First American (NYSE: FAF) has held an unwavering belief in its people. They are passionate about what they do, and we are equally passionate about fostering an environment where all feel welcome, supported, and empowered to be innovative and reach their full potential. Our inclusive, people-first culture has earned our company numerous accolades, including being named to the Fortune 100 Best Companies to Work For list for seven consecutive years. We have also earned awards as a best place to work for women, diversity, and LGBTQ+ employees, and have been included on more than 50 regional best places to work lists. First American will always strive to be a great place to work, for all. For more information, please visit .

Job Summary

Open to 100% Remote Candidates

We're looking for a candidate who will work in a team atmosphere to develop and manage enterprise monitoring infrastructure and monitoring tools such as AWS-native tools and constructs, Terraform, and Elasticsearch Observability. This candidate will also advise internal project teams on how best to use these tools. Strong communication and technical skills are required.

What you'll accomplish:

  • Build solutions that provide monitoring patterns for various in-house and off-the-shelf applications across the company.
  • Measure and monitor all production systems with an eye towards availability, latency, and overall system health.
  • Engage with application teams to improve and evolve systems by lobbying for changes that improve reliability, resilience, and observability.
  • Contribute to continuous improvement initiatives for the team and customers with a goal to provide automation and enhance client service, efficiency, and profitability.
  • Fine-tune existing tools, or research, develop, and implement new tools, to deliver additional monitoring capabilities.
  • Work on complex problems where analysis of situations or data requires an in-depth evaluation of multiple factors.
  • Develop and implement monitoring patterns for various in-house and off-the-shelf applications across the company.

What you'll bring:

  • Experience in software engineering, software development, and/or system operations.
  • Experience with APM and observability using tools such as ELK Stack, AWS CloudWatch, New Relic, Splunk, Prometheus, Grafana, etc.
  • Proven ability to lead complex initiatives/projects from inception to completion.
  • Ability to analyze metrics and logs, using problem-solving techniques to provide guidance on monitoring, alerting, dashboarding, and visualization.
  • Ability to work with a high level of autonomy and with a globally distributed team.
  • Excellent communication skills, both verbal and written; able to explain complex technical topics to both internal and external stakeholders with ease and in remote/distributed environments.

Preferred Qualifications:

  • Hands-on experience with Elasticsearch including deployment and management of the Elastic Stack, Beats and/or Fleet Agents, APM, Dashboarding, and Reporting.
  • Hands-on experience with DevOps practices, including using Git and developing CI/CD pipelines.
  • Hands-on experience with Infrastructure as Code (Terraform preferred).
  • Hands-on experience with monitoring and log aggregation technologies.
  • Hands-on experience with cloud infrastructure such as AWS, Azure, or other cloud-based infrastructure.
  • Understanding of effective and useful dashboards, metrics, and SLOs.
  • Strong knowledge of cloud design patterns for monitoring, resiliency, etc.
  • Ability to understand and write code to perform various tasks related to automation & monitoring.

First American invests in its employees' development and well-being, empowers them to provide superior customer service and encourages them to serve the communities where they live and work. First American is committed to diversity and inclusion. We are an equal opportunity employer.

Based on eligibility, First American offers a comprehensive benefits package including medical, dental, vision, 401k, PTO/paid sick leave and other great benefits like an employee stock purchase plan.

