empty

Senior Operations Engineer (Elasticsearch and Kubernetes)

Recorded Future

Job Description

Posted on: 
September 17, 2024

Summary and company overview

Summary Information about the Role

Job Title: ASI Senior Operations Engineer

Company Overview:

With 1,000 intelligence professionals, over $300M in sales, and serving over 1,800 clients worldwide, Recorded Future is the world's most advanced, and largest, intelligence company!

Team Overview:

The Attack Surface Intelligence (ASI) scanning team is responsible for building and operating the tools and infrastructure which identify potential vulnerabilities in customers' publicly-exposed infrastructure. This involves port scanning, headless browsing, and active vulnerability scanning at various levels. The goal is to continuously improve both the breadth and depth of detection capabilities.

Job Overview:

The role is intended to be embedded in the scanning team, working closely with DevOps, Data, and Product Management to ensure the health of the ASI scanning infrastructure and application. The focus is on understanding how the application runs in the real world and ensuring it operates as intended. Additionally, the role will involve responsibilities for ensuring the reliability and performance of a data platform that includes data warehouses, data processing pipelines, and various messaging queues.

Responsibilities

Responsibilities of the Role

Scanning Infrastructure:

  • Work with Product Management to determine internal Service Level Objectives (SLOs) representing the long-term health of the scanning system (both infrastructure and application).
  • Build and maintain metric collection that can feed Service Level Indicators (SLIs) to guarantee SLOs.
  • Build and maintain dashboards displaying SLIs, SLOs, and other key metrics indicating the health of the application.
  • Investigate cases where the scanning infrastructure or application is not performing as desired, provide guidance to the development team for improvements, and develop additional metric collection/alerting as needed.
  • Identify application/deployment configuration optimizations for best use of resources and impending resource gaps.
  • Work closely with DevOps to maintain and improve custom Kubernetes-based application deployment operators.
  • Identify and fix automation gaps to reduce operational toil.

Data Platform:

  • Ensure the reliability and performance of the data platform, including data warehouses (Elasticsearch, ClickHouse, ScyllaDB, PostgreSQL).
  • Manage and optimize data processing pipelines using Apache Spark.
  • Work with messaging queues such as Apache Kafka, RabbitMQ, and Redis to ensure seamless data flow and processing.
  • Develop and maintain metric collection, monitoring, and alerting for the data platform.
  • Build and maintain dashboards displaying key metrics for data platform health and performance.
  • Collaborate with data engineering teams to troubleshoot and resolve issues related to data storage, processing, and messaging.
  • Implement best practices for data platform security, scalability, and performance.

Job Requirements

Qualifications of the Role

Required Qualifications:

  • Passion for "digging deep" and looking for "weird things" in the system that could indicate pending issues not captured in monitoring.
  • Experience with one or more metrics collection tools (managed and self-hosted): Prometheus, DataDog, InfluxDB, Telegraph, VictoriaMetrics.
  • Experience or desire to build visualization dashboards.
  • Experience with deployment of applications within a Kubernetes environment (preferably) or containerization (Docker, etc).
  • Experience with data warehouses (Elasticsearch, ClickHouse, ScyllaDB, PostgreSQL).
  • Experience with data processing pipelines using Apache Spark.
  • Experience with messaging queues such as Apache Kafka, RabbitMQ, and Redis.

Preferred Qualifications:

  • Ideally, experience with Kubernetes administration and custom operators.
  • Ideally, experience with Python.
  • Experience with network and vulnerability scanning tools a huge plus (nmap, zmap, masscan, nuclei, burp, NESSUS, etc).
  • Familiarity with the network stack and operating system tuning: TCP/IP, conntrack, NAT, etc.

Additional commentary

Additional Commentary

Why should you join Recorded Future?

Recorded Future employees (or "Futurists"), represent over 40 nationalities and embody our core values of having high standards, practicing inclusion, and acting ethically. Our dedication to empowering clients with intelligence to disrupt adversaries has earned us a 4.8-star user rating from Gartner and more than 45 of the Fortune 100 companies as clients.

Want more info?

  • Blog & Podcast: Learn everything you want to know (and maybe some things you'd rather not know) about the world of cyber threat intelligence
  • Instagram & Twitter: What's happening at Recorded Future
  • The Record: The Record is a cybersecurity news publication that explores the untold stories in this rapidly changing field
  • Timeline: History of Recorded Future
  • Recognition: Check out our awards and announcements

We are committed to maintaining an environment that attracts and retains talent from a diverse range of experiences, backgrounds and lifestyles. By ensuring all feel included and respected for being unique and bringing their whole selves to work, Recorded Future is made a better place every day.

If you need any accommodation or special assistance to navigate our website or to complete your application, please send an e-mail with your request to our recruiting team at careers@recordedfuture.com

Recorded Future is an equal opportunity and affirmative action employer and we encourage candidates from all backgrounds to apply. Recorded Future does not discriminate based on race, religion, color, national origin, gender including pregnancy, sexual orientation, gender identity, age, marital status, veteran status, disability or any other characteristic protected by law.

Recorded Future will not discharge, discipline or in any other manner discriminate against any employee or applicant for employment because such employee or applicant has inquired about, discussed, or disclosed the compensation of the employee or applicant or another employee or applicant.

Notice to Agency and Search Firm Representatives:

Recorded Future will not accept unsolicited resumes from any source other than directly from a candidate. Any unsolicited resumes sent to Recorded Future, including those sent to our employees or through our website, will become the property of Recorded Future. Recorded Future will not be liable for any fees related to unsolicited resumes.

Agencies must have a valid written agreement in place with Recorded Future's recruitment team and must receive written authorization before submitting resumes. Submissions made without such agreements and authorization will not be accepted and no fees will be paid.

Summary and company overview

Summary Information about the Role

Job Title: ASI Senior Operations Engineer

Company Overview:

With 1,000 intelligence professionals, over $300M in sales, and serving over 1,800 clients worldwide, Recorded Future is the world's most advanced, and largest, intelligence company!

Team Overview:

The Attack Surface Intelligence (ASI) scanning team is responsible for building and operating the tools and infrastructure which identify potential vulnerabilities in customers' publicly-exposed infrastructure. This involves port scanning, headless browsing, and active vulnerability scanning at various levels. The goal is to continuously improve both the breadth and depth of detection capabilities.

Job Overview:

The role is intended to be embedded in the scanning team, working closely with DevOps, Data, and Product Management to ensure the health of the ASI scanning infrastructure and application. The focus is on understanding how the application runs in the real world and ensuring it operates as intended. Additionally, the role will involve responsibilities for ensuring the reliability and performance of a data platform that includes data warehouses, data processing pipelines, and various messaging queues.

Apply now