Senior Devops Engineer
AnthologyAI’s mission is to create an equitable and fair data economy online by giving users ownership of their personal data, the ability to securely control it, and various ways to put it to work to create value with brands they trust, while always preserving their privacy. We’re on a mission to build Personalized AI.
We’re led by industry veterans, backed by powerhouse investors, and crewed by the brightest minds in the game. We exist to make the internet a better place for all.
The DevOps Engineer plays a crucial role at AnthologyAI in ensuring the smooth and effective operation of AnthologyAI’s platform. The position will report into the VP of Engineering. Your skills will add to the team’s expertise in designing, implementing, and maintaining the systems and infrastructure that form the backbone of AnthologyAI’s platform. You will work closely with members of AnthologyAI’s Engineering team to troubleshoot issues, optimize performance, and deploy updates, ultimately ensuring high availability, security, and scalability. By monitoring system health, analyzing performance metrics, and identifying potential vulnerabilities, you will contribute to the overall stability and reliability of the platform, enabling the company to deliver uninterrupted services, enhance customer satisfaction, and drive business growth.
Responsibilities:
- Collaborate with senior engineers to design, implement, and maintain our systems infrastructure.
- Utilize Terraform to automate the provisioning, configuration, and management of infrastructure resources across multiple cloud platforms.
- Implement and configure monitoring and alerting solutions using Datadog to ensure the health and performance of our systems.
- Work with Helm to manage deployments and updates of containerized applications within Kubernetes clusters.
- Assist in the deployment and management of services on public cloud platforms such as AWS and GCP.
- Contribute to the development of observability practices and tools to enable effective monitoring, logging, and debugging of distributed systems.
- Administer and support distributed systems, ensuring their reliability, performance, and scalability.
- Utilize your experience with Kafka to build and manage distributed streaming platforms.
- Apply your knowledge of MongoDB or other NoSQL databases to support their administration and performance tuning.
- Troubleshoot and resolve infrastructure-related issues, ensuring minimal impact on operations.
- Collaborate with cross-functional teams to ensure the seamless integration of new services and applications into our existing infrastructure.
- Stay up-to-date with emerging technologies and industry trends, continuously improving your technical skills and knowledge.
Qualifications:
- Bachelor's degree in Computer Science, Engineering, Information Technology or a related field.
- Strong understanding of infrastructure-as-code (IaC) principles and experience using Terraform for infrastructure provisioning and management.
- Familiarity with monitoring and observability tools such as Datadog to track system performance, troubleshoot issues, and ensure scalability.
- Proficiency in managing containerized applications using Helm within Kubernetes clusters.
- Experience with public cloud platforms, preferably AWS and GCP, including deploying and managing services.
- Knowledge of distributed systems concepts, best practices, and hands-on experience with their administration.
- Experience with Kafka for building and managing distributed streaming platforms.
- Strong problem-solving skills and the ability to analyze and resolve complex infrastructure issues.
- Excellent communication and collaboration skills, with the ability to work effectively in a team-oriented environment.
- Self-motivated and eager to learn new technologies and stay updated with industry advancements.
Bonus points for:
- Relevant certifications such as AWS Certified Solutions Architect, GCP Cloud Engineer, or Kubernetes certifications.
- Familiarity with additional tools and technologies related to infrastructure automation, containerization, and distributed systems.
Salary: $160,000 - $180,000 base. Salary may vary based on experience.
We offer a competitive salary, equity, and opportunities for professional development.
**The position is remote, but we prefer candidates based in New York City due to occasional in-person meetings.
**There is currently no relocation and/or visa (immigration) assistance provided for this position.