About Us
B2B Startup is revolutionizing enterprise workflows by providing cutting-edge automation solutions. We're empowering businesses to achieve unprecedented levels of efficiency and scalability through our innovative platform. Join us and be a part of shaping the future of B2B technology.
As a NOC / SRE Engineer at B2B Startup, you will play a pivotal role in ensuring the reliability, performance, and scalability of our cloud infrastructure. You will contribute to building and maintaining a robust and automated environment that supports our growing customer base.
Responsibilities
- Design, implement, and maintain infrastructure as code using Terraform to provision and manage resources across GCP, AWS, and Azure.
- Develop and maintain CI/CD pipelines using tools like Jenkins, GitLab CI, or CircleCI to automate software deployments and infrastructure changes.
- Proactively monitor system performance and availability using tools like Prometheus, Grafana, and Datadog, identifying and resolving potential issues before they impact users.
- Develop and maintain Python or Go-based automation scripts to streamline operational tasks and improve system efficiency.
- Troubleshoot and resolve complex incidents across our cloud environments, collaborating with development and operations teams to identify root causes and implement effective solutions.
- Participate in on-call rotation to provide 24/7 support for critical systems and infrastructure.
- Design and implement Kubernetes clusters and manage deployments to ensure high availability and scalability of our applications.
Requirements
- SENIOR-level experience in SRE.
- Proficiency in GCP, AWS, Azure, Kubernetes, Python, Go, Terraform, CI/CD.
- Strong communication and collaboration skills.
- Experience with incident management and root cause analysis.
- Deep understanding of networking principles, including TCP/IP, DNS, and load balancing.
- Proven ability to automate infrastructure and application deployments.
- Experience with monitoring and alerting tools such as Prometheus, Grafana, and Datadog.
Nice to Have
- Experience with service mesh technologies like Istio or Linkerd.
- Contributions to open-source projects.
- Experience with configuration management tools like Ansible or Chef.
- Security certifications (e.g., AWS Certified Security - Specialty, GCP Professional Cloud Security Engineer).
What We Offer
- Competitive compensation
- Remote š work environment
- Professional growth opportunities
- Unlimited PTO
- Stock options