Automate infrastructure provisioning and management, using Terraform for provisioning and Ansible for configuration.
Build and maintain CI/CD pipelines to automatically deploy applications, integrating with Docker Compose services and preparing for Kubernetes deployments to enable zero-downtime releases.
Ensure system reliability and observability by implementing monitoring, logging, and alerting (SLIs/SLOs). Proactively identify issues and create mitigation strategies.
Participate in on-call rotations, perform root cause analyses (RCAs), and handle production issues across hybrid on-premises setups.
Script configurations for network and infrastructure devices.
Kubernetes migration and operations: Deploy and manage apps on Kubernetes clusters, optimizing for high availability, scaling, and auto-healing in multi-data-center environments.
Capacity planning and performance optimization: Monitor trends, plan for scale, and optimize workflows to support very large numbers of concurrent connections and geo-redundant services.
Collaborate cross-functionally: Work with engineering teams to review designs, evangelize best practices, and contribute to runbooks and automation tools.
Build custom automation and API-driven tasks, with coding proficiency in Python, Bash, or Go.
Required Skills:
5+ years of SRE or infrastructure engineering experience, with a proven track record in scripting and Git.
Hands-on expertise in Terraform (IaC for provisioning), Ansible (configuration management and orchestration), and CI/CD pipelines with GitLab CI.
Proficiency in containerization and orchestration: Docker Compose (deployment, scaling, troubleshooting).
Hands-on expertise in Git and in configuring network and infrastructure devices via scripts and code.
Server virtualization experience with any of VMware, Proxmox, or KVM.
Strong troubleshooting skills for production incidents, including log analysis, performance tuning, and disaster recovery.
Experience with monitoring and logging tools (Prometheus, Grafana, ELK) and automation to reduce toil.
Familiarity with object storage.
Familiarity with agile methodologies, on-call rotations, and SRE principles (error budgets, SLOs).
Mahsan (مهسان) is a software product company that specializes in modern communication systems and network security.
Each of our colleagues at Mahsan is a specialist in their own field, and all interactions and activities are conducted professionally.
We have worked to provide suitable spaces for every kind of work need. A lively work environment has been especially important to us. Our colleagues do their work with commitment. We all share one goal, and we put all our effort into achieving it.