افود | Ofood

Computers, Information Technology, and Internet · 51 to 200 employees · ofood.app

Job openings: 28

Hiring: Site Reliability Engineer (SRE)

Job category

IT / DevOps / Server
Location

Tehran, Tehran
Employment type

Full-time
Minimum work experience

Three to six years
Salary

Negotiable

Job description

As a Site Reliability Engineer (SRE), you will play a critical role in ensuring the availability, reliability, and performance of the systems that power our logistics platform. You will be responsible for maintaining and improving the infrastructure and tools that support our services, with a focus on monitoring, alerting, and automation. Your expertise in Grafana, Prometheus, Python, and Linux will be essential in driving operational excellence and enabling rapid response to incidents. Responsibilities may include designing and implementing continuous integration and delivery pipelines, managing cloud infrastructure, monitoring system performance, troubleshooting issues, and ensuring security and compliance. Strong communication skills and experience with Docker, Kubernetes, Git, IaC, CaC, and CI/CD tools are often required for this role.

Responsibilities:

Monitor and maintain systems' health, performance, and availability using Grafana and Prometheus.
Develop, implement, and maintain automated monitoring, alerting, and reporting solutions to detect and resolve system issues proactively.
Collaborate with development and operations teams to identify and implement system improvements, including performance optimizations, capacity planning, and automation of repetitive tasks.
Investigate and troubleshoot incidents, analyze root causes, and implement remediation actions.
Create and maintain technical documentation, including runbooks and playbooks, to ensure effective knowledge sharing and incident resolution.
Design, implement, and maintain robust and reliable backup and disaster recovery solutions to protect critical systems and data.
Design and implement infrastructure and tools to support software development, testing, deployment, and monitoring.
Maintain and improve existing infrastructure and tools.
Work with developers to ensure that applications are designed and deployed in a scalable, secure, and efficient manner.
Automate manual processes using tools such as Ansible, Bash, and Python.
Deploy, manage, and monitor applications in a Kubernetes environment.
Manage and secure network traffic using iptables or similar tools.
Manage and monitor databases, specifically MySQL.
Manage and monitor application logs and metrics using tools such as Pro…

Requirements:

At least 3 years of experience as a DevOps engineer, SRE, or in a related role.
Bachelor's degree in computer science, engineering, or a related field.
Prior experience in a Site Reliability Engineer or similar role, with a track record of improving system reliability, performance, and availability.
Strong experience with Linux system administration.
Strong experience with at least one programming language (e.g. Bash, Python).
Experience with Kubernetes, Docker, and container orchestration.
Experience with MySQL or other relational databases.
Experience with infrastructure automation tools (e.g. Ansible, Chef, Puppet).
Experience with Git and GitLab CI/CD.

Hard Skills:

Proficiency in Python scripting for automation tasks and system administration.
Deep understanding of Linux-based operating systems, including performance tuning, troubleshooting, and security.
Familiarity with monitoring tools such as Prometheus, Grafana, the ELK stack, or similar.

Soft Skills:

Strong problem-solving skills and the ability to analyze and resolve complex technical issues promptly.
Excellent communication and collaboration skills, with the ability to work effectively in a team-oriented environment.

Benefits:

Join our friendly and dynamic team and enjoy a range of perks, such as:

Professional development opportunities
Free breakfast every day
Birthday and anniversary gifts and surprises
Lunch and snack subsidies
Transportation budget
Comprehensive health insurance
Seasonal and special-occasion credits and discounts from Okala

About the company

Ofood entered the market in 1402 (Solar Hijri calendar, 2023/24) with investment from Kourosh Investment Group, a subsidiary of Golrang Industrial Group. Ofood's goal is to change the rules of Iran's on-demand delivery market for food, fruit, bread, coffee, and more, to the benefit of customers and store owners.
At any time of day, our users can open the Ofood website or app, browse nearby restaurants and other stores, order the food they want online, and receive it on time.
Our partner stores, in turn, get a win-win partnership through fair commissions and fast settlement.
With Ofood in the market, users have more options for meeting their needs and can experience freedom of choice.
At Ofood we have come together to bring about a major change. Achieving it requires quality, innovation, dynamism, and a passion for change and creation.
If you enjoy new, trend-setting experiences and want to be with us from the start of this change, we would be glad to have you at Ofood.

Required skills

DevOps, CI/CD, SRE, Kubernetes
Gender

No preference
Military service status

Educational exemption, permanent exemption, or completed service
Minimum education level

Bachelor's degree
