Job Description
Job Description:
- Responsible for the architecture design and function development of Shopee basic monitoring. including monitoring data processing, incidents root cause analysing, standardise alarms, etc.
- Involve in Administration, Architecture, Agent management, dashboard, reports and alerts for monitoring & Job management tools
- Maintain current functional and technical knowledge of the platform and future products.
- Monitor and maintain monitoring performance, availability, and capacity
- Assist and provide expert best practices in adoption, expansion, additional use cases and in setting up monitoring items and jobs.
- Dive into metrics in monitoring system, identify and control all the metrics are reasonable.
Requirements:
- Pursuing a Bachelor's or Master's Degree in Computer Science, Engineering, or a related field.
- Strong grasp of computer science fundamental.
- Familiarity with the principles and usage of monitoring tools such as Prometheus, Open-Falcon, Zabbix, etc. Experience with secondary development is a plus.
- Understanding of high concurrency and high availability system design, with practical experience in developing distributed systems.
- Demonstrated technical enthusiasm with a willingness to explore new technologies and an innovative mindset.
- Familiar with Golang or Python. Prior project experience in these languages is preferred.