Senior Site Reliability Engineer

Senior level • Full-time • Remote in Romania
HDTVT011
210
Remote

Job Summary

Our partner combines innovation with great domain expertise to help market participants address the challenges of trading and maintain their focus on business growth.

They deliver fast, reliable solutions for connectivity, trading, and compliance. Customers rely on their solutions to consolidate their connectivity and trading infrastructure, addressing the full trade lifecycle across asset classes.

They have a unique design philosophy. Modular architecture with open interfaces to facilitate business-specific deployment as well as integration with third parties. This flexibility translates into agility for third parties, meaning faster time-to-market and the ability to quickly adapt to business needs and regulatory change.

Responsibilities

Platform Monitoring, Incident management & Level 2 Support

  • Management of daily operations of clients-hosted platforms
  • Proactive monitoring of global systems, working in production shift
  • Coordination of incident/crisis response & stakeholder communication
  • Timely resolution of incidents & escalation, preparation of post-mortem, tracking & follow up
  • Regular review of system’s capacity & performance, initiate upgrades/maintenance as required
  • Produce daily health checks and customer status reports
  • Leverage data analytics to anticipate trends and potential challenges

Implementation & Change Management

  • As a gatekeeper, own the change control process with the Client, partners & internal teams
  • Review & deploy customer platforms with the assistance of Technical consultants
  • Assess, schedule & deploy platform enhancements/upgrades and new business features
  • Improve & automate production tools/scripts (monitoring, failover, checks, backup/recovery)

Project & Engineering

  • Manage technical projects (eg production tools enhancements - monitoring, deployment automation, Cloud migration)
  • Run proof of concepts and validate new technical solutions (e.g. new Failover mechanism)
  • Ensure critical processes are improved & automated (CMDB, change control, capacity management, failover & DR)

Skills and Qualifications

  • Degree in Computing Science / Software Engineering
  • Minimum 3 years of experience as an Application Support Engineer
  • Excellent analytical / troubleshooting skills with a strong production mindset
  • Good general IT knowledge (OS Linux, networking, systems, Cloud technology)
  • Proficient with one or more scripting/programming languages: Java, Perl, Python, Shell
  • Knowledge of Agile/DevOps practices, automation tools (Ansible), and CI-CD pipeline
  • Strong experience with monitoring platforms (eg Nagios, ITRS)

Expected Behavior

  • Committed, self-motivated, and a fast learner
  • Strong team spirit with excellent communication
  • Proactive, independent, and proven ability to work under pressure
  • Fluency in English essential

Benefits

  • Dedicated learning time
  • Enhanced annual leaves (minimum 25 days)
  • Wellness allowance
  • Medical insurance
  • Internal and international mobility
  • Flexible work hours
  • Location-specific benefits

Wait, we have more...

There has to be one perfect job for you, so here are some more similar jobs.

București, +1
București, +1
Cluj-Napoca, +1
Cluj-Napoca, +1
Hybrid Cluj-Napoca, +1
Fullstack Software Engineer
Mid-level • Full-time • HDBOK002
Hybrid București
Quality Assurance Engineer
Mid-level • Full-time • HDTDC003
Hybrid București
Senior Backend Developer
Senior level • Full-time • HDAPV002
Hybrid Cluj-Napoca
Specialist Data Analytics and Tax Processes
Mid-level • Full-time • HDFDB007
Remote Brasov
Rollout Workstream Lead FICO
Senior level • Full-time • HDFDB006
Remote Brasov

Introducing the developer’s
console.

Sign up to our newsletter and you will receive periodic updates of new blog posts, contests, events and job opportunities.

$