Senior 3rd line Support Engineer - Monitoring - Automation

Posted 17 December 2021
Salary £70000.00 - £80000.00 per annum plus excellent package
Location City of London
Job type Permanent
Reference BBBH126392_1639755933
Contact Name Lucy Judge

Job description

My client, a leading financial institution, is currently looking for a Senior 3rd line Support Engineer.

This role is vital to the success of my client's IT Operations. The focus is to keep agility and stability in balance, and to constantly strive to eliminate complexity in the systems that Operations Engineers on-board and have operational responsibility for.

Some out-of-hours / on-call work may be required to fulfil this role.

Qualifications and Requirements

  • Experience of monitoring solutions (Nagios, Splunk, Datadog, SCOM, Grafana, Sensu, etc.) - ability to create alarms, monitors and dashboards, use tooling for proactive monitoring, and cleanse unnecessary alarms / reduce the "sea of red".
  • Automation - ability to automate tasks via coding or use of automation software to remove repeatable tasks. Creating automated recovery actions for common service faults, automation of releases, testing of new releases, etc.
  • Ensuring actions identified post-incident to reduce risk of recurrence are completed. Proactively monitoring service and identifying / reducing vulnerabilities. Using data to identify common themes in service failures. Ensuring readiness of resilient environments and that fail-over / DR strategy and test plans are regularly updated and tested.
  • Experience of ServiceNow, Automic Workload Automation, Active Directory, Windows Server, Unix, Citrix, VMware, SQL, Oracle, ITIL, PowerShell, AWS

Main Duties & Responsibilities

  • Operational Assurance
    • Availability of Applications and services
    • Implementation and maintenance of tools and automation solutions
    • Monitoring: Event management / performance / service availability / customer experience
    • Capacity Planning
    • Emergency Response: Recovery of service
    • Execution / Implementation of forensic-based enhancements, e.g. driving RCA, identifying themes and areas of multiple issues
    • Service readiness, Operations readiness
    • Documenting / knowledge sharing
  • Application Maintenance
    • On-going deployments, upgrades of applications
    • Lifecycle Management
    • Patching
  • Automation
    • Strong knowledge of PowerShell is essential
    • Self-healing and service health-checks
    • Automation of testing
    • Repeatable build capabilities
    • Simplify workflows
    • Automated pre/post change checks
    • Automated deployments
    • Reduce toil
  • Infrastructure
    • Server build and decommission
    • DNS record management
    • DHCP scope migration and provision
    • Active Directory Management / Sites and Services management / Domain Controller promotion and demotion
    • Strong knowledge of Group Policy
    • Certificate lifecycle management
    • Knowledge of AWS and Azure (nice to have)
    • Knowledge of mail proxy concepts and management, ideally Proofpoint (nice to have)
    • Office 365 mailbox creation, migration and administration (nice to have)