Toronto, Ontario - Permanent
Our client are in the midst of an exciting transformation from on-premise to cloud-based infrastructure. The objective is not simply the implementation of cloud infrastructure, but a full shift to Infrastructure as Code (IaC) with Automated Infrastructure Management. The candidate will have the opportunity to contribute their experience and skills and to learn and work with a team of enthusiastic colleagues. The candidate will explore and assess the latest technologies, will chart their implementation path and make things happen.
The Role:To achieve our goals we seek a System Administrator that will manage, operate and support our existing infrastructure for both our corporate systems and our customer-facing systems while also working with our team to transform our infrastructure and our practices to modern cloud-based infrastructure and automation. Reporting to the Director, SaaS, the System Administrator role is ideal for candidates with strong on-premise infrastructure management and support skills, with a strong desire to shift their career towards cloud infrastructure and automated infrastructure management.
Must Have Skills:
-Possess a university degree in computer science or engineering or equivalent practical experience
-Minimum of 8 years of progressive experience including 2+ years experience in providing network support
-Track record of installing, monitoring and maintaining complex, business-critical applications
-Experience supporting end users in large-sized mixed technology environments, working with Windows Server, Linux, SAN hardware, firewalls, switches, load balancers and cloud technologies
-Demonstrated proficiency in the maintenance of hardware and software
-Working knowledge of Cloud Computing, particularly AWS
-Knowledge of Infrastructure as Code and Automated Infrastructure Management
-Working knowledge of issues related to system installations & troubleshooting
-Operating systems: Linux, Windows Server
-Virtual Machine Management: Xen Hypervisor
-Databases: PostgreSQL, MS SQL, standalone and in RDS
-AWS: EC2, EBS, Route 53, Load Balancers, S3
-Networking: Routers, VPN and Firewalls (particularly Fortinet)
-Tooling: Terraform (Other HashiCorp tools such as Packer, Consul, or Vault beneficial), SaltStack
-Source Code Control: SVN, Bitbucket/Git
-Scripting in Bash, Python
-PowerShell, Java, Maven, Ant
-New Relic, Cloud Watch, Elastic Stack and Pager duty
Desired Soft Skills:-Excellent analytical and problem-solving skills with strong attention to detail
-Strong interpersonal and communication (both verbal and written) skills with the ability to communicate and collaborate with technical and non-technical persons
-Punctual, honest and reliable
-Enthusiastic self-starter, able to work independently and to define and implement optimal processes
-Committed to life-long learning, proactively identifies areas of weakness and strives for personal improvement
-Aptitude and interest to learn and implement new technologies and to reduce manual tasks
-Able to create documentation (business case, implementation plan, process documentation, dashboards, etc.)
Nice to Have Skills:
-Working knowledge of New Relic, Cloud Watch, Elastic Stack and Pager duty
-Working knowledge of Cloud Computing, particularly Amazon Web Services
Major Responsibilities:Common Infrastructure Responsibilities
-Implement short- and long-term strategic plans to ensure infrastructure capacity meets existing and future requirements
-Develop and implement strategies for monitoring, maintaining and continuously improving network, server and storage infrastructure performance
-Manage hosts and databases in XenServer and AWS environments, including monitoring, backup, upgrades and disaster recovery preparation
-Manage deployments of the our Enterprise Wealth Management System, and suggest and develop process improvements
-Support and resolve high severity production incidents and respond to requests for assistance on technical questions, including regularly scheduled 24/7 on-call support for client-facing infrastructure
-Automate and improve processes
-Evaluate, develop and implement effective tooling, alerts, and automated responses to identify and address SLA risks
-Manage infrastructure incidents in a timely and effective manner
-Manage computing facilities, provide and oversee second and third-level support of infrastructure issues, diagnosing and troubleshooting systems to determine problem areas
-Maintain and organize the computer room; maintain and configure communications equipment such as concentrators, routers, and data transmission by modem, access to external database, Internet software and hardware
-In-depth knowledge of Active Directory, DNS and proficiency in PowerShell scripting
-Deep knowledge in backup, data recovery and environment recovery
-Assist with development of RFQs and RFIs, purchase orders, and other procurement artifacts related to the IT Platform Services group
-Ensure patch management policies are followed to keep Windows infrastructure updated (OS & Hardware)
-Automate the creation of public cloud resources in AWS: EC2, EBS, RDS, Load Balancers, etc. by developing our Terraform/Packer/SaltStack deployment infrastructure
-Develop reusable code for infrastructure automation
-Implement infrastructure reliability practices using monitoring tools like New Relic, Cloud watch etc.
-Install and configure employee computing devices
-Assist employees with hardware, software and network problems
-Identify, define and implement end-user computing tracking solutions
SaaS Operation Support
-Accountable for the quality of the services provided to Clients by the our Enterprise Wealth Management System (EWMS) SaaS platform, ensuring full compliance with the Service Level Agreement (SLA)
-Install, maintain, automate, control and monitor the EWMS SaaS environments and resources ensuring that they are secure, reliable and highly available
-Perform performance analysis using data collection tools and optimize the performance of the EWMS platform.
-Steadily improve performance, availability, and security through active analysis and configuration of production systems.
-Troubleshoot technical issues, identify and implement effective and optimized solutions through cross-functional collaborations.
-Eliminate manual tasks by implementing in-house and third-party tools for environment monitoring, business process management, etc.
-Provide high-quality client support directly and by supporting the Customer Success Analysts to ensure adherence with the Support Policy SLOs
-Plan, manage and execute along with other staff and customers activities in support of client business processes, for ex. playbooks, quarterly statements preparation, system and software upgrades, etc.
-Develop and manage service health dashboards
-Execute new client implementation activities following our internal methodology, including system setup and configuration and data conversion execution
-Plan and execute software configurations, monitoring and alerting tools setup and perform production readiness reviews.
-Assist clients with onboarding to our Flex assets and provide training on SaaS related topics ensuring that customers have a seamless onboarding experience
-Communicate and coordinate activities with clients and third parties (Fundserv)