Mitchell International, Inc.

  • Software Engineering Operations Manager

    Job Location: US-CA-San Diego
    Job ID: 18-8472
    Type: Regular Hire
  • Company Overview

    Mitchell International, Inc. is a leading provider of information and workflow solutions to the Property & Casualty Claims Industry and their supply chain partners. We solve interesting and complex problems that directly affect the customers our clients serve. We are constantly adapting to stay on the forefront of emerging technologies and we work diligently to maintain our position as a thought leader within our industry.

     

    Job Description

    Software Engineering Operations Manager 



    In this role, you will manage the supporting business applications used to deliver software-based solutions and services to our clients, along with third-party services, automation strategies, and the monitoring, alerting, and metrics reporting systems. We need someone with a proven track record of leading a Software Engineering Operations/DevOps team, who is passionate about uptime, reliability, and the overall performance of our enterprise applications and infrastructure. You will own site reliability and assist in capacity planning. This is a critical role that requires a high degree of technical mastery to ensure the quality of our customer experience. It is a highly technical, hands-on role that requires you to be a great team member as well as an individual contributor. The team is committed to providing operational support that delivers the highest systems uptime and operational transparency. The global team takes a matrix approach to 24x7 infrastructure engineering and support services. In this role you will help create a culture of change, collaboration, and communication, and enable a technology shift in the organization.


    Responsibilities

    • Manage a team of engineers who are passionate about responding quickly to production issues and driving them to resolution (operations responsibilities).
    • Monitor and support web application servers (J2EE, .NET), databases, applications, networks, and emerging technologies (Cloud, SQL, NoSQL, Big Data, etc.).
    • Guide and lead the operations team in 24x7 on-call support and off-hours upgrades and maintenance.
    • Monitor, analyze, and present transactional vital statistics and application uptime and response time, and recommend environmental or operational changes.
    • Troubleshoot the origin of errors across the website, application servers, network, and databases.
    • Recommend and implement best practices for application support in production environments.
    • Create and implement a strategy to ensure we meet uptime and performance SLAs.
    • Process and correlate data from multiple sources to derive the root cause of complex performance and availability issues, and provide solutions to resolve them.
    • Implement and manage enterprise-scale monitoring, trending, and alerting solutions.
    • Handle capacity planning, tuning, system stability, provisioning, performance, and scaling of the application infrastructure.
    • Manage and lead geographically distributed teams while working with counterparts.
    • Ensure best practices for support and problem management are adopted and practiced all the way through to root cause documentation.
    • Select, evaluate, and mentor staff, and guide their development, to ensure efficient operation of the platform and the continued success of organizational goals.

    Qualifications

    Desired Skills and Experience

    • Bachelor’s degree in Electrical, Electronics or Computer Science Engineering with 10+ years of work experience.
    • 3+ years of experience managing technical teams.
    • Experience with web application servers (J2EE, .NET) and with Application Performance Management (APM) and IT Operations Analytics (ITOA) tools such as AppDynamics, New Relic, etc.
    • 3+ years of experience with distributed server performance analysis and troubleshooting. Experience working in a DevOps environment is a plus.
    • Experience with system and performance monitoring, log analysis, and visualization using tools such as Nagios.
    • Knowledge of Python and general scripting skills are highly desired.
    • Exceptional analytical, diagnostic, and problem-solving skills.
    • Ability to present complex technical information in a clear and concise manner.
    • Strong presentation skills with demonstrated ability in customer interaction.
    • Ability to take command and control effectively during crisis situations.
    • Ability to implement and enhance technical documentation of systems and endpoints as needed.
    • Disaster Recovery and Business Continuity experience.
    • Knowledge of build and release management and of continuous integration tools and frameworks such as Git, Maven, etc. is a plus.
    • Good understanding of orchestration and CI/CD integration tools such as Jenkins.

     

    Mitchell International, an equal opportunity employer, values the diversity of our workforce and the knowledge of our people.  Mitchell will not discriminate against an applicant or employee on the basis of race, color, religion, national origin, ancestry, sex/gender, age, physical or mental disability, military or veteran status, genetic information, sexual orientation, gender identity, gender expression, marital status, or any other characteristic protected by applicable federal, state or local law.
