ISD Monitoring Engineer III (Cloud)

Location: VA Vienna - Headquarters Full/Part Time: Full-Time Regular/Temporary: Regular

Job Description

Employee Perks

Why You Will Love Being Part of the Navy Federal Team:

*Competitive compensation with opportunities for annual raises, promotions, and bonus potential
*Best-in-Class Benefits! (7% 401k match / Pension plan / Tuition reimbursement / Great insurance options)
*On-site amenities include fitness center, wellness center, cafeteria, etc. at Pensacola, FL; Vienna, VA and Winchester, VA campuses
*Consistently Awarded Top Workplace
*Nationally recognized training department by TRAINING Magazine IND123
*An employee-focused, diverse, and service-oriented workplace environment

Basic Purpose

To research, evaluate, design, implement, and maintain system and product solutions, applying knowledge of engineering principles.  To provide technical direction and engineering support for projects and infrastructure.
1. Technical Direction

  • Quickly learn and maintain functional knowledge of evolving industry technologies, concepts and trends
  • Provide technical guidance and assistance to internal customers and other ISD staff
  • Recommend, develop, and maintain standard operating procedures for supported systems
  • Create, review and modify diagrams, schematics and other technical documents and templates
  • Establish technical direction, develop and initiate activities and assignments, organize and coordinate plans and efforts
  • Conduct engineering and technical research and make recommendations on proposed solutions; anticipates and adjusts for potential problems
2.  Engineering Support
  • Participate in technology research, procurement, deployment, and configuration of new and modified systems
  • Uses rigorous logic and methods to investigate and diagnose system and product defects, and collaborate with internal technical teams and vendors for resolution
  • Recommend design modifications to maximize availability and performance
  • Ensure security and integrity of system and product solutions
  • Complete work according to standard engineering principles and practices
  • Ensure compliance with applicable standards for information security
  • Train, guide and mentor less experienced staff
3. Design Solutions
  • Oversee the design and implementation of system and product solutions in respective area supported
  • Review and interpret system and product requirements.  Analyze system requirement documents and other data to evaluate feasibility, cost and maintenance prerequisites
  • Consult with engineers and other ISD staff to implement standard operating procedures and provide technical expertise and direction
  • Document new system components or modifications to existing components to comply with engineering design and performance specifications
  • Apply engineering principles into the design and enhancement of new and existing systems
  • Coordinate with other ISD and business unit staff regarding implementation and modification of system and product solutions
  • Analyzes and make recommendations regarding performance, scalability and availability metrics
4 Analysis and Research
  • Investigate operational or business problems and propose solutions
  • Ensure compliance with Navy Federal Credit Union ISD standards and best practices
  • Solve business problems by defining the problem, interviewing stakeholders, identifying and evaluating alternatives, and presenting the findings
  • Present complete and organized documentation of processes, systems, and data
  • Identify and analyze opportunities for new and/or improved processes, data, or technology; provide clear picture of possible outcomes
5. Communication
  • Presents consistent, concise, relevant, reliable and timely information to all appropriate internal and external audiences/stakeholders through a variety of  media
  • Ensures accuracy of information  to enable effective business decisions
  • Frame message in line with audience experience, background, and expectations; uses terms, examples, and analogies that are meaningful to the audience
  • Seek input from audience; confirms understanding; presents message in different ways to enhance understanding
6. Performs other related duties as assigned.
Role Specific Responsibilities
  • Responsible for developing new cloud Monitoring leveraging our existing tool set, Moogsoft AIOps, CA APM, Extrahop, SolarWinds, SiteScope, and Splunk Enterprise as well as cloud native monitoring tools
  • Working with project teams and other ISD stakeholders to define monitoring requirements & identify gaps in existing monitoring.
  • Developing cloud monitoring strategies and solutions for vital business applications and channels. 
  • Configuring and implementing monitoring solutions across ISD environments (lower and production)
  • Developing of standard operating procedures and guidelines
  • Maintaining dashboards used by Major Incident Management, CCNO, and other ISD areas
  • Integrate Event Management processes and solutions with Availability, Incident and Problem processes
  • Ensuring the communication of Event Management processes, monitoring technologies and infrastructure, and monitoring capabilities, with other departments, application owners, developers, technology and infrastructure support
  • Implementing Event Management processes through the ISD project and software development lifecycle. 
  • Providing level I and II troubleshooting and resolution support when there are enterprise monitoring tool, related outages or issues
  • Training internal employees, project teams, and other ISD stakeholders on the capabilities of our tools and their use once deployed to production.
Level 3   
  • Develops broad knowledge and skills in a specific practice area.
  • Evaluates, selects, and applies standard techniques, procedures, and criteria to perform a task or sequence of tasks for conventional projects with few complex features.
  • Collaboratively uses judgment to determine adaptations in methods for non-routine aspects of assignments.
  • Works on small projects or part of larger projects
  • Performs moderate design tasks. Prepares portions of project documentation. Edits specifications and performs research.
  • Assigns tasks to and coordinates work with entry-level engineers, technicians, or administrative staff.
  • Assists in determining schedule and budget requirements
  • Possesses effective oral and written communication skills.  Assists with vendor, customer, or management contacts and communications pertaining to specific assignments or meetings
Qualifications (all required unless otherwise noted)
  • Bachelor’s Degree in a related field, or the equivalent combination of education, training, and experience
  • Demonstrated expertise in cloud technologies including monitoring
  • Intermediate to Advanced experience in Azure Platform as a Service (Azure PaaS) or other Cloud providers
  • Intermediate to Advanced experience in Azure Infrastructure as a Service (Azure IaaS) or other Cloud Providers
  • Experienced in cloud infrastructure and resources
    • Subscriptions, Resource Groups, Storage Accounts, virtual machines, disks, virtual networks, load balancers, availability sets, Monitor, etc.
  • Understanding of cloud automation
    • OMS, PowerShell DSC, Update Management, Runbooks, Inventory Management, Change Tracking
  • Intermediate experience in scripting with Windows PowerShell
  • Working knowledge of Active Directory, Group Policy, LDAP, Kerberos, DNS, TCP/IP, WMI, SNMP
  • Working knowledge of MS SQL Server 2012/2014/2016 usage and administration
  • Designs and formulates plans for new and existing system and product solutions
  • Provides engineering support for large-scale 24x7 operational environment; including off-hours
  • Proven skillset in troubleshooting and resolving complex system and product defects
  • Experienced with system and product performance, reliability, and availability
  • Creates and maintains detailed schematics and other documentation for system and product solutions
  • Applies research and analytical techniques to the design and development of new and existing systems and products
  • Capable of managing multiple projects, resolve conflicting requests, and adapt to changing requirements and priorities
  • Experienced in large project efforts from a technical perspective
  • Strong analytical, planning and technical problem solving skills
  • Clearly and concisely present findings and conclusions
  • Ability to exercise initiative, produce desired results and achieve objectives
  • Solid documentation and organizational skills
  • Comfortable working with all levels of employees; including senior management
  • Effective communications skills; written and verbal
  • Proficiently lead, guide, and mentor others
Role specific requirements
  • 5+ years’ experience with scripting languages and automation tools
  • Ability to multi-task and prioritize high priority tasks and projects
  • General knowledge of the Service Portfolio, Business Services, Configuration Items and Universal Configuration Management Database (UCMDB) and how that works
  • Ability to work through problems in a methodical approach while under pressure and within timelines
  • Proficiency in performing research and analysis within timelines
  • Ability to deliver monitoring solutions within timelines
  • Ability to effectively learn and understand Applications and how they function to effectively monitor them and/or how they impact Event Management
  • Ability to lead efforts and tasks
  • Knowledge and understanding of ISD processes such as PMLC, SDLC, Incident, Problem, Change
  • General knowledge of Oracle, MS SQL, DB2 databases, how to connect to databases, and run queries
  • General knowledge of the following: web services and how they work, XML requests/response, datapower, websphere, and JVMs
  • Capacity to work with all levels of employees
  • Expertise in effectively handle problems, complaints, provide user support, and deliver solutions
  • Ability to effectively deliver presentations on monitoring processes, technology, dashboards, infrastructure, and proposed solutions
  • Ability to develop and document Event Management processes, guidelines, standard operating procedures and implement them
  • Ability to formalize reports, charts, and action plans
  • Knowledge of Windows server administration and Linux/Unix administration and commands
Level 3
  • Receives instruction on specific objectives. Receives direction on unconventional and/or complex problems, and possible solutions. Receives a thorough review of completed work
  • Extensive experience that demonstrates knowledge of engineering discipline
  • Solid analytical, organizational, and problem solving skills
  • Strong ability to exercise initiative and good judgment, and make sound decisions
  • Certification in appropriate engineering discipline
  • Familiarity with financial industry
  • Understanding of ITIL concepts and/or certification in ITIL
  • Knowledge of NFCU operations, processes and procedures
  • Some experience in Perl, Python, Jython, SQL and Java a plus
  • Experience using Splunk, searching, dashboards, alerts, etc
Hours: Monday – Friday; 8:00am – 4:30pm
On-call and after-hours support required
Bank Secrecy Section
Remains cognizant of and adheres to Navy Federal policies and procedures, and regulations pertaining to the Bank Secrecy Act.


Equal Employment Opportunity

Navy Federal values, celebrates, and enacts diversity in the workplace.  Navy Federal takes affirmative action to employ and advance in employment qualified individuals with disabilities, disabled veterans, Armed Forces service medal veterans, recently separated veterans, and other protected veterans.  EOE/AA/M/F/Veteran/Disability