Executive Job Description | Highly motivated individual who will work as a part of a 24x7 Production Operations team with primary focus on monitoring. Technical expert for support and administration of monitoring tools for both infrastructure/application components across corporation. Selected candidate will be responsible for managing proactive monitoring and recovery of the distributed systems environment. The administrator will be responsible for application maintenance of the monitoring infrastructure, which includes, but is not limited to, performance tuning, capacity planning and forecasting. Participates in + makes recommendations for monitoring enhancement projects. Functions as liaison with the service center, divisional IT engineers and operations.
The position requires full understanding of the base OS and monitoring applications. The administrator will also is expected to lead continual improvement by utilizing ITSM based principles. This position will also work closely with management to understand current and future project needs and how they will affect the production environments.
POSITION-SPECIFIC RESPONSIBILITIES: Architect and manage the IT monitoring infrastructure for all system availability, system performance, and system capacity. Maintains enterprise event management infrastructure and console. Configures views, rules, and automates tasks associated with distributed systems (database, network, application, web, hardware and OS). Integrates alerts from various monitoring and diagnostic tools into a single 24x7 enterprise console. Works with other departments, locations, and new projects to deploy monitoring solutions to business systems. Generates, modifies, and troubleshoots scripts used for monitoring or automating systems recovery. Respond to all customer monitoring related technical issues and resolve rapidly Train, develop, and transfer your monitoring knowledge to IT Departments Troubleshoot all aspects of monitoring application issues effectively and understand the product configuration, flow and logging in detail. Design and maintain documentation and standards for efficient and consistent service delivery. Provide monitoring requirements on all new systems, applications, databases, and networks being implemented. Develop alerting and escalation processes, policies and procedures. Provide availability and trending reports and statistics to management. Participate in all post mortems as an interface to the Problem Manager. Functional understanding of Sarbanes-Oxley principals.
GENERAL PRODUCTION OPERATIONS RESPONSIBILITIES: The below statements are intended to describe the general nature and level of work being performed. They are not intended to indicate all responsibilities, duties, and skills for which a job holder may be held accountable, and may in some cases only represent a portion of those.
Think clearly and abstractly, in high pressure situations, to resolve critical issues Communicate in a clear, concise and professional manner. Articulate and provide convincing arguments around business/technical decisions or direction. Adapt response to a given situation and appropriately articulate technical language with the customer. Perform most tasks with minimal direction from management. Be flexible and willing to work off-hours. Liaison between architecture, application development, project managers, network infrastructure, server engineering, and support teams. Represent the team or group at technical and status meetings as required. Lead technical discussions to determine new application developments and/or changes within current application infrastructure. Assist in the development of application project-specific work plans and the level to which tasks can be performed. Timely communication of concerns and issues that may affect the production environment, project schedule and/or user acceptance of the developed application to product manager and project manager. Respond to critical events after business hours. Troubleshoot issues with existing or developed systems, and works with the appropriate resources to resolve them. Perform and assist others with root cause analysis and reporting. Collaborate and consult with key technical staff to design solutions that meet business requirements. Maintain status of issues and their resolutions in the ticketing system. Participate in relevant post mortems following high severity issues. Adhere to change management and system development lifecycle policies and procedures. Serve as resource to implement projects plans addressing objectives, timeframes, preliminary testing and post production support of complex software applications. Serve as a project resource to successfully complete assigned tasks on time and on budget. Perform audits of the environment to identify deficiencies of standards, performance and availability. Maintain and increase knowledge and proficiency of appropriate technical competencies. Stay current on new technology and propose ideas to the team and management that improve efficiency or effectiveness. Work with vendors to evaluate products and solutions that might improve or enhance the services provided by the team. Prepare and assist with business case documentation for capital investments in new technologies. Execute technical presentations to technical, non-technical, and executive audiences. Prepare status reports for current activities and projects. Prepare reports regarding the operational state of the environment for managerial review. Participate in on-call rotation to respond to critical events after business hours.
QUALIFICATIONS:
Bachelors degree in Computer Science, Management of Information Systems, or equivalent work experience. 3-5 years of demonstrated technical proficiency in supporting monitoring applications (ideally BMC ProactiveNet, BMC Performance Management, BMC PATROL, BMC BEIM/BEM/SIM, TM-Art, Nagios, Microsoft System Center Operations Manager, with experience in managing medium to large environments). 5-7 Years experience in Windows server support and administration, preferably enterprise environment (1000 servers or more). Solid understanding of ITSM/ITLD principals, certification preferred. Solid understanding of Microsoft Operating System (Server 2000/2003). - Intermediate understanding of Microsoft IIS (V 4, 5 and 6), HTTP, HTML, ASP.NET and SSL. Must have a very strong background in Microsoft technologies including Active Directory and Server operating systems Working knowledge of Linux and Apache web servers. Must understand performance monitoring and WMI/WBEM and performance counters. Demonstrated ability to work with team members from other areas to accomplish common objectives. Ability to carry out the responsibilities of this position with little direction. Must possess excellent written and oral communication skills. Able to prioritize and multi-task. Ability to be on call 24x7 and work occasional weekends on a rotating basis. Intermediate knowledge of standard scripting technologies (WMI, VBScript, Powershell, C#) 2 OF 4 Required Intermediate knowledge of relational databases (Oracle, SQL 2000, SQL 2005). Working knowledge of Visual Studio and SQL 2000 Reporting Services, and SQL 2005 Reporting Services Demonstrable proficiency with business productivity tools such as Microsoft Office, Microsoft Project, and Microsoft Internet Explorer. Must possess strong troubleshooting skills and analytical skills. Must be able to pass a psychological evaluation and background investigation.
please forward resume as a word attachment |