Website Under Construction

Information Technology Service Management - ITSM for Space Exploration

HubBucket Operations Division

HubBucket Operations | Information Technology Service Management - ITSM

ITOps, or IT operations, refers to the processes and services administered by an organization's IT staff to its internal or external clients. It’s one of four functions — along with applications management, technical management and service desk — defined in the IT Operations Management framework in ITIL (Information Technology Infrastructure Library).

Every organization that uses computers has a way of meeting employees’ or clients’ IT needs, whether or not they call it ITOps (information technology operations). In a typical enterprise environment, however, ITOps is a distinct group within the IT department. Every company organizes its IT resources differently, but an ITOps team is generally composed of a group of IT operators and headed by an IT operations manager who oversees all the activities for which ITOps is responsible.

ITOps plays a critical role in accomplishing business goals. Among other things, ITOps helps maintain a stable and reliable IT ecosystem and ensures that IT empowers the organization’s employees and management to achieve the business’s desired outcomes.

But how that role is carried out is changing as businesses increasingly migrate from the data center to the cloud. In this article, we’ll look at how traditional ITOps is evolving in the cloud age through the emergence of DevOps, NoOps and CloudOps; how AIOps can supercharge IT operations through automation; and what ITOps can bring to modern cloud environments.

Information Technology Service Management - ITSM

Information Technology Service Management - ITSM is simply how IT Teams manage the End-to-End delivery of IT Services to customers. This includes all the processes and activities to design, create, deliver, and support IT services. The core concept of ITSM is the belief that IT should be delivered as a service.

Information Technology Service Management - ITSM aligns an IT Team’s goals with the broader objectives of the business, and that their actions support the overall mission.

Why is Information Technology Service Management - ITSM Important?

Information Technologies now encompass and incorporate tasks and responsibilities from across the entire organization. Managing these services is an ongoing challenge, and customers expect businesses to be up to the task. Businesses depend on ITSM to effectively coordinate these nearly countless tasks and processes, while ensuring that they are providing real value to the customer.

Because Information Technology Service Management - ITSM is a collection of policies and processes for the management and support of IT services—throughout their entire lifecycle—ITSM helps improve an enterprise’s efficiency and increase employee productivity.

What are Information Technology Service Management - ITSM Processes?

Technology is an integral part of IT processes, but it should not necessarily be the primary focus. ITSM is more encompassing, expanding to include business objectives in addition to traditional IT objectives. As a result, IT service management improves efficiencies across the board. ITSM does this in the following ways:

Incident Management

Manage and track incidents (a disruption to normal operations), as well as service requests for new services, software, or hardware. ITSM incident-management handles the entire incident-management process to restore service to customers as quickly as possible.

The ability to prioritize incidents and service requests according to business impact allows staff to focus their efforts where they can be most effective.

Problem Management

Streamline incident investigations, from detection to eradication, through the ITIL sub-processes of problem control, error control, and proactive problem analysis.

Problem-management processes remove defects from the IT infrastructure, eliminate recurring incidents, and stabilize the environment.

Change and Release Management

ITSM tracks scheduled and planned infrastructure changes, including process management and planning capabilities. Change implementation is quicker and more consistent, with risks and errors minimized.

This application includes a built-in approval process for change and release management. The change module is completely integrated with incident, problem, and service-level management applications.

Service-level Management

Track service-level commitments with customers and from vendors, so management can pinpoint weaknesses and take corrective action.

Request Management

Manage and follow up on service requests, including requests to reset passwords, install new workstations, update personal data, or access information.

Request management helps ensure that important requests are always moving forward.

Configuration Management

Track all configuration items in your IT system. Identify, verify, and maintain critical configuration information for hardware, software, personnel, and documentation.

Configuration Management gives your IT teams a reliable repository for IT component information and establishes clear relationships between IT infrastructure components and services.

Continual Improvement Management

Request improvement opportunities and implement phases and tasks to track performance goals and measure success. Connect improvement to your overall business strategy and improve service efficiency.

Workflow and Talent Management

Place the right people with the right skills and knowledge into the roles that best suit them. Support your business objectives by correctly positioning the most talented employees to provide the best possible service.

Information Technology Infrastructure Library - ITIL

The Information Technology Infrastructure Library - ITIL is  a  framework for IT service management  that provides detailed  best  practices for IT functions to align with an organization’s business outcomes. ITIL stands for IT infrastructure library.

Information Technology Infrastructure Library - ITIL is one of several best-practices frameworks for ITSM, providing the necessary tools and techniques to deliver IT services effectively. The Information Technology Infrastructure Library - ITIL framework has been through several iterations, with the latest referred to as ITIL 5.

What are the 5 stages of ITIL?

There are five stages of the ITIL service lifecycle:

  • Service Strategy
  • Service Design
  • Service Transition
  • Service Operation
  • Continual Service Improvement

What are the four P's in Service Strategy?

There are four building blocks of the Service Strategy stage:

  • Perspective: Describe your vision or direction for your services.
  • Position: Compare your strategy with competitors to understand how to best position yourself in the market.
  • Plan: Identify the actions you will take to achieve your goals and overarching vision.
  • Pattern: The fundamental and ongoing actions your organization will take to run smoothly over time. This includes processes, policies, schedules, budgets, and management systems.

These four P’s guide your Service Strategy and play an integral role in how you outline and implement your service plans. Strategies that lack any of these components are less likely to succeed.

Why is IT operations important?

IT operations is important because it has end-to-end responsibility for the services provided by the IT organization, systems and infrastructure that support an organization’s business processes. It is tasked with maintaining the operational stability of the organization while at the same time supporting new initiatives to push business to the next level.

What does an IT Operations Team do?

ITOps provides high-level technological guidance and performs routine daily tasks to maintain the organization’s IT infrastructure. ITOps may be tailored to suit each organization’s needs and resources, rendering a uniform “to-do list” of tasks impractical. As a function, however, ITOps can be broken down into three key areas of responsibility. Which and how many of these tasks any individual ITOps team is responsible for will vary from one organization to another. Those tasks may include:

Network Infrastructure:

  • Configuring and managing all networking functions for internal and external IT communications
  • Configuring and managing telecommunication lines
  • Managing firewall ports to allow the network to communicate with outside servers
  • Providing authorized users secure remote access to the organization’s network
  • Monitoring network health and performance, detecting anomalies, and preventing or quickly resolving issues, which may include building and managing a Network Operations Center (NOC, pronounced “knock”), a centralized physical location from which ITOps teams can continuously monitor a network

Server and Device Management:

  • Configuring, maintaining and managing servers for infrastructure and applications
  • Managing network and individual storage to ensure they meet application requirements
  • Setting up and authorizing email and file servers
  • Provisioning and managing company-approved PCs
  • Provisioning and managing cell phones and other mobile devices
  • Managing licensing and desktop, laptop and mobile device software

Computer Operations and Help Desk / Service Desk:

  • Managing data center locations and equipment
  • Operating the help desk
  • Creating, authorizing and managing all user profiles on organizational systems
  • Providing network configuration auditing information to regulatory agencies, business partners and other outside entities
  • Ensuring high availability of the network and disaster recovery plans
  • Alerting users when a major incident impacts network services
  • Instituting regular backups to facilitate data recovery when needed
  • Maintaining the ITIL for the organization

Within each of these areas, ITOps team members are also responsible for managing and working with vendors and outside contractors; procuring and paying for all the hardware, software and services used for the network and its applications; project management and deploying upgrades and fixes to maintain the health and performance of the network.

What is the difference between ITOps, DevOps and NoOps?

 

ITOps, DevOps and NoOps are three different approaches to structuring an organization’s IT teams. Each has different responsibilities and goals, and while both ITOps and DevOps working groups have been widely adopted by enterprises, NoOps is still mostly theoretical. We’ll take a closer look at each one and how they relate to each other.

ITOps: ITOps’ broad and sometimes nebulous purview can make it seem that it covers anything IT-related. It’s true that ITOps activities can vary considerably from organization to organization, but in all cases, they fall under the responsibility of delivering and maintaining the technology needed to run a business. In practice, that includes tasks such as maintaining networks, managing data centers, ensuring security and regulatory compliance, managing the help desk, licensing and managing software and other tasks that empower workers and support daily business operations. Notably, it does not include program and application development and related tasks.

DevOps: DevOps refers to an approach to IT delivery that combines people, practices, and tools to break down silos between development and operations teams. But DevOps also refers to a distinct IT role responsible for developing, implementing and maintaining custom applications for internal or external use.

As its name indicates, DevOps brings together the roles of development and IT operations. Following a set of DevOps practices, DevOps teams accelerate the development of applications and services with a more responsive approach to the management of the IT infrastructure, so they can deploy and update IT products at the speed of the modern marketplace.

ITOps and DevOps are founded on different and opposing principles. ITOps — charged with ensuring a stable and secure infrastructure that adheres to standards and regulatory requirements — favors a precise approach that minimizes risk. DevOps revolves around innovating and optimizing apps while shortening the software development life cycle and speeding up time to market.

Not surprisingly, these different imperatives sometimes come into conflict. ITOps’ steady, linear approach to developing and maintaining infrastructure makes it difficult to implement changes quickly and slows the development process. DevOps’ need for speed sometimes prompts teams to work around ITOps due to time constraints, potentially creating risks to system security and stability. For this reason, a DevOps approach requires that ITOps abdicate some of its responsibilities and share others with DevOps to help development teams achieve their delivery goals.

NoOps: NoOps stands for No IT Operations and refers to an evolution of DevOps that completely removes IT operations from the software development environment. Proponents claim that infrastructure maintenance tasks can be fully automated, eliminating the need for an in-house ITOps team. NoOps isn’t a platform but relies on several cloud technologies such as AI and machine learning to be put into practice.

Advocates say NoOps offers a few potential benefits:

  • Reduced likelihood of human error — NoOps minimizes human errors from manual functions and their attendant downtime, along with other incident management tasks, because a fully automated system wouldn’t require any human mediation.
  • Greater speed and efficiency — It eliminates the conflicts between ITOps’ stability-and-security approach and DevOps’ drive for innovation, allowing development teams to work with more speed and agility.
  • An elevated role for ITOps — NoOPs relieves ITOps of its operational responsibilities, allowing IT to take on a more strategic role, working on technological advancements and ensuring DevOps teams get the tools they need.

At this point, NoOps is still more of a concept than a practical solution. Some believe that removing ITOps from the software life cycle process would put too much responsibility on developers and impede production. Others say that automating every function of ITOps simply isn’t realistic given the complexity of modern systems. In the near future, it’s more likely that certain segments of operations will be automated while other areas will necessarily be performed by humans.

Development and Operations - DevOps

Similar to ITIL, DevOps is an IT framework. But where ITIL helps streamline service management with a heavy focus on customer satisfaction, DevOps is a philosophy for IT development and delivery. It unites development and operations teams, with the goal of improving communication and collaboration within an organization.

DevOps allows for continuous application delivery, reduced expenses, shorter development cycles, and improved cooperation between departments. DevOps often incorporates Site Reliability Engineering - SRE, which further supports DevOps goals by allowing engineers to automate operational tasks and address incident management.

Development and Operations - DevOps is defined as a software engineering methodology which aims to integrate the work of development teams and operations teams by facilitating a culture of collaboration and shared responsibility.

What is Observability?

Observability is the measuring of the state of a system through its outputs. The term originated from control theory, which deals with the describing and understanding of self-regulating systems, but it has increasingly been applied to distributed IT systems to improve their performance. In this context, observability uses three types of telemetry data — metrics, logs and distributed tracing — to provide deep visibility into distributed systems and allow teams to answer a multitude of questions to improve the system’s performance.

The enterprise has rapidly adopted cloud infrastructure services offered as microservices, serverless and containers. Tracing an event to its cause in these distributed systems is extremely complex. Observability essentially makes modern systems more monitorable, allowing teams to easily find and connect the components of a complex chain and trace them back to their cause. Further, it empowers every role, from sysadmins to ITOps analysts to developers, to have visibility into their entire architecture, an essential capability in the era of microservices.

Observability is needed because it allows you greater control over complex systems. Simple systems have fewer moving parts, making their stability easier to manage. Distributed systems have a far greater number of interconnecting parts, increasing the number and types of failures that can occur. Distributed systems also produce more “unknown unknowns” than simpler systems. And because monitoring is predicated on “known unknowns,” diagnostics in these complicated environments become even more challenging.

As an exploratory approach that lets you ask questions about your system's behavior as issues arise, observability is better suited for the unpredictability of distributed systems. “Why is X broken?” “What is causing latency right now?” or “Is this issue impacting all Android users right now or just some of them?” are just a few of the questions effective observability can answer.

What is AIOps?

 

AIOps stands for Artificial Intelligence - AI for IT Operations and refers to the application of AI analytics and machine learning to automate and improve IT operations. 

It’s become increasingly difficult for IT operations to manage networks as they grow larger and more complex. Traditional operations management tools and practices can’t keep pace with ever-growing volumes of data from multiple sources generated within these varied network environments. AIOps was developed to address these challenges, offering two significant advantages in cloud environments:

1. It brings together data from multiple sources. Traditional IT management solutions were designed for static physical systems, not the mix of on-premises and private, managed and public cloud services most enterprise organizations use. As a result, they can’t process the high volume, variety and velocity of data produced by these dynamic networks. Instead, they consolidate, aggregate and average data, which compromises data fidelity. AIOps platforms are designed for today’s networks with an ability to capture large data sets across the environment while maintaining data fidelity for comprehensive analysis.

2. It simplifies data analysis. AIOps platforms can collect all formats of data, despite varying velocity and volume. The platform then conducts automated data analysis that predicts and prevents future issues while identifying the cause of existing issues. It can also suggest solutions, automate responses and alter its algorithms to improve how it handles future issues.

In practice, AIOps is a three-step process — observe, engage, act — performed in a continuous cycle:

  • Observe: First the AIOps platform processes real-time data from a variety of sources, including traditional IT monitoring and log events, among others. The AI algorithms use anomalies in the data to automatically detect significant issues, which the platform then analyzes, clustering similar issues.
  • Engage: The AIOps platform alerts the relevant IT teams to the anomalies. Because they’re grouped together by type, there are fewer notifications.
  • Act: AIOps platforms can automate workflow routing with or without human intervention, learning from the IT team’s responses to become more accurate with time. Ultimately, it may learn to resolve issues before the business becomes aware of them and before they impact end users.

What is CloudOps?

 

CloudOps is short for cloud operations, a blanket term describing the processes of managing and delivering cloud computing infrastructure services to either internal or external users. The goal of CloudOps is to keep cloud computing platforms — and their applications and data — healthy and functioning for the long term. CloudOps relies on continuous operation, a DevOps approach to running cloud-based systems that eliminates the need to ever remove part or all of an application from service. This requires operational automation to sustain zero downtime. CloudOps arose in response to the many unique characteristics of cloud-based systems:

  • The cloud is distributed, stateless and scalable — Cloud allows you to scale capacity on demand, spinning up new servers or storage devices and shutting down unnecessary ones as needed. During high-traffic times, you can set rules to auto-provision servers to keep up with demand and maintain uptime. Also, with cloud resources distributed globally, CloudOps allows you to monitor key performance metrics and respond from anywhere, increasing the flexibility and scalability of the underlying applications and infrastructure.
  • The cloud is infrastructure agnostic — The underlying infrastructure is abstracted from the platform and applications.
  • The cloud is fault and latency tolerant — Because Cloud-Centric Applications and Services can abstract themselves from the underlying infrastructure, they are less prone to latency and error.
  • The cloud is highly automated — Team members can program automated functions across every part of the software development life cycle and use your infrastructure to perform commands and tasks based on monitoring thresholds and other key performance metrics. This creates self-healing systems that can fix common operational issues without impacting applications or users.
  • The cloud is active-active — Active-active cloud networks use multiple independent processing nodes where each node has access to several replicated databases for a single application. Applications can pull the necessary data from different sources when a server goes down, leading to less downtime and fewer outages.
  • Cloud applications share resources — Cloud applications share services without being bound together.
  • Cloud data is redundant — The cloud enables you to store data in multiple physical locations, providing more failover options, a more resilient data pipeline and greater protection from data loss.

CloudOps aims to maximize the benefits of these cloud attributes by formalizing practices and processes for operating in a cloud-based system. The details will look different for each organization, but CloudOps generally addresses the same operational issues as traditional ITOps — including resource management, threat prevention and compliance — so that DevOps can get the most out of the cloud while ensuring speed, security and operational efficiency.

The Future of ITOps is in the Cloud (Cloud Computing)

Today, IT teams are charged with providing stability, security and efficiency in increasingly dynamic and connected environments. The prevalence of multi-cloud hybrids, the need for observability, and the velocity and agility required by DevOps are straining the limits of conventional IT approaches and tools. The result is inevitably compromised service levels and unhappy users.

It’s time for IT operations to embrace CloudOps and AIOps, which provide the speed and flexibility needed to manage cloud environments more effectively. These tools aren’t evolutions of ITOps but rather a whole new way of performing IT operations. And while implementation often requires an organization-wide culture shift, the net benefits — greater flexibility, security and reliability — are too great to pass up.