High Availability & Resiliancy
BRYTECH expertise in High Availability, resiliency and eliminating single points of failure protect your critical business applications.
High Availability & Resiliency
A single point of failure (SPOF) is a potential risk posed by a flaw in the design, implementation or configuration of a circuit or system in which one fault or malfunction causes an entire system or application to stop operating. A SPOF could be lurking within the server configuration, storage device, network or SAN switches or even power feeds. A single point of failure can compromise the availability of workloads — or the entire data center — depending on the location and interdependencies involved in the failure.
Not all businesses can afford extensive IT departments, equipment and redundancies. The threat of SPOFs is greatest in these businesses. BRYTECH architects and engineers can work with you to identify SPOFs and come up with a cost-effective remediation plan that fits within your budget, protecting your most critical business applications.
High availability solutions provide fully automated failover to a backup system so that users and applications can continue working without disruption. HA solutions must have the ability to provide an immediate recovery point. At the same time, they must provide a recovery time capability that is significantly better than the recovery time that you experience in a non-HA solution topology.
BRYTECH architects and engineers strive for 99.9999% uptime, also known as “six nines”, or an average of 2.5 minutes of unplanned outage per year, cumulative. They key to “six nines” is the proper combination of hardware resiliency, high availability software, available resources, and properly tuning to detect and react to failure conditions in less than 60 seconds.
BRYTECH has extensive experience protecting some of the most critical workloads in the government sector. We can bring that same expertise to your business-critical applications, giving you a competitive advantage.
The difference between high availability and resiliency is best described by stating that high availability is recovery of applications and services after a failure whereas resiliency is ensuring the failure has no impact to application or services availability.
Examples of resilient design are Oracle RAC, a webserver farm with a load balancer front end, redundant SAN and IP fabrics, or disk volumes mirrored across two storage devices. In these examples, a RAC node, a webserver, a network fabric or a storage device can fail completely without impact to application availability. The goal of resilient design in IT infrastructure is 100% uptime. Both planned and unplanned outages do not impact critical business applications.
Increased availability comes at a cost in additional hardware and software licenses. That’s why we recommend our Availability Assessment to help you determine which applications require high availability, which require resiliency and which ones require neither. BRYTECH’s approach is to provide you the optimal level of availability based on your unique business needs while keeping costs predictable and manageable.
Disaster recovery (DR) involves a set of policies, tools and procedures to enable the recovery or continuation of vital technology infrastructure and systems following a natural or human-induced disaster.
Your investment in DR is dependent upon your business needs expressed as recovery point objective (RPO) and recovery time objective (RTO). RPO is the amount of data loss that can be tolerated. RTO is how long you can be without business-critical applications. These factors can be expressed in hours, minutes, or even seconds. The target will generally dictate the technology required; the shorter the time frames, the more costly the solution.
BRYTECH recommends our DR assessment, performed by one of our senior engineers. We work with business stakeholders to define the RPO and RTO for every business application and process. We guide you through a cost/benefit analysis of available technology and prepare an implementation plan.
You may choose to execute that plan yourself or we can easily transition from the assessment to a professional services engagement. BRYTECH can help you achieve a successful DR implementation including run books and a live DR test, certifying the processes of declaring a disaster and recovering back to normal operations.
Every business should have a business continuity plan (BCP). A business continuity plan is a proactive document to avoid and mitigate risks associated with a disruption of operations. It details steps to be taken before, during and after an event to maintain the business functions.
A business continuity plan is different from a disaster recovery (DR) plan, although DR is part of business continuity plan – as is high availability (HA). For example, your BCP may dictate that fall back from an HA failover either take place immediately after repairs have been made, during off shift or at the next change window.
A BCP will dictate the types of maintenance of your IT resources. For some, 5×8 may be sufficient, others may require 24×7 with a 4-hour response time by a certified engineer. You most critical applications may even require a custom support contract that includes keeping spare parts within your data center.
BRYTECH architects have years of experience in documenting business continuity plans. Let us assist you in developing this critical business resource.