Tier 4 Data Center Cooling System Design
Following the series of posts related to the
Data Centers Cooling system design and based on the operation requirements developed and described in previous post,
let's now dive into design details of a Tier IV rated system according to
Uptime Institute requirements.
The HVAC design of datacenters that follow the
above requirements can be managed by assuring design, construction and
operation compliance with one of the most industry recognized standards. That
is the Uptime Institute Tier classification.
The Tier classification of a data center can
take four rates, namely:
Tier I,
Tier II, Tier III or Tier IV.
The Tier class can be unique for a data center
and describes criteria to differentiate four classifications of site
infrastructure topology based on increasing levels (as numbers increase I to
IV) of redundant capacity components
and distribution paths.
Scope of this article is to give the HVAC
design engineer initial guidelines on how the topology of the DC cooling system
should be structured in order to comply with the UI Tier standards
requirements. The standard of reference is Tier Standard: Topology [2] which
can be requested for download from here.
Carry on with reading this post if interested
on a Tier IV data center
requirements or check Tier I & II Cooling System
design or Tier III CoolingSystem design
respectively.
1 - Reference Standards and regulations
Guidelines, requirements and design criteria in
use below are in accordance with the following references:
- ASHRAE – Design Considerations for Datacom Equipment centers / 2nd ed. 2009,
- Uptime Institute – Data Center Site Infrastructure, Tier Standard: Topology / 2018,
Tier IV – Design Criteria
According to Tier Standard: Topology [2] /
clause 2.4.1, the fundamental requirements for a Tier IV – Fault Tolerant Site
Infrastructure are:
(a) A
Fault Tolerant data center has multiple, independent, physically isolated
systems that provide redundant capacity components and multiple, independent,
diverse, active distribution paths simultaneously serving the critical
environment. The redundant capacity components and diverse distribution paths
shall be configured such that ‘N” capacity is providing power and cooling to
the critical environment after any infrastructure failure.
(b) All
IT equipment is dual powered with a Fault Tolerant power design internal to the
unit and installed properly to be compatible with the topology of the site’s
architecture. Transfer devices, such as point-of-use switches, must be
incorporated for critical environment that does not meet this requirement.
(c) Complementary
systems and distribution paths must be physically isolated from one another
(Compartmentalized) to prevent any single event from simultaneously impacting
both systems or distribution paths.
(d) Continuous
Cooling is required. Continuous cooling provides a stable environment for all
critical spaces within the ASHRAE maximum temperature change for IT equipment
as defined in Thermal Guidelines for Data
Processing Environments, Third Edition. Additionally, the Continuous
Cooling duration should be such that it provides cooling until the mechanical
system is providing rated cooling at the extreme ambient conditions.
(e) Twelve
hours of on-site fuel storage for ‘N’ capacity.
A few important points
Before dive into details of the cooling system
design requirements in compliance to Tier criteria, keep first in mind a few
more important points from the aforementioned standard [2].
Every data center subsystem and system must be consistently deployed with the same site uptime objective to satisfy the distinctive Tier requirements.
This standard requirement makes clear that
certain Tier rating requirements shall be applicable to all mechanical,
electrical and building systems that serve the IT space. So whatever we will
discuss below about the cooling system for a Tier IV data center design are
equally applicable and essential for the on-site power production, UPS and
storage equipment, fuel tanks and water storage (evaporative cooling) systems
as well.
The Tier topology rating for an entire site is constrained by the rating of the weakest subsystem that will impact site operation. For example, a site with a robust Tier IV UPS configuration combined with a Tier II chilled water system yield a Tier II site rating.
So even if the team is quite careful and
especially concerned about the cooling system design, but fail to implement the
same criteria into the electrical or fuel supply systems, the overall rating of
the data center will be lower than the expected.
Note that in case of water systems there are also requirements for the makeup water installation and the storage tanks.
In general a Tier IV data center allows all capacity components to be maintained or repaired with disrupting the operation of the critical environment. The same happens for the distribution paths (piping) and individual elements (fittings, valves and acccessoried). The facility controls system is able to autonomously detect isolate and contain any failure.
The operational risk that this configuration brings to the computer facility owner is low but there still exist a risk of disruption during the scheduled maintenance on redundant components.
I hope that you find this post interesting and educative.
2 - Design of Tier IV Cooling system
The cooling system design of a Tier IV rated
data center shall comply with the following requirements:
Redundant Capacity
components –
meaning that there is need for redundancy (backup) of any equipment of the
cooling system including and not limited to components like:
- CRAC / CRAH units,
- Chillers, chilled water pumps,
- Cooling towers, condensing pumps,
- AHUs,
- Split type DX cooling units,
- Makeup water storage tanks and pumps.
The selection of number of equipment that will
serve the plant shall be so that ‘N’ capacity is providing cooling to the IT
space after any infrastructure failure. This is what we call “N” units
available after any failure.
Multiple independent
distribution paths
– meaning that the following distribution networks:
- Chilled water piping,
- Condensing water piping,
- Refrigerant copper piping for DX cooling systems,
- Makeup water piping,
shall be designed in a way that the transferred
medium (water, fuel, refrigerant) is able to reach any system unit (CRAC, AHU,
Chiller, Cooling tower) from two paths. The two distribution paths shall be
independent and always active, meaning that simultaneously and automatically
will be ready and available to serve the critical environment.
Refer to fig. 01(a) to 01(c) below where different
distribution paths are compared for better understanding of a Tier IV
distribution topology options. The operation, complexity and cost impact of
each topology shall always be considered along with the requirement of fault
tolerance which is explained further below.
Fig.01(a) – Case of a chilled water system
supplying N+1 CRAC units through a two pipe chilled water system compromising a
single non-redundant distribution path. This is a usual and very simply
topology which does not comply with neither the multiple independent
distribution paths nor fault tolerant requirements of a Tier IV facility.
Fig.01(b) – Case of the same chilled water
system supplying N+1 CRAC units through a four pipe chilled water system, which
actually is a group of two independent distribution paths, always active and
ready to serve the critical environment. This is a piping system that can be
part of a Tier IV cooling installation. In order to be fault tolerant a series
of automatic isolation valves are provided for each CRAC unit and the two
distribution paths. So in case of any unit failure the system can automatically
isolate that unit (closing the automatic valves) and activate the redundant.
The same will happen with the case of failure in one of the fittings or parts
of the distribution path. The damaged distribution path will be isolated
through automatic valve arrangements A or B (red boxes) and all the active
units will shift from one network to the other automatically by the right
configuration of the individual valves.
Fig.01(c) – Case of the same chilled water
system supplying N+N CRAC units through two independent distribution paths,
always active and ready to serve the critical environment. Two totally separate
systems which provide under any conditions N capacity to the critical
environment. Any type of failure at system A will be detected and automatically
shift to system B. Activation will happen through automatic valve arrangements
A and B (red boxes). This is a piping system that can be part of a Tier IV
cooling installation.
Concurrently
Maintainable – according
to [2] Tier Standard Topology / clause 2.4.2 (c), Each and Every capacity component
(CRAC, AHU, Chiller, Cooling Tower, pump) and distribution paths and element (valves, filters, check valves,
regulating valves, meters, instruments, sensors) can sustain a failure, error,
planned or unplanned event without impacting the Critical Environment or IT
processes.
Referring back to Fig.01 (b) and (c), you can
easily check the maintainability concept. In case of system topology (b), the
maintenance or replacement of any piping section, fitting or isolation valve
can be easily done by shifting the chilled water flow to the other piping
system. The same happens for any CRAC unit, since the configuration provides
capacity components redundancy.
In case of system topology (c), things are more
obvious because any element maintenance or replacement at system A is a straight
forward process, after shifting the operation to system B.
Two more requirements not demanded for Tier III
rated data centers are the following:
Fault Tolerant – according to [2] Tier Standard
Topology / clause 2.4.2 (e), any potential fault must be capable of being
detected, isolated, and contained while maintaining N capacity to the critical
load.
Compartmentalization – according to [2] Tier Standard
Topology / clause 2.4.1 (c), complementary systems and distribution paths must
be physically isolated from one another (Compartmentalized) to prevent any
single event from simultaneously impacting both systems or distribution paths.
Consider the pumps system of Fig.02. These
pumps serve a piping network where the flow demand is 200m3/h. The
design provides a set of three pumps each one sized with a maximum flow
capacity of 100m3/h. It comes out that we have a 2+1 configuration
where two pumps continually run and the third one is backup. According to Tier 4
requirements, the pump as a capacity component is redundant. Also the
concurrent maintainability of each and every component in the system is
achieved by providing two valves in the piping section between each pump.
Additionally to that, the system must also be fault tolerant and so the
infrastructure controls system supervise the pump and autonomous valves status.
In case of failure the controls system demonstrates autonomous response by
detection, isolation and containment of the failure element while sustaining
the critical environment.
Physical isolation between the pumps is
achieved with walls (dry, block or concrete type), that follow the building
fire safety and ex-proof requirements, provide the compartmentalization. Note
that each compartment shall contain no more than the number of redundant
components.
For example refer to fig.02 and assume that pumps
P01 and P03 are running and valve A suddenly fail. The controls system will:
- Receive an error signal from the valve informing about the failure (Detection),
- Automatically send signal two valves B, C to turn off and isolate valve A (Isolation),
- Stop running P03 and activate P02 to sustain the critical environment. Keeping this way, the affected pump within compartment 03 and protecting the rest of the system (Containment).
The same happens with failure of all the valves
and elements of this arrangement. This is a fault tolerant Tier IV compliant
pumps arrangement.
Fig.02
– A set of three pumps in an N+1 configuration that serve a redundant
distribution path (water flow and return is served from both sides). The
provision of two valves between each section assures a concurrently
maintainable system. Valves are also automatically controlled making the system
fault tolerant and each pump contained within a space to be compartmentalized. A
pump system arrangement which is fully compliable with Tier 4 requirements and
is applicable to chilled, condense and make up water systems.
The same concept can also be extended to any piping
system as shown in Fig.01 (b) and (c) described before. The arrangement of
valves is in a way that any part of the system can be isolated and maintained,
without any downtime of the cooling system. Again try to check each and every
component and element and you will realize that there is always a set of valves
that can achieve isolation of that component or element. Further to that all
the actions can be done autonomously since the valves are automatic assuring
that the system is also fault tolerant.
Compartmentalization of distribution paths has
also to follow the guidelines below:
- Chilled water piping systems that run within one and only one IT room do not have to be compartmentalized.
- CRAC units installed in a separate units corridor and piping systems that run inside the same corridor do not have to be compartmentalized if and only if the corridor is a separate fire compartment.
- Complementary systems and piping networks outside of the building don not have to be compartmentalized.
- Chilled water piping systems that serve more than one IT rooms have to be compartmentalized.
3 - How to apply these requirements into the design
These requirements can be applied in several
cooling type systems like shown in the schematic representations below. A Tier
IV cooling system can use split type air conditioning systems with refrigerant
(Fig.03), or a water type system with CRAC units inside the IT space (Fig.04)
or even more advanced solutions with chilled water and air handling units (AHU)
that blow air within the IT spaces (Fig.05).
These systems configurations are quite common
in the data center industry where redundancy of the 'capacity components' combined
with the proper design of distribution paths achieve concurrent maintainability
and system reliability. Adding autonomous control to achieve fault tolerant
operation with compartmentalization gives the system Tier IV rate.
For your understanding explore the schematics
below and try to identify all the principles discussed before related to capacity components, distribution paths, system maintainability, fault tolerance and compartmentalization.
Fig. 03 - A Tier IV split system CRAC units
developed with DX type systems make use of 'N+1' number of CRAC units in a
configuration of N units running and at least 1 redundant. The respective
external condensing units and refrigerant pipes all together combine a system.
Fig. 04 - A Tier IV water system CRAC unit
contain single or more running 'capacity components' all in a 'N+1'
configuration. This includes a combination of water chiller, cooling tower,
water pumps all together running at the same time and supplying chilled
water to 'N' number of running CRAC units. Also the system has at least one
more piece of redundant component for every type of capacity equipment. So
there is a redundant water chiller, cooling tower, pumps and CRAC unit. Two 2-pipe
chilled water systems connect all the chiller and CRAC units as 'redundant
distribution paths'. There are provisions of automatic valves to allow the
autonomous operation of the system with proper shut down / activation of equipment and piping sections
in order to assure continuous cooling of the critical environment. Each and
every component or element of the system can be maintained or replaced without
system shutdown. Compartmentalization of capacity components inside the plant
room and separation of the distribution paths protect the system integrity and
allow isolation and containment of failures.
Fig. 05 - A Tier IV water system Air Handling
unit contain single or more running 'capacity components' all in
a 'N+1' configuration. This includes a combination of water chiller, cooling
tower, water pumps all together running at the same time and
supplying chilled water to 'N' number of running Air Handling
units. Also the system has at least one more piece of redundant component
for every type of capacity equipment. So there is a redundant water chiller,
cooling tower, pumps and AHU. Two 2-pipe chilled water systems connect all
the chiller and CRAC units as 'redundant distribution paths'. There are
provisions of automatic valves to allow the autonomous operation of the system
with proper shut down / activation of
equipment and piping sections in order to assure continuous cooling of the
critical environment. Each and every component or element of the system can be
maintained or replaced without system shutdown. Compartmentalization of
capacity components inside the plant room and separation of the distribution
paths protect the system integrity and allow isolation and containment of
failures.
Note that in case of water systems there are also requirements for the makeup water installation and the storage tanks.
In general a Tier IV data center allows all capacity components to be maintained or repaired with disrupting the operation of the critical environment. The same happens for the distribution paths (piping) and individual elements (fittings, valves and acccessoried). The facility controls system is able to autonomously detect isolate and contain any failure.
The operational risk that this configuration brings to the computer facility owner is low but there still exist a risk of disruption during the scheduled maintenance on redundant components.
I hope that you find this post interesting and educative.
- If you did like it please share it through the social so that more people can have access to it.
- If you have any questions or would like to discuss any special case, please leave your comments below. I will be happy to answer!
Hi There
ReplyDeleteTanks for the informative article.
At the moment, I am looking for a chilled water system which need to be Tier IV. Kindly advise what need to be considered for chillers ? We want to avoid UPS feed to whole Chiller as it is very costly. We are looking for UP supply to chiller pump's/control + energy storage tank in the system.
This is thorough and deeply researched. Thanks.
ReplyDeleteCould you please explain how to size for the CRAC units in precision DX-based cooling?
Just compared this article with previous Tier 3 article. No differences found!
ReplyDeleteThanks for taking the time to discuss this, I feel strongly about it and love learning more on this topic. If possible, as you gain expertise, would you mind updating your blog with extra information? It is extremely helpful for me. rent a chiller
ReplyDeleteWow, this is really interesting reading. I am glad I found this and got to read it. Great job on this content. I like it. Newcastle ducted air conditioning
ReplyDelete