Data Storage Infrastructure

Data Centers operated by CESENET are based on several technologies. It goes about common Hierarchical Data Storage (HSM), object storage utilizing Ceph and disk arrays utilizing RAID technology, see below. Data Centers are located only within Czech Republic so all your data lies thereto. Distributed model of Data Centers allows us to increase data redundancy due to usage of geographical replicas. That model brings the advantage that your data are resistant to the data media failure, nature disaster or user mistake. Another advatage is that you can use the Data Center which is the closest to your location.

CESNET Data Storages utilizing folowing technologies:

HSM - Hierarchical Storage Management - Hierarchical model consists of two or more layer of data media. CESNET Data Storage utilizes two layers (disks and tape library). Frequently used data are then stored in fast media, (SAS „Serial Attached SCSI“ disks). HSM takes care of moving rarely used data to the slower/cheaper tier (Tape Library) from the disk array. HSM technology provides us with large capacity in comparison with disk-only data storage while keeping same investment cost.

Disk arrays - RAID technology - Redundant Array of Independent Disks – multiple disk array of independent disks. That is the method to secure your data against disk media failure. The core of RAID technology is storing your data onto more independent disks or disk arrays. In case of failure are corrupted data recalculated/rebuilt with specific algorythms. Our data storage utilizes RAID 6, so we can rebuild your data even if two disks from eight have failed.

CEPH - Ceph is the software solution for providing object storage solution. Object storage differs from common file-oriented data storages. Object storage stores data as an objects not like data blocks. Metadata is associated directly to the particular object. This system eliminates hierarchical structure utilized within common data storages. All data are then stored in flat name space which is easy scalable. Ceph is opensource system. It consists of object store, block store and distributed file system. Ceph supports object access (RADOS), exported block device and file access. Data stored within Ceph are replicated to multiple nodes, based on defined replication policy.

Data Center Type Location Raw capacity PB status
DU1 HSM Plzeň 3.6 decommissioned
DU2 HSM Jihlava 7.209 decommissioned
DU3 HSM Brno 3.52 decommissioned
DU4 HSM Ostrava 23.341 online
DU5 RAID Jihlava 9.36 online
CL1 CEPH Jihlava 10.56 online
CL2 CEPH Plzeň 25.433 online

Data Centers

DU4 Ostrava

Data Center in Ostrava is HSM-like and it has two tiers. Tier 1 are disk arrays consisting of 755pcs of 10 TB SAS disks. Tier 2 is represented by Tape Library TS4500 with twelve drives served with two robots. Tape Library has 1167pcs of tapes, each has capacity 15 TB. Tier 1 and Tier 2 are operated by GPFS file system. New and actively used data are on the Tier 1 and subsequently LIFO-like (Last In First Out) migrated to the Tier 2. The access to the data is served by three front end servers.

DU5 Jihlava

Jihlava Data Storage is consisting of three disk arrays. Each disk array has 38 RAID sets (RAID6 - 8+2) consisting of 10 TB NL (Near Line) HDD. Total usable capacity is then 9.36 PB. Metadata has own disk array consisting of SSD discs (144 TB). The access to the data is served by five front end servers.

CL1 Jihlava

Object storage CL1 is based on Ceph technology. It is consisting of 17 storage servers, each has 48pcs HDD with total capacity 10.56 PB. The access to the data is served by three front end servers.

CL2 Plzeň

Object storage CL2 is based on Ceph technology. It is consisting of 33 storage servers, each has 48pcs HDD (each 16 TB) with total capacity 25.34 PB. The access to the data is served by five front end servers.

Poslední úprava: 13.11.2020 12:12