LIBRISTO
LIBROAMANTO
obvezno
Pridružite se zajednici ljubitelja knjige iz cijelog svijeta i ostvarite mnoštvo pogodnosti. Izradite besplatni račun
0
Besplatna dostava Overseas kurirskom službom iznad 69.99 €
DPD kurir 3.99 Pošta 4.99 Overseas 4.99 Box Now 4.49 GLS 4.99 DPD točka 3.49 GLS paketomat 3.99

Besplatna dostava putem Box Now paketomata i Overseas kurirske službe iznad 69,99 €.

HPC Observability

Production Monitoring, Profiling, and Site Reliability for Linux Clusters, GPUs, and Parallel Storage at Scale

Jezik EngleskiEngleski
Knjiga Meki uvez
Knjiga HPC Observability M. Edwards
Libristo kod: 52747456
Nakladnici Independently published, svibanj 2026
HPC Observability is a hands-on guide for the engineers and administrators who keep high-performance... Cijeli opis
? points 49 b Novo Novo
20.34
Očekivane nove zalihe Dobivanje novih zaliha 02. 06. 2026

30 dana za povrat kupljenih proizvoda

HPC Observability is a hands-on guide for the engineers and administrators who keep high-performance computing systems running reliably at scale. It brings together the operational knowledge scattered across vendor documentation, conference papers, and forum threads into a practical framework for turning HPC telemetry into actionable insight.

Modern HPC environments - Slurm clusters, GPU-dense AI systems, Lustre and GPFS storage, InfiniBand and Slingshot fabrics - generate more data than any team can manually interpret. The result is wasted node-hours, failed simulations, hidden storage bottlenecks, fabric congestion, and GPU failures that surface only after days of runtime.

This book provides a complete operational approach to HPC observability through a five-layer model covering hardware, operating systems, schedulers, applications, storage, and networks. Readers learn how to build metrics pipelines for clusters from hundreds to tens of thousands of nodes; monitor GPUs with DCGM; profile MPI and OpenMP applications with PAPI and Score-P; diagnose storage and network slowdowns; create useful dashboards and alerts; and run effective incident response and post-mortems.

Drawing on peer-reviewed research and real production experience, the book includes original diagrams, practical workflows, reference material, Prometheus alert examples, and a step-by-step lab environment for learning on a laptop.

Written in the voice of a senior HPC engineer rather than an academic text, HPC Observability assumes readers already understand the fundamentals and focuses instead on the operational realities of running large-scale Linux, AI, and research-computing infrastructure.

Glumica & Poliglotkinja
EWA KASP za
Pusti video
Ewa Kasp
Libristo ima najveći izbor literature na stranim jezicima. Zato svoje knjige kupujem ovdje.

Informacije o knjizi

Puni naziv HPC Observability
Autor M. Edwards
Jezik Engleski
Uvez Knjiga - Meki uvez
Datum izdanja 2026
Broj stranica 164
EAN 9798198765443
Libristo kod 52747456
Težina 397
Dimenzije 216 x 280 x 9
Poklonite ovu knjigu još danas
To je jednostavno
1 Dodajte knjigu u košaricu i odaberite isporuku kao poklon 2 Zauzvrat ćemo vam poslati kupon 3 Knjiga dolazi na adresu poklonoprimca

Prijava

Prijavite se na svoj račun. Još nemate Libristo račun? Otvorite ga odmah!

 
obvezno
obvezno

Nemate račun? Ostvarite pogodnosti uz Libristo račun!

Sve ćete imati pod kontrolom uz Libristo račun.

Otvoriti Libristo račun
Književni savjetnik Libroamiko
Dobar dan, ja sam Libroamiko, mogu li vam pomoći?