IBM Blue Gene - Misplaced Pages

Blue Gene was an IBM project aimed at designing supercomputers that can reach operating speeds in the petaFLOPS (PFLOPS) range, with relatively low power consumption.

#919080

71-606: The project created three generations of supercomputers, Blue Gene/L , Blue Gene/P , and Blue Gene/Q . During their deployment, Blue Gene systems often led the TOP500 and Green500 rankings of the most powerful and most power-efficient supercomputers, respectively. Blue Gene systems have also consistently scored top positions in the Graph500 list. The project was awarded the 2009 National Medal of Technology and Innovation . After Blue Gene/Q, IBM focused its supercomputer efforts on

142-604: A quintillion 64-bit floating point arithmetic calculations per second. Frontier clocked in at approximately 1.1 exaflops , beating out the previous record-holder, Fugaku . Some major systems are not on the list. A prominent example is the NCSA's Blue Waters which publicly announced the decision not to participate in the list because they do not feel it accurately indicates the ability of any system to do useful work. Other organizations decide not to list systems for security and/or commercial competitiveness reasons. One such example

213-431: A "compute node". A single compute node has a peak performance of 13.6 GFLOPS. 32 Compute cards are plugged into an air-cooled node board. A rack contains 32 node boards (thus 1024 nodes, 4096 processor cores). By using many small, low-power, densely packaged chips, Blue Gene/P exceeded the power efficiency of other supercomputers of its generation, and at 371 MFLOPS/W Blue Gene/P installations ranked at or near

284-571: A (size-independent) power efficiency of about 2.1 GFLOPS/W, all these systems also populated the top of the June 2012 Green 500 list. Record-breaking science applications have been run on the BG/Q, the first to cross 10 petaflops of sustained performance. The cosmology simulation framework HACC achieved almost 14 petaflops with a 3.6 trillion particle benchmark run, while the Cardioid code, which models

355-607: A 5D torus configuration, with 2 GB/s chip-to-chip links. The Blue Gene/Q chip is manufactured on IBM's copper SOI process at 45 nm. It delivers a peak performance of 204.8 GFLOPS while drawing approximately 55 watts. The chip measures 19×19 mm (359.5 mm²) and comprises 1.47 billion transistors. Completing the compute node, the chip is mounted on a compute card along with 16 GB DDR3 DRAM (i.e., 1 GB for each user processor core). A Q32 "compute drawer" contains 32 compute nodes, each water cooled. A "midplane" (crate) contains 16 Q32 compute drawers for

426-400: A Blue Gene/L system could be partitioned into electronically isolated sets of nodes. The number of nodes in a partition had to be a positive integer power of 2, with at least 2 = 32 nodes. To run a program on Blue Gene/L, a partition of the computer was first to be reserved. The program was then loaded and run on all the nodes within the partition, and no other program could access nodes within

497-569: A cluster with over 100,000 H100s. xAI Memphis Supercluster (also known as "Colossus") allegedly features 100,000 of the same H100 GPUs, which could have put in on the first place, but it is reportedly not in full operation due to power shortages. IBM Roadrunner is no longer on the list (nor is any other using the Cell coprocessor, or PowerXCell ). Although Itanium -based systems reached second rank in 2004, none now remain. Similarly (non- SIMD -style) vector processors (NEC-based such as

568-562: A double-pipeline-double-precision Floating-Point Unit (FPU), a cache sub-system with built-in DRAM controller and the logic to support multiple communication sub-systems. The dual FPUs gave each Blue Gene/L node a theoretical peak performance of 5.6 GFLOPS (gigaFLOPS) . The two CPUs were not cache coherent with one another. Compute nodes were packaged two per compute card, with 16 compute cards (thus 32 nodes) plus up to 2 I/O nodes per node board. A cabinet/rack contained 32 node boards. By

639-488: A global interrupt network for fast barriers . The I/O nodes, which run the Linux operating system , provided communication to storage and external hosts via an Ethernet network. The I/O nodes handled filesystem operations on behalf of the compute nodes. A separate and private Ethernet management network provided access to any node for configuration, booting and diagnostics. To allow multiple programs to run concurrently,

710-514: A petaflop on the HPCG benchmark , delivering 2.9 petaflops and 1.8 petaflops, respectively. The average HPCG result on the current list is 213.3 teraflops, a marginal increase from 211.2 six months ago. Microsoft is back on the TOP500 list with six Microsoft Azure instances (that use/are benchmarked with Ubuntu , so all the supercomputers are still Linux-based), with CPUs and GPUs from same vendors,

781-637: A portable implementation of the high-performance LINPACK benchmark written in Fortran for distributed-memory computers. The most recent edition of TOP500 was published in November 2024 as the 64th edition of TOP500, while the next edition of TOP500 will be published in June 2025 as the 65th edition of TOP500. Since November 2024, the United States' El Capitan is the most powerful supercomputer on TOP500, reaching 1742 petaFlops (1.742 exaFlops) on

SECTION 10

#1732781011920

852-561: A reasonable cost, through novel machine architectures. The initial design for Blue Gene was based on an early version of the Cyclops64 architecture, designed by Monty Denneau . In parallel, Alan Gara had started working on an extension of the QCDOC architecture into a more general-purpose supercomputer. The US Department of Energy started funding the development of this system and it became known as Blue Gene/L (L for Light). Development of

923-478: A total of 512 compute nodes, electrically interconnected in a 5D torus configuration (4x4x4x4x2). Beyond the midplane level, all connections are optical. Racks have two midplanes, thus 32 compute drawers, for a total of 1024 compute nodes, 16,384 user cores, and 16 TB RAM. Separate I/O drawers, placed at the top of a rack or in a separate rack, are air cooled and contain 8 compute cards and 8 PCIe expansion slots for InfiniBand or 10 Gigabit Ethernet networking. At

994-599: Is a redundant manufacturing spare, used to increase yield. The spared-out core is disabled prior to system operation. The chip's processor cores are linked by a crossbar switch to a 32 MB eDRAM L2 cache, operating at half core speed. The L2 cache is multi-versioned—supporting transactional memory and speculative execution —and has hardware support for atomic operations . L2 cache misses are handled by two built-in DDR3 memory controllers running at 1.33 GHz. The chip also integrates logic for chip-to-chip communications in

1065-711: Is an award presented by the Association for Computing Machinery each year in conjunction with the SC Conference series (formerly known as the Supercomputing Conference ). The prize recognizes outstanding achievement in high-performance computing applications. The main purpose is to track the progress over time of parallel computing , by acknowledging and rewarding innovation in applying high-performance computing to applications in science, engineering, and large-scale data analytics . The prize

1136-545: Is anticipated to be operational in 2021 and, with a performance of greater than 1.5 exaflops, should then be the world's most powerful computer. Since June 2019, all TOP500 systems deliver a petaflop or more on the High Performance Linpack (HPL) benchmark, with the entry level to the list now at 1.022 petaflops. In May 2022, the Frontier supercomputer broke the exascale barrier , completing more than

1207-496: Is based on the 64-bit IBM A2 processor core. The A2 processor core is 4-way simultaneously multithreaded and was augmented with a SIMD quad-vector double-precision floating-point unit (IBM QPX). Each Blue Gene/Q compute chip contains 18 such A2 processor cores, running at 1.6 GHz. 16 Cores are used for application computing and a 17th core is used for handling operating system assist functions such as interrupts , asynchronous I/O , MPI pacing, and RAS . The 18th core

1278-567: Is because of better performance per watt ratios and higher absolute performance. AMD GPUs have taken the top 1 and displaced Nvidia in top 10 part of the list. The recent exceptions include the aforementioned Fugaku , Sunway TaihuLight , and K computer . Tianhe-2A is also an interesting exception, as US sanctions prevented use of Xeon Phi; instead, it was upgraded to use the Chinese-designed Matrix-2000 accelerators. Two computers which first appeared on

1349-624: Is compiled by Jack Dongarra of the University of Tennessee , Knoxville , Erich Strohmaier and Horst Simon of the National Energy Research Scientific Computing Center (NERSC) and Lawrence Berkeley National Laboratory (LBNL), and, until his death in 2014, Hans Meuer of the University of Mannheim , Germany . The TOP500 project also includes lists such as Green500 (measuring energy efficiency) and HPCG (measuring I/O bandwidth). In

1420-520: Is over 1,432,513 times faster than the Connection Machine CM-5/1024 (1,024 cores), which was the fastest system in November 1993 (twenty-five years prior) with an Rpeak of 131.0 G FLOPS . As of June 2022 , all supercomputers on TOP500 are 64-bit supercomputers, mostly based on CPUs with the x86-64 instruction set architecture , 384 of which are Intel EMT64 -based and 101 of which are AMD AMD64 -based, with

1491-655: Is the National Supercomputing Center at Qingdao's OceanLight supercomputer, completed in March 2021, which was submitted for, and won, the Gordon Bell Prize . The computer is an exaflop computer, but was not submitted to the TOP500 list; the first exaflop machine submitted to the TOP500 list was Frontier. Analysts suspected that the reason the NSCQ did not submit what would otherwise have been

SECTION 20

#1732781011920

1562-523: The ARMv8 architecture. The Flagship2020 program, by Fujitsu for RIKEN plans to break the exaflops barrier by 2020 through the Fugaku supercomputer , (and "it looks like China and France have a chance to do so and that the United States is content – for the moment at least – to wait until 2023 to break through the exaflops barrier." ) These processors will also implement extensions to

1633-417: The ARMv8.2 based Fugaku increased its performance on the new mixed precision HPC-AI benchmark to 2.0 exaflops, besting its 1.4 exaflops mark recorded six months ago. These represent the first benchmark measurements above one exaflop for any precision on any type of hardware. Summit, a previously fastest supercomputer, is currently highest-ranked IBM-made supercomputer; with IBM POWER9 CPUs. Sequoia became

1704-609: The Earth simulator that was fastest in 2002 ) have also fallen off the list. Also the Sun Starfire computers that occupied many spots in the past now no longer appear. The last non-Linux computers on the list – the two AIX ones – running on POWER7 (in July 2017 ranked 494th and 495th, originally 86th and 85th), dropped off the list in November 2017. Gordon Bell Prize The Gordon Bell Prize

1775-495: The Green500 list of most energy efficient supercomputers with up to 2.1 GFLOPS/W . In June 2012, Blue Gene/Q installations took the top positions in all three lists: TOP500 , Graph500 and Green500 . The following is an incomplete list of Blue Gene/Q installations. Per June 2012, the TOP500 list contained 20 Blue Gene/Q installations of 1/2-rack (512 nodes, 8192 processor cores, 86.35 TFLOPS Linpack) and larger. At

1846-484: The Linux kernel . Since November 2015, no computer on the list runs Windows (while Microsoft reappeared on the list in 2021 with Ubuntu based on Linux). In November 2014, Windows Azure cloud computer was no longer on the list of fastest supercomputers (its best rank was 165th in 2012), leaving the Shanghai Supercomputer Center 's Magic Cube as the only Windows-based supercomputer on

1917-581: The OpenPower platform, using accelerators such as FPGAs and GPUs to address the diminishing returns of Moore's law . A video presentation of the history and technology of the Blue Gene project was given at the Supercomputing 2020 conference. In December 1999, IBM announced a US$ 100 million research initiative for a five-year effort to build a massively parallel computer , to be applied to

1988-603: The San Diego Supercomputer Center . While the TOP500 measures performance on a single benchmark application, Linpack, Blue Gene/L also set records for performance on a wider set of applications. Blue Gene/L was the first supercomputer ever to run over 100 TFLOPS sustained on a real-world application, namely a three-dimensional molecular dynamics code (ddcMD), simulating solidification (nucleation and growth processes) of molten metal under high pressure and temperature conditions. This achievement won

2059-472: The TOP500 list, with a LINPACK benchmarks performance of 70.72 TFLOPS. It thereby overtook NEC's Earth Simulator , which had held the title of the fastest computer in the world since 2002. From 2004 through 2007 the Blue Gene/L installation at LLNL gradually expanded to 104 racks, achieving 478 TFLOPS Linpack and 596 TFLOPS peak. The LLNL BlueGene/L installation held the first position in

2130-483: The United States Department of Energy and Intel announced the first exaFLOP supercomputer would be operational at Argonne National Laboratory by the end of 2021. The computer, named Aurora , was delivered to Argonne by Intel and Cray . On 7 May 2019, The U.S. Department of Energy announced a contract with Cray to build the "Frontier" supercomputer at Oak Ridge National Laboratory. Frontier

2201-491: The 2005 Gordon Bell Prize . In June 2006, NNSA and IBM announced that Blue Gene/L achieved 207.3 TFLOPS on a quantum chemical application ( Qbox ). At Supercomputing 2006, Blue Gene/L was awarded the winning prize in all HPC Challenge Classes of awards. In 2007, a team from the IBM Almaden Research Center and the University of Nevada ran an artificial neural network almost half as complex as

IBM Blue Gene - Misplaced Pages Continue

2272-681: The ACM . Karp went on to be one of the first Gordon Bell Prize judges. Individuals or teams may apply for the award by submitting a technical paper describing their work through the SC conference submissions process. Finalists present their work at that year's conference, and their submissions are included in the conference proceedings. The ACM Gordon Bell Prize is primarily intended to recognize performance achievements that demonstrate: In earlier years, multiple prizes were sometimes awarded to reflect different types of achievements. According to current policies,

2343-721: The ARMv8 architecture equivalent to HPC-ACE2 that Fujitsu is developing with Arm . In June 2016, Sunway TaihuLight became the No. 1 system with 93 petaflop/s (PFLOP/s) on the Linpack benchmark. In November 2016, Piz Daint was upgraded, moving it from 8th to 3rd, leaving the US with no systems under the TOP3 for the 2nd time. Inspur , based out of Jinan , China, is one of the largest HPC system manufacturers. As of May 2017 , Inspur has become

2414-516: The Internet, including the following sources: The information from those sources was used for the first two lists. Since June 1993, the TOP500 is produced bi-annually based on site and vendor submissions only. Since 1993, performance of the No. 1 ranked position has grown steadily in accordance with Moore's law , doubling roughly every 14 months. In June 2018, Summit was fastest with an Rpeak of 187.6593 P FLOPS . For comparison, this

2485-521: The LINPACK benchmarks. As of 2018, the United States has by far the highest share of total computing power on the list (nearly 50%). As of 2023, the United States has the highest number of systems with 161 supercomputers, and China is in second place with 104. The 59th edition of TOP500, published in June 2022, was the first edition of TOP500 to feature only 64-bit supercomputers; as of June 2022, 32-bit supercomputers are no longer listed. The TOP500 list

2556-671: The Prize can be awarded in one or more of the following categories, depending on the entries received in a given year: Peak Performance : If the entry demonstrates outstanding performance in terms of floating point operations per second on an important science/engineering problem; the efficiency of the application in using bottleneck resources (such as memory size or bandwidth) is also taken into consideration. Special Achievement in Scalability, Special Achievement in Time to Solution : If

2627-522: The TOP500 list for 3.5 years, until in June 2008 it was overtaken by IBM's Cell-based Roadrunner system at Los Alamos National Laboratory , which was the first system to surpass the 1 PetaFLOPS mark. While the LLNL installation was the largest Blue Gene/L installation, many smaller installations followed. The November 2006 TOP500 list showed 27 computers with the eServer Blue Gene Solution architecture. For example, three racks of Blue Gene/L were housed at

2698-732: The TOP500 list up until November 2017. Inspur and Supermicro released a few platforms aimed at HPC using GPU such as SR-AI and AGX-2 in May 2017. In June 2018, Summit, an IBM-built system at the Oak Ridge National Laboratory (ORNL) in Tennessee, US, took the No. 1 spot with a performance of 122.3 petaflop/s (PFLOP/s), and Sierra, a very similar system at the Lawrence Livermore National Laboratory, CA, US took #3. These systems also took

2769-573: The TOP500 measures a specific benchmark algorithm using a specific numeric precision. In March 2024, Meta AI disclosed the operation of two datacenters with 24,576 H100 GPUs, which is almost 2x as on the Microsoft Azure Eagle (#3 as of September 2024), which could have made them occupy 3rd and 4th places in TOP500, but neither have been benchmarked. During company's Q3 2024 earnings call in October, M. Zuckerberg disclosed usage of

2840-562: The TOP500 systems are Linux -family based, but Linux above is generic Linux. Sunway TaihuLight is the system with the most CPU cores (10,649,600). Tianhe-2 has the most GPU/accelerator cores (4,554,752). Aurora is the system with the greatest power consumption with 38,698 kilowatts. In November 2014, it was announced that the United States was developing two new supercomputers to exceed China's Tianhe-2 in its place as world's fastest supercomputer. The two computers, Sierra and Summit , will each exceed Tianhe-2's 55 peak petaflops. Summit,

2911-430: The TOP500 that are in each of the listed countries or territories. As of 2024, United States has the most supercomputers on the list, with 172 machines. The United States has the highest aggregate computational power at 6,324 Petaflops Rmax with Japan second (919 Pflop/s) and Germany third (396 Pflop/s). (As of November 2023 ) By number of systems as of November 2024 : Note: All operating systems of

IBM Blue Gene - Misplaced Pages Continue

2982-479: The ascendancy of 32-bit x86 and later 64-bit x86-64 in the early 2000s, a variety of RISC processor families made up most TOP500 supercomputers, including SPARC , MIPS , PA-RISC , and Alpha . All the fastest supercomputers since the Earth Simulator supercomputer have used operating systems based on Linux . Since November 2017 , all the listed supercomputers use an operating system based on

3053-524: The brain of a mouse for the equivalent of a second (the network was run at 1/10 of normal speed for 10 seconds). The name Blue Gene comes from what it was originally designed to do, help biologists understand the processes of protein folding and gene development . "Blue" is a traditional moniker that IBM uses for many of its products and the company itself . The original Blue Gene design was renamed "Blue Gene/C" and eventually Cyclops64 . The "L" in Blue Gene/L comes from "Light" as that design's original name

3124-443: The chip can operate as a 4-way symmetric multiprocessor (SMP). The memory subsystem on the chip consists of small private L2 caches, a central shared 8 MB L3 cache, and dual DDR2 memory controllers. The chip also integrates the logic for node-to-node communication, using the same network topologies as Blue Gene/L, but at more than twice the bandwidth. A compute card contains a Blue Gene/P chip with 2 or 4 GB DRAM, comprising

3195-487: The current generation of supercomputers. In June 2007, IBM unveiled Blue Gene/P , the second generation of the Blue Gene series of supercomputers and designed through a collaboration that included IBM, LLNL, and Argonne National Laboratory 's Leadership Computing Facility . The design of Blue Gene/P is a technology evolution from Blue Gene/L. Each Blue Gene/P Compute chip contains four PowerPC 450 processor cores, running at 850 MHz. The cores are cache coherent and

3266-449: The early 1990s, a new definition of supercomputer was needed to produce meaningful statistics. After experimenting with metrics based on processor count in 1992, the idea arose at the University of Mannheim to use a detailed listing of installed systems as the basis. In early 1993, Jack Dongarra was persuaded to join the project with his LINPACK benchmarks . A first test version was produced in May 1993, partly based on data available on

3337-401: The electrophysiology of the human heart, achieved nearly 12 petaflops with a near real-time simulation, both on Sequoia . A fully compressible flow solver has also achieved 14.4 PFLOP/s (originally 11 PFLOP/s) on Sequoia, 72% of the machine's nominal peak performance. TOP500 The TOP500 project ranks and details the 500 most powerful non- distributed computer systems in

3408-422: The fastest one currently 11th, and another older/slower previously made 10th. And Amazon with one AWS instance currently ranked 64th (it was previously ranked 40th). The number of Arm-based supercomputers is 6; currently all Arm-based supercomputers use the same Fujitsu CPU as in the number 2 system, with the next one previously ranked 13th, now 25th. Legend: Numbers below represent the number of computers in

3479-536: The first two spots on the HPCG benchmark. Due to Summit and Sierra, the US took back the lead as consumer of HPC performance with 38.2% of the overall installed performance while China was second with 29.1% of the overall installed performance. For the first time ever, the leading HPC manufacturer was not a US company. Lenovo took the lead with 23.8% of systems installed. It is followed by HPE with 15.8%, Inspur with 13.6%, Cray with 11.2%, and Sugon with 11%. On 18 March 2019,

3550-596: The integration of all essential sub-systems on a single chip, and the use of low-power logic, each Compute or I/O node dissipated about 17 watts (including DRAMs). The low power per node allowed aggressive packaging of up to 1024 compute nodes, plus additional I/O nodes, in a standard 19-inch rack , within reasonable limits on electrical power supply and air cooling. The system performance metrics, in terms of FLOPS per watt , FLOPS per m of floorspace and FLOPS per unit cost, allowed scaling up to very high performance. With so many nodes, component failures were inevitable. The system

3621-504: The last IBM Blue Gene/Q model to drop completely off the list; it had been ranked 10th on the 52nd list (and 1st on the June 2012, 41st list, after an upgrade). For the first time, all 500 systems deliver a petaflop or more on the High Performance Linpack (HPL) benchmark, with the entry level to the list now at 1.022 petaflops." However, for a different benchmark "Summit and Sierra remain the only two systems to exceed

SECTION 50

#1732781011920

3692-459: The latter including the top eight supercomputers. 15 other supercomputers are all based on RISC architectures, including six based on ARM64 and seven based on the Power ISA used by IBM Power microprocessors . In recent years, heterogeneous computing has dominated the TOP500, mostly using Nvidia 's graphics processing units (GPUs) or Intel's x86-based Xeon Phi as coprocessors . This

3763-503: The leader on Green500 is JEDI, a Bull Sequana XH3000 system using the Nvidia Grace Hopper GH200 Superchip. In June 2022, the top 4 systems of Graph500 used both AMD CPUs and AMD accelerators. After an upgrade, for the 56th TOP500 in November 2020, Fugaku grew its HPL performance to 442 petaflops, a modest increase from the 416 petaflops the system achieved when it debuted in June 2020. More significantly,

3834-412: The list in 2018 were based on architectures new to the TOP500. One was a new x86-64 microarchitecture from Chinese manufacturer Sugon, using Hygon Dhyana CPUs (these resulted from a collaboration with AMD, and are a minor variant of Zen -based AMD EPYC ) and was ranked 38th, now 117th, and the other was the first ARM -based computer on the list – using Cavium ThunderX2 CPUs. Before

3905-462: The list, until it also dropped off the list. It was ranked 436th in its last appearance on the list released in June 2015, while its best rank was 11th in 2008. There are no longer any Mac OS computers on the list. It had at most five such systems at a time, one more than the Windows systems that came later, while the total performance share for Windows was higher. Their relative performance share of

3976-600: The more powerful of the two, will deliver 150–300 peak petaflops. On 10 April 2015, US government agencies banned selling chips, from Nvidia to supercomputing centers in China as "acting contrary to the national security ... interests of the United States"; and Intel Corporation from providing Xeon chips to China due to their use, according to the US, in researching nuclear weapons – research to which US export control law bans US companies from contributing – "The Department of Commerce refused, saying it

4047-513: The original Blue Gene architecture continued under the name Blue Gene/C (C for Cyclops) and, later, Cyclops64. Architecture and chip logic design for the Blue Gene systems was done at the IBM T. J. Watson Research Center , chip design was completed and chips were manufactured by IBM Microelectronics , and the systems were built at IBM Rochester, MN . In November 2004 a 16- rack system, with each rack holding 1,024 compute nodes, achieved first place in

4118-479: The partition while it was in use. Upon completion, the partition nodes were released for future programs to use. Blue Gene/L compute nodes used a minimal operating system supporting a single user program. Only a subset of POSIX calls was supported, and only one process could run at a time on a node in co-processor mode—or one process per CPU in virtual mode. Programmers needed to implement green threads in order to simulate local concurrency. Application development

4189-646: The study of biomolecular phenomena such as protein folding . The research and development was pursued by a large multi-disciplinary team at the IBM T. J. Watson Research Center , initially led by William R. Pulleyblank . The project had two main goals: to advance understanding of the mechanisms behind protein folding via large-scale simulation, and to explore novel ideas in massively parallel machine architecture and software. Major areas of investigation included: how to use this novel platform to effectively meet its scientific goals, how to make such massively parallel machines more usable, and how to achieve performance targets at

4260-484: The supercomputer much more energy efficient than the other top 10 (i.e. it was 5th on Green500 and other such ZettaScaler-2.2 -based systems take first three spots). At 19.86 million cores, it was by far the largest system by core-count, with almost double that of the then-best manycore system, the Chinese Sunway TaihuLight . As of November 2024 , the number one supercomputer is El Capitan ,

4331-471: The third manufacturer to have manufactured a 64-way system – a record that has previously been held by IBM and HP . The company has registered over $ 10B in revenue and has provided a number of systems to countries such as Sudan, Zimbabwe, Saudi Arabia and Venezuela. Inspur was also a major technology partner behind both the Tianhe-2 and Taihu supercomputers, occupying the top 2 positions of

SECTION 60

#1732781011920

4402-510: The time of the Blue Gene/Q system announcement in November 2011, an initial 4-rack Blue Gene/Q system (4096 nodes, 65536 user processor cores) achieved #17 in the TOP500 list with 677.1 TeraFLOPS Linpack, outperforming the original 2007 104-rack BlueGene/L installation described above. The same 4-rack system achieved the top position in the Graph500 list with over 250 GTEPS (giga traversed edges per second ). Blue Gene/Q systems also topped

4473-447: The top of the Green500 lists in 2007–2008. The following is an incomplete list of Blue Gene/P installations. Per November 2009, the TOP500 list contained 15 Blue Gene/P installations of 2-racks (2048 nodes, 8192 processor cores, 23.86 TFLOPS Linpack ) and larger. The third design in the Blue Gene series, Blue Gene/Q , significantly expanded and enhanced on the Blue Gene/L and /P architectures. The Blue Gene/Q "compute chip"

4544-669: The whole list was however similar, and never high for either. In 2004, the System X supercomputer based on Mac OS X ( Xserve , with 2,200 PowerPC 970 processors) once ranked 7th place. It has been well over a decade since MIPS systems dropped entirely off the list though the Gyoukou supercomputer that jumped to 4th place in November 2017 had a MIPS-based design as a small part of the coprocessors. Use of 2,048-core coprocessors (plus 8× 6-core MIPS, for each, that "no longer require to rely on an external Intel Xeon E5 host processor" ) made

4615-626: The world's first exascale supercomputer was to avoid inflaming political sentiments and fears within the United States, in the context of the United States – China trade war. Additional purpose-built machines that are not capable or do not run the benchmark were not included, such as RIKEN MDGRAPE-3 and MDGRAPE-4 . A Google Tensor Processing Unit v4 pod is capable of 1.1 exaflops of peak performance, while TPU v5p claims over 4 exaflops in Bfloat16 floating-point format , however these units are highly specialized to run machine learning workloads and

4686-634: The world. The project was started in 1993 and publishes an updated list of the supercomputers twice a year. The first of these updates always coincides with the International Supercomputing Conference in June, and the second is presented at the ACM/IEEE Supercomputing Conference in November. The project aims to provide a reliable basis for tracking and detecting trends in high-performance computing and bases rankings on HPL benchmarks ,

4757-493: Was "Blue Light". The "P" version was designed to be a petascale design. "Q" is just the letter after "P". The Blue Gene/L supercomputer was unique in the following aspects: The Blue Gene/L architecture was an evolution of the QCDSP and QCDOC architectures. Each Blue Gene/L Compute or I/O node was a single ASIC with associated DRAM memory chips. The ASIC integrated two 700 MHz PowerPC 440 embedded processors, each with

4828-400: Was able to electrically isolate faulty components, down to a granularity of half a rack (512 compute nodes), to allow the machine to continue to run. Each Blue Gene/L node was attached to three parallel communications networks: a 3D toroidal network for peer-to-peer communication between compute nodes, a collective network for collective communication (broadcasts and reduce operations), and

4899-515: Was concerned about nuclear research being done with the machine." On 29 July 2015, President Obama signed an executive order creating a National Strategic Computing Initiative calling for the accelerated development of an exascale (1000 petaflop) system and funding research into post-semiconductor computing. In June 2016, Japanese firm Fujitsu announced at the International Supercomputing Conference that its future exascale supercomputer will feature processors of its own design that implement

4970-585: Was established in 1987. A cash award of $ 10,000 (since 2011) accompanies the recognition, funded by Gordon Bell , a pioneer in high-performance and parallel computing. The Prizes were preceded by a nominal prize ($ 100) established by Alan Karp, a numerical analyst (then of IBM) who challenged claims of MIMD performance improvements proposed in the Letters to the Editor section of the Communications of

5041-414: Was usually performed in C , C++ , or Fortran using MPI for communication. However, some scripting languages such as Ruby and Python have been ported to the compute nodes. IBM published BlueMatter, the application developed to exercise Blue Gene/L, as open source. This serves to document how the torus and collective interfaces were used by applications, and may serve as a base for others to exercise

#919080