Tachyum Successfully Runs LINPACK on FPGA With IEEE 754-2019 Compliant FPU
LAS VEGAS, November 29, 2022 – Tachyum™ continues to advance toward production-ready status for its universal processor, having reached its latest milestone: running LINPACK benchmarks using Prodigy’s Floating-Point Unit (FPU) on a Field Programmable Gate Array (FPGA). This was achieved by running applications under Linux on the integer part of the processor and using the IEEE-compliant FPU to analyze and solve linear equations and linear least-squares problems.
The vector unit includes 16 copies of the Floating-Point Unit (FPU), along with additional shuffle and reduction operations. While there are many instructions to test in a vector unit, the floating-point vector operations are the hardest part, and that part is now successfully behind Tachyum’s product development team.
LINPACK measures a system’s floating-point computing power by timing the solution of a dense system of linear equations. It is a widely used benchmark for supercomputers, including the NSCC Slovakia Supercomputer. After successfully reaching this FPU milestone, Tachyum has only four more steps to go before the final netlist of the Prodigy processor chip. The remaining steps are running UEFI and boot loaders that load Linux on the FPGA, completing vector-based LINPACK testing with I/O, then I/O with virtualization, and finally RAS (Reliability, Availability and Serviceability). Afterwards, Prodigy will be ready for final netlist, followed by tape-out.
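For readers unfamiliar with what a LINPACK-style measurement involves, the following is a minimal sketch in pure Python, not the benchmark Tachyum ran. It solves a random dense system by Gaussian elimination with partial pivoting and converts the elapsed time into a floating-point rate using the commonly quoted operation count of 2/3·n³ + 2·n². The function names `solve_dense` and `linpack_like` are hypothetical and chosen only for illustration.

```python
import random
import time


def solve_dense(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(a)
    # Work on an augmented copy so the caller's data is left untouched.
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for k in range(n):
        # Partial pivoting: move the largest entry in column k onto the diagonal.
        p = max(range(k, n), key=lambda i: abs(m[i][k]))
        if m[p][k] == 0.0:
            raise ValueError("matrix is singular")
        m[k], m[p] = m[p], m[k]
        for i in range(k + 1, n):
            f = m[i][k] / m[k][k]
            for j in range(k, n + 1):
                m[i][j] -= f * m[k][j]
    # Back substitution on the upper-triangular system.
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def linpack_like(n, seed=0):
    """Return (floating-point operations per second, max error) for an n x n solve."""
    rng = random.Random(seed)
    a = [[rng.uniform(-1.0, 1.0) for _ in range(n)] for _ in range(n)]
    # Choose b as the row sums so the solution is (up to rounding) all ones.
    b = [sum(row) for row in a]
    start = time.perf_counter()
    x = solve_dense(a, b)
    elapsed = time.perf_counter() - start
    flops = (2.0 / 3.0) * n**3 + 2.0 * n**2  # assumed conventional operation count
    max_err = max(abs(xi - 1.0) for xi in x)
    return flops / elapsed, max_err


if __name__ == "__main__":
    rate, err = linpack_like(100)
    print(f"{rate / 1e6:.2f} MFLOP/s, max error {err:.2e}")
```

An interpreted loop like this says little about hardware capability; production LINPACK runs rely on tuned, vectorized libraries. The sketch only illustrates what is being timed and how the error check confirms that the floating-point results are correct.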
Tachyum built its FPU from the ground up, and the company describes it as one of the most advanced in the world at the highest clock speeds. The FPU includes a fused multiply-add (FMA) unit, divider, format converter, reciprocal approximator, reciprocal square root approximator and square root approximator. It is fully IEEE compliant, and corner cases have been successfully debugged. In addition to IEEE single and double precision, the Prodigy processor will also support the 16-bit bfloat16 (Brain Floating Point) format.
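As background on the bfloat16 format mentioned above, the sketch below shows how it relates to IEEE single precision in general: bfloat16 keeps the sign bit, all 8 exponent bits and the top 7 fraction bits of a 32-bit float. This illustrates the format only and makes no claim about how Prodigy's format converter is implemented. The helper names are hypothetical, and round-to-nearest-even is an assumed rounding choice.

```python
import struct


def float_to_bfloat16_bits(x):
    """Convert a Python float to 16 bfloat16 bits via IEEE single precision."""
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    # NaN: keep the top bits and force a fraction bit so it stays a NaN.
    if (bits & 0x7F800000) == 0x7F800000 and (bits & 0x007FFFFF):
        return (bits >> 16) | 0x0040
    # Round to nearest, ties to even, before dropping the low 16 bits.
    rounding = 0x7FFF + ((bits >> 16) & 1)
    return ((bits + rounding) >> 16) & 0xFFFF


def bfloat16_bits_to_float(b):
    """Expand 16 bfloat16 bits back to a float by zero-filling the low bits."""
    return struct.unpack("<f", struct.pack("<I", (b & 0xFFFF) << 16))[0]


if __name__ == "__main__":
    for value in (1.0, 3.14159, -0.1):
        b = float_to_bfloat16_bits(value)
        print(f"{value!r} -> 0x{b:04X} -> {bfloat16_bits_to_float(b)!r}")
```

Because the exponent field is the same width as in single precision, bfloat16 covers the same dynamic range while keeping only about two to three decimal digits of precision.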
The next milestone to be achieved is running vector operations, including mask operations and operations on unaligned vectors. Vectorization in the compiler is reaching the production stage, and vectorizing compilers and vectorized libraries will be fully available before chip shipments next year.
“Despite having to overcome obstacles of replacing IP and EDA tools, our engineering team has risen to the challenge of advancing the Prodigy stack so that we can get to tape-out and production next year,” said Dr. Radoslav Danilak, founder and CEO of Tachyum. “We have taken every opportunity to develop Prodigy as a processor that does not simply meet expectations but exceeds them. Successfully running LINPACK means that we are one step closer to completing our vision of transforming data centers into Universal Computing Centers with Prodigy.”
Prodigy delivers unprecedented data center performance, power, and economics, reducing CAPEX and OPEX significantly. Because of its utility for both high-performance and line-of-business applications, Prodigy-powered data center servers can seamlessly and dynamically switch between workloads, eliminating the need for expensive dedicated AI hardware and dramatically increasing server utilization. Tachyum’s Prodigy integrates 128 high-performance custom-designed 64-bit compute cores to deliver up to 4x the performance of the highest-performing x86 processors for cloud workloads, up to 3x that of the highest-performing GPU for HPC, and 6x for AI applications.
A video demonstration of successfully running LINPACK and some QA regression tests for FPU can be found below.
About Tachyum
Tachyum is transforming AI, HPC, public and private cloud data center markets with its recently launched flagship product. Prodigy, the world’s first Universal Processor, unifies the functionality of a CPU, a GPU, and a TPU into a single processor that delivers industry-leading performance, cost, and power efficiency for both specialty and general-purpose computing. When Prodigy processors are provisioned in a hyperscale data center, they enable all AI, HPC, and general-purpose applications to run on one hardware infrastructure, saving companies billions of dollars per year. With data centers currently consuming over 4% of the planet’s electricity, a share predicted to reach 10% by 2030, the ultra-low power Prodigy Universal Processor is critical to continue doubling worldwide data center capacity every four years. Tachyum, co-founded by Dr. Radoslav Danilak, is building the world’s fastest AI supercomputer (128 AI exaflops) in the EU based on Prodigy processors. Tachyum has offices in the United States and Slovakia. For more information, visit https://www.tachyum.com/.