Omnitek achieves world-leading CNN performance per watt in a midrange programmable device.
Omnitek achieves world-leading CNN performance per watt in a midrange programmable device.
BASINGSTOKE, UK. April 16th, 2019 – Omnitek today announced immediate availability of a new Convolutional Neural Network, delivering world-leading performance per watt at full FP32 accuracy in a midrange SoC FPGA.
Optimised for the Intel Arria 10 GX architecture, the Omnitek Deep Learning Processing Unit (DPU) achieves 135 GOPS/W at full 32-bit floating point accuracy when running the VGG-16 CNN in an Arria 10 GX 1150. The innovative design employs a novel mathematical framework combining low-precision fixed point maths with floating point maths to achieve this very high compute density with zero loss of accuracy.
Scalable across a wide range of Arria 10 GX and Stratix 10 GX devices, the DPU can be tuned for low cost or high performance in either embedded or data centre applications. The DPU is fully software programmable in C/C++ or Python using standard frameworks such as TensorFlow, enabling it to be configured for a wide range of standard CNN models including GoogLeNet, ResNet-50 and VGG-16 as well as custom models. No FPGA design expertise is required to do this.
Roger Fawcett, CEO at Omnitek, commented “We are very excited to apply this unique innovation, resulting from our joint research program with Oxford University, to reducing the cost of a whole slew of AI-enabled applications, particularly in video and imaging where we have a rich library of highly optimised IP to complement the DPU and create complete systems on a chip.”
FPGAs are being adopted as the platform of choice for many intelligent video and vision systems. They are ideally suited to Machine Learning applications due to their massively parallel DSP architecture, distributed memory and ability to reconfigure the logic and connectivity for different algorithms. To this latter point, Omnitek’s DPU can be configured to provide optimal compute performance for CNNs, RNNs, MLPs and other neural network topologies which exist today and more importantly, the as yet unknown algorithms and innovative optimisation techniques that will exist in future due to the significant research in this field.
More information is available at www.Omnitek.tv/DPU .
About Omnitek
Omnitek is a world leader in the design of intelligent video and vision systems based on programmable FPGAs and SoCs. Through the supply of expert design services with highly optimised FPGA intellectual property cores covering high-performance video / vision and AI / machine learning, Omnitek can provide cost-optimised solutions to a broad range of markets. To complement this business Omnitek also designs and manufactures a comprehensive suite of video test & measurement equipment.
|
Related News
- Omnitek Demonstrates Highest Performance Convolutional Neural Network on an FPGA
- Ambiq Micro Achieves World-Leading Power Consumption Performance with TSMC 40ULP Technology
- Neural Network Inference Engine IP Core Delivers >10 TeraOPS per Watt
- Altera FPGAs Achieve Compelling Performance-per-Watt in Cloud Data Center Acceleration Using CNN Algorithms
- Mid-Range FPGAs Reach the Next Power and Performance Milestone for Edge Compute Systems
Breaking News
- Ubitium Debuts First Universal RISC-V Processor to Enable AI at No Additional Cost, as It Raises $3.7M
- TSMC drives A16, 3D process technology
- Frontgrade Gaisler Unveils GR716B, a New Standard in Space-Grade Microcontrollers
- Blueshift Memory launches BlueFive processor, accelerating computation by up to 50 times and saving up to 65% energy
- Eliyan Ports Industry's Highest Performing PHY to Samsung Foundry SF4X Process Node, Achieving up to 40 Gbps Bandwidth at Unprecedented Power Levels with UCIe-Compliant Chiplet Interconnect Technology
Most Popular
- Cadence Unveils Arm-Based System Chiplet
- CXL Fabless Startup Panmnesia Secures Over $60M in Series A Funding, Aiming to Lead the CXL Switch Silicon Chip and CXL IP
- Esperanto Technologies and NEC Cooperate on Initiative to Advance Next Generation RISC-V Chips and Software Solutions for HPC
- Eliyan Ports Industry's Highest Performing PHY to Samsung Foundry SF4X Process Node, Achieving up to 40 Gbps Bandwidth at Unprecedented Power Levels with UCIe-Compliant Chiplet Interconnect Technology
- Arteris Selected by GigaDevice for Development in Next-Generation Automotive SoC With Enhanced FuSa Standards
E-mail This Article | Printer-Friendly Page |