Omnitek achieves world-leading CNN performance per watt in a midrange programmable device.
Omnitek achieves world-leading CNN performance per watt in a midrange programmable device.
BASINGSTOKE, UK. April 16th, 2019 – Omnitek today announced immediate availability of a new Convolutional Neural Network, delivering world-leading performance per watt at full FP32 accuracy in a midrange SoC FPGA.
Optimised for the Intel Arria 10 GX architecture, the Omnitek Deep Learning Processing Unit (DPU) achieves 135 GOPS/W at full 32-bit floating point accuracy when running the VGG-16 CNN in an Arria 10 GX 1150. The innovative design employs a novel mathematical framework combining low-precision fixed point maths with floating point maths to achieve this very high compute density with zero loss of accuracy.
Scalable across a wide range of Arria 10 GX and Stratix 10 GX devices, the DPU can be tuned for low cost or high performance in either embedded or data centre applications. The DPU is fully software programmable in C/C++ or Python using standard frameworks such as TensorFlow, enabling it to be configured for a wide range of standard CNN models including GoogLeNet, ResNet-50 and VGG-16 as well as custom models. No FPGA design expertise is required to do this.
Roger Fawcett, CEO at Omnitek, commented “We are very excited to apply this unique innovation, resulting from our joint research program with Oxford University, to reducing the cost of a whole slew of AI-enabled applications, particularly in video and imaging where we have a rich library of highly optimised IP to complement the DPU and create complete systems on a chip.”
FPGAs are being adopted as the platform of choice for many intelligent video and vision systems. They are ideally suited to Machine Learning applications due to their massively parallel DSP architecture, distributed memory and ability to reconfigure the logic and connectivity for different algorithms. To this latter point, Omnitek’s DPU can be configured to provide optimal compute performance for CNNs, RNNs, MLPs and other neural network topologies which exist today and more importantly, the as yet unknown algorithms and innovative optimisation techniques that will exist in future due to the significant research in this field.
More information is available at www.Omnitek.tv/DPU .
About Omnitek
Omnitek is a world leader in the design of intelligent video and vision systems based on programmable FPGAs and SoCs. Through the supply of expert design services with highly optimised FPGA intellectual property cores covering high-performance video / vision and AI / machine learning, Omnitek can provide cost-optimised solutions to a broad range of markets. To complement this business Omnitek also designs and manufactures a comprehensive suite of video test & measurement equipment.
|
Related News
- Omnitek Demonstrates Highest Performance Convolutional Neural Network on an FPGA
- Ambiq Micro Achieves World-Leading Power Consumption Performance with TSMC 40ULP Technology
- Neural Network Inference Engine IP Core Delivers >10 TeraOPS per Watt
- Altera FPGAs Achieve Compelling Performance-per-Watt in Cloud Data Center Acceleration Using CNN Algorithms
- Mid-Range FPGAs Reach the Next Power and Performance Milestone for Edge Compute Systems
Breaking News
- Logic Design Solutions launches Gen4 NVMe host IP
- ULYSS1, Microcontroller (MCU) for Automotive market, designed by Cortus is available
- M31 is partnering with Taiwan Cooperative Bank to launch an Employee Stock Ownership Trust to strengthen talent retention
- Sondrel announces CEO transition to lead next phase of growth
- JEDEC Publishes LPDDR5 CAMM2 Connector Performance Standard
Most Popular
- Arm's power play will backfire
- Alphawave Semi Selected for AI Innovation Research Grant from UK Government's Advanced Research + Invention Agency
- Secure-IC obtains the first worldwide CAVP Certification of Post-Quantum Cryptography algorithms, tested by SERMA Safety & Security
- Weebit Nano continuing to make progress with potential customers and qualifying its technology Moving closer to finalisation of licensing agreements Q1 FY25 Quarterly Activities Report
- PUFsecurity Collaborate with Arm on PSA Certified RoT Component Level 3 Certification for its Crypto Coprocessor to Provide Robust Security Subsystem Essential for the AIoT era
E-mail This Article | Printer-Friendly Page |