EdgeThought represents a shift in how large language models (LLMs) are deployed at the device level, optimizing for both performance and cost-effectiveness. The design pairs high effective memory bandwidth with deterministic response times, and is tailored to the decode phase of transformer models such as LLaMA and Mistral, where throughput is limited by how quickly weights can be streamed from memory. The architecture remains programmable and model-flexible, enabling efficient deployment across a range of edge environments.
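To see why memory bandwidth dominates decode performance, a rough roofline-style estimate helps. The sketch below is illustrative only; the bandwidth and quantization figures are assumptions, not EdgeThought specifications:

```python
# Rough upper bound on autoregressive decode speed for a
# memory-bandwidth-bound accelerator. Each generated token must
# stream (roughly) every weight once, so:
#   tokens/s  <=  memory_bandwidth / model_weight_bytes
# All numbers here are illustrative assumptions, not device specs.

def decode_tokens_per_second(params_billions: float,
                             bits_per_weight: int,
                             bandwidth_gb_s: float) -> float:
    weight_bytes = params_billions * 1e9 * bits_per_weight / 8
    return bandwidth_gb_s * 1e9 / weight_bytes

# Example: a 7B model quantized to 4 bits on a 100 GB/s memory system.
print(f"{decode_tokens_per_second(7, 4, 100):.1f} tokens/s upper bound")
# -> ~28.6 tokens/s; halving precision or doubling bandwidth doubles it.
```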
EdgeThought targets models in the 7-13 billion parameter range, doing so with a comparatively small MAC array, while a modular instruction set provides broad coverage of transformer variants. The result is markedly lower response times, making EdgeThought well suited to applications that demand fast token generation and high data throughput.
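The per-token compute of decoding is modest relative to its memory traffic, which is why a small MAC array can suffice. A back-of-the-envelope count, using the same illustrative assumptions as the sketch above:

```python
# Back-of-the-envelope: MACs per generated token is roughly the
# parameter count (one multiply-accumulate per weight), so the
# compute demand at a bandwidth-bound token rate stays small.
# Illustrative assumptions only; not EdgeThought specifications.

params = 7e9          # 7B-parameter model
tokens_per_s = 28.6   # bandwidth-bound estimate from the sketch above

macs_per_token = params                      # ~1 MAC per weight per token
required_macs_per_s = macs_per_token * tokens_per_s

print(f"{required_macs_per_s / 1e12:.1f} TMAC/s sustained")
# -> ~0.2 TMAC/s, far below typical accelerator compute budgets,
#    so decode stays memory-bound and the MAC array can be kept small.
```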
On the software side, EdgeThought's ecosystem readiness rests on compatibility with established LLM frameworks such as HuggingFace Transformers and NVIDIA Triton Inference Server. Integration with fine-tuning and retrieval-augmented generation (RAG) tooling lets existing AI applications be adapted with little rework, whether they run in data centers or on edge devices.
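As an illustration of what framework compatibility means in practice, the sketch below uses the standard HuggingFace Transformers text-generation API. The commented-out device hook is hypothetical, since the source does not document how EdgeThought hardware registers with the framework:

```python
# Minimal sketch: serving a supported model through the standard
# HuggingFace Transformers pipeline API. The "edgethought" device
# string below is a hypothetical placeholder; the actual integration
# mechanism is not specified in the source.
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="mistralai/Mistral-7B-v0.1",  # a model family named in the text
    # device="edgethought",             # hypothetical accelerator hook
)

out = generator("Edge inference matters because", max_new_tokens=32)
print(out[0]["generated_text"])
```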