Blog

Benchmarking LLM Inference on Intel Arc Pro B60: A Comparative Analysis of vLLM

A comprehensive benchmark analysis of Intel Arc Pro B60 for LLM inference workloads, comparing vLLM and LLM-Scaler performance across different scenarios.

Read Full Article

Benchmarking LLM Inference on Intel Arc Pro B60

All Blog Posts

By EmbeddedLLM Team • 4 mins

Feb 14, 2026

Benchmarking LLM Inference on Intel Arc Pro B60: A Comparative Analysis of vLLM

A comprehensive benchmark analysis of Intel Arc Pro B60 for LLM inference workloads, comparing vLLM and LLM-Scaler performance across different scenarios.

Introducing JamAI Base v2: From Intent to Action with Executable AI Workflows

By EmbeddedLLM Team • 3 mins

Sep 3, 2025

Introducing JamAI Base v2: From Intent to Execution

Unleash truly agentic AI with executable Python workflows, custom RAG-powered agents, and a supercharged PostgreSQL backend.

FoSEAL Hackathon 2025 Winner GrowSmart App

By EmbeddedLLM Team • 4 mins

Jun 23, 2025

Growing Solutions: Students Win FoSEAL Hackathon 2025 with AI-Powered Agriculture App

How a smart agriculture app built by students and powered by JamAI Base is reimagining the future of food security.

By EmbeddedLLM Team • 3 mins

Apr 2, 2025

Inaugural vLLM Asia Developer Day 2025

Discover the highlights of the inaugural vLLM Asia Developer Day 2025, a full-day event uniting AI engineers developers, and researchers to explore cutting-edge LLM inference, deployment strategies, and optimization techniques. Dive into the agenda, connect with top contributors, and get inspired by the future of open-source AI

Liger Kernels Leap the CUDA Moat: A Case Study with Liger, LinkedIn's SOTA Training Kernels on AMD GPU

By EmbeddedLLM Team • 14 mins

Jan 6, 2025

Beyond GPUs: Why JamAI Base Moved Embedding Models to Intel Xeon CPUs

The journey of JamAI Base towards CPU-powered embedding models highlights a crucial shift in the AI landscape. By harnessing the power of Intel Xeon CPUs and OpenVINO, JamAI Base delivers a compelling combination of performance, efficiency, and cost-effectiveness. This approach democratizes access to powerful AI capabilities, making it easier for organizations of all sizes to leverage AI for transformative outcomes

vLLM Now Supports Running GGUF on AMD Radeon GPU

By EmbeddedLLM Team • 2 mins

Dec 1, 2024

vLLM Now Supports Running GGUF on AMD Radeon GPU

This guide shows the impact of Liger-Kernels Training Kernels on AMD MI300X. The build has been verified for ROCm 6.2.

By EmbeddedLLM Team • 8 mins

Nov 5, 2024

Liger Kernels Leap the CUDA Moat: A Case Study with Liger, LinkedIn's SOTA Training Kernels on AMD GPU

This guide shows the impact of Liger-Kernels Training Kernels on AMD MI300X. The build has been verified for ROCm 6.2.

See the Power of Llama 3.2 Vision on AMD MI300X

By EmbeddedLLM Team • 5 mins

Oct 28, 2024

See the Power of Llama 3.2 Vision on AMD MI300X

This blog post shows you how to run Meta's powerful Llama 3.2-90B-Vision-Instruct model on an AMD MI300X GPU using vLLM. We provide the Docker commands, code snippets, and a video demo to help you get started with image-based prompts and experience impressive performance

By EmbeddedLLM Team • 8 mins

Oct 11, 2024

How to Build vLLM on MI300X from Source

This guide walks you through the process of building vLLM from source on AMD MI300X. The build has been verified for ROCm 6.2.

By EmbeddedLLM Team • 7 mins

Oct 27, 2023

High throughput LLM inference with vLLM and AMD: Achieving LLM inference parity with Nvidia

EmbeddedLLM has ported vLLM to ROCm 5.6, and we are excited to report that LLM inference has achieved parity with Nvidia A100 using AMD MI210.

Benchmarking LLM Inference on Intel Arc Pro B60: A Comparative Analysis of vLLM

All Blog Posts

Benchmarking LLM Inference on Intel Arc Pro B60: A Comparative Analysis of vLLM

Introducing JamAI Base v2: From Intent to Execution

Growing Solutions: Students Win FoSEAL Hackathon 2025 with AI-Powered Agriculture App

Inaugural vLLM Asia Developer Day 2025

Beyond GPUs: Why JamAI Base Moved Embedding Models to Intel Xeon CPUs

vLLM Now Supports Running GGUF on AMD Radeon GPU

Liger Kernels Leap the CUDA Moat: A Case Study with Liger, LinkedIn's SOTA Training Kernels on AMD GPU

See the Power of Llama 3.2 Vision on AMD MI300X

How to Build vLLM on MI300X from Source

High throughput LLM inference with vLLM and AMD: Achieving LLM inference parity with Nvidia

Legal

Email Us