Categories
Misc

Dynamic Memory Compression


Despite the success of large language models (LLMs) as general-purpose AI tools, their high demand for computational resources makes their deployment challenging in many real-world scenarios. The sizes of the model and conversation state are limited by the available high-bandwidth memory, which constrains the number of users that can be served and the maximum conversation length. At present…
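To make that memory pressure concrete, here is a back-of-the-envelope sketch (not from the article) of how much high-bandwidth memory the conversation state, i.e. the KV cache, consumes. The model shape is an assumed Llama-2-7B-like configuration and every number is illustrative.

```python
# Rough KV-cache sizing: illustrates why conversation state competes with
# model weights for high-bandwidth memory. All shapes and sizes below are
# illustrative assumptions, not figures from the article.

def kv_cache_bytes(num_layers, num_kv_heads, head_dim, seq_len, dtype_bytes=2):
    """Bytes of KV cache for one sequence: a K and a V tensor per layer."""
    return 2 * num_layers * num_kv_heads * head_dim * seq_len * dtype_bytes

# Assumed Llama-2-7B-like shape: 32 layers, 32 KV heads, head_dim 128, FP16.
per_token = kv_cache_bytes(32, 32, 128, seq_len=1)
per_conversation = kv_cache_bytes(32, 32, 128, seq_len=4096)

weights_gb = 7e9 * 2 / 1e9          # ~14 GB of FP16 weights
cache_gb = per_conversation / 1e9   # ~2 GB per 4K-token conversation
hbm_gb = 80                         # assumed single-GPU HBM capacity

print(f"KV cache per token: {per_token / 1024:.0f} KiB")
print(f"KV cache per 4K-token conversation: {cache_gb:.1f} GB")
print(f"Concurrent 4K-token conversations that fit: {int((hbm_gb - weights_gb) // cache_gb)}")
```

Under these assumptions the cache costs about 0.5 MB per token, so a few dozen 4K-token conversations exhaust an 80 GB GPU, which is the pressure that compressing the conversation state is meant to relieve.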

Source

Categories
Misc

Optimize AI Inference Performance with NVIDIA Full-Stack Solutions


The explosion of AI-driven applications has placed unprecedented demands both on developers, who must balance delivering cutting-edge performance with managing operational complexity and cost, and on AI infrastructure. NVIDIA is empowering developers with full-stack innovations—spanning chips, systems, and software—that redefine what’s possible in AI inference, making it faster, more efficient…

Source

Categories
Misc

We now support VLMs in smolagents!

Categories
Misc

Fast, Low-Cost Inference Offers Key to Profitable AI

Businesses across every industry are rolling out AI services this year. For Microsoft, Oracle, Perplexity, Snap and hundreds of other leading companies, using the NVIDIA AI inference platform — a full stack comprising world-class silicon, systems and software — is the key to delivering high-throughput and low-latency inference and enabling great user experiences while lowering
Read Article

Categories
Misc

‘Baldur’s Gate 3’ Mod Support Launches in the Cloud

GeForce NOW is expanding mod support for hit game Baldur’s Gate 3 in collaboration with Larian Studios and mod.io for Ultimate and Performance members. This expanded mod support arrives alongside seven new games joining the cloud this week. Level Up Gaming Time to roll for initiative — adventurers in the Forgotten Realms can now enjoy
Read Article

Categories
Misc

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Categories
Misc

AI Maps Titan’s Methane Clouds in Record Time

NVIDIA GPUs powered deep learning to decode years of Cassini data in seconds—helping researchers pioneer a smarter way to explore alien worlds.

Categories
Misc

Spinal Health Diagnostics Gets Deep Learning Automation


An advanced deep-learning model that automates X-ray analysis for faster and more accurate assessments could transform spinal health diagnostics. Capable of handling even complex cases, the research promises to help doctors save time, reduce diagnostic errors, and improve treatment plans for patients with spinal conditions like scoliosis and kyphosis. “Although spinopelvic alignment analysis…

Source

Categories
Misc

How AI Helps Fight Fraud in Financial Services, Healthcare, Government and More

Companies and organizations are increasingly using AI to protect their customers and thwart the efforts of fraudsters around the world. Voice security company Hiya found that 550 million scam calls were placed per week in 2023, with INTERPOL estimating that scammers stole $1 trillion from victims that same year. In the U.S., one of four
Read Article

Categories
Misc

Horizontal Autoscaling of NVIDIA NIM Microservices on Kubernetes


NVIDIA NIM microservices are model inference containers that can be deployed on Kubernetes. In a production environment, it’s important to understand the compute and memory profile of these microservices to set up a successful autoscaling plan. In this post, we describe how to set up and use Kubernetes Horizontal Pod Autoscaling (HPA) with an NVIDIA NIM for LLMs model to automatically scale…
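As a companion to that description, the following is a minimal sketch of creating such an HPA with the Kubernetes Python client. The Deployment name nim-llm, the nim namespace, the replica bounds, and the CPU-utilization target are all assumptions made here for illustration; the post itself covers profiling the microservice and choosing metrics suited to NIM, which may differ.

```python
# Sketch: create an autoscaling/v2 HorizontalPodAutoscaler for a hypothetical
# NIM Deployment named "nim-llm" in the "nim" namespace. CPU utilization is
# used only as a placeholder scaling metric.
from kubernetes import client, config

config.load_kube_config()  # use load_incluster_config() when running in-cluster

hpa_manifest = {
    "apiVersion": "autoscaling/v2",
    "kind": "HorizontalPodAutoscaler",
    "metadata": {"name": "nim-llm-hpa", "namespace": "nim"},
    "spec": {
        "scaleTargetRef": {
            "apiVersion": "apps/v1",
            "kind": "Deployment",
            "name": "nim-llm",  # hypothetical NIM for LLMs Deployment
        },
        "minReplicas": 1,
        "maxReplicas": 4,
        "metrics": [{
            "type": "Resource",
            "resource": {
                "name": "cpu",  # placeholder; a NIM-specific metric may be preferable
                "target": {"type": "Utilization", "averageUtilization": 70},
            },
        }],
    },
}

client.AutoscalingV2Api().create_namespaced_horizontal_pod_autoscaler(
    namespace="nim", body=hpa_manifest
)
```

The equivalent could also be applied as a YAML manifest with kubectl; the point of the sketch is simply the shape of an autoscaling/v2 HPA targeting an inference Deployment.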

Source