Nvidia's KV Cache Transform Coding (KVTC) compresses LLM key-value cache by 20x without model changes, cutting GPU memory costs and time-to-first-token by up to 8x for multi-turn AI applications.
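To put the 20x figure in context, here is a rough KV-cache sizing sketch. The model dimensions below are illustrative assumptions (a Llama-2-7B-like layout), not Nvidia's published KVTC configuration:

```python
# Rough KV-cache size estimate for a transformer decoder.
# All model dimensions here are illustrative assumptions.
def kv_cache_bytes(num_layers, num_kv_heads, head_dim, seq_len, bytes_per_elem=2):
    # Per token, each layer stores one key and one value vector
    # per KV head: 2 * num_kv_heads * head_dim elements.
    per_token = 2 * num_layers * num_kv_heads * head_dim * bytes_per_elem
    return per_token * seq_len

# Example: 32 layers, 32 KV heads, head_dim 128, a 32k-token context in fp16.
raw = kv_cache_bytes(num_layers=32, num_kv_heads=32, head_dim=128, seq_len=32_768)
compressed = raw / 20  # the ~20x compression ratio reported for KVTC

print(f"raw: {raw / 2**30:.1f} GiB, compressed: {compressed / 2**30:.2f} GiB")
```

Under these assumptions a single 32k-token conversation holds 16 GiB of fp16 KV cache; a 20x reduction brings it under 1 GiB, which is why the technique matters for multi-turn serving.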
A comprehensive search was conducted in PubMed, Web of Science, and OpenAlex for literature published between December 1, 2022, and December 31, 2024. Studies were included if they explicitly ...
Powered by Gensonix AI DB, Scientel's LLM solution supports multiple DB nodes in a single LLM application. Our ...
SAN FRANCISCO--(BUSINESS WIRE)--Writer, the full-stack generative AI platform for the enterprise, today released its newest large language model (LLM) to power the next generation of AI applications ...
“ChipNeMo aims to explore the applications of large language models (LLMs) for industrial chip design. Instead of directly deploying off-the-shelf commercial or open-source LLMs, we adopt the ...
The realm of artificial intelligence (AI) may be on the cusp of a new transformative leap, transitioning from Large Language Models (LLMs) to an innovative and expansive concept, which we may call ...