Tag: Large Language Models

Researchers Uncover Prompt Injection Vulnerabilities in DeepSeek and Claude AI

Cyber security December 9, 2024

Details have emerged about a now-patched security flaw in the DeepSeek artificial intelligence (AI) chatbot that, if successfully exploited, could permit a bad actor...

Agentic AI: How Large Language Models Are Shaping the Future of...

AI News November 1, 2024

After the rise of generative AI, artificial intelligence is on the brink of another significant transformation with the advent of agentic AI. This change...

TensorRT-LLM: A Comprehensive Guide to Optimizing Large Language Model Inference for...

AI News September 13, 2024

As the demand for large language models (LLMs) continues to rise, ensuring fast, efficient, and scalable inference has become more crucial than ever. NVIDIA's...

Reflection 70B : LLM with Self-Correcting Cognition and Leading Performance

AI News September 11, 2024

Reflection 70B is an open-source large language model (LLM) developed by HyperWrite. This new model introduces an approach to AI cognition that could reshape...

Direct Preference Optimization: A Complete Guide

AI News August 14, 2024

import torch import torch.nn.functional as F class DPOTrainer: def __init__(self, model, ref_model, beta=0.1, lr=1e-5): self.model =...

Mistral 2 and Mistral NeMo: A Comprehensive Guide to the Latest...

AI News August 2, 2024

Founded by alums from Google's DeepMind and Meta, Paris-based startup Mistral AI has consistently made waves in the AI community since 2023.Mistral AI first...

MoE-LLaVA: Combination of Specialists for Giant Imaginative and prescient-Language Fashions

AI News April 2, 2024

Latest developments in Giant Imaginative and prescient Language Fashions (LVLMs) have proven that scaling these frameworks considerably boosts efficiency throughout a wide range of...