Join our Discord Server

AI

How to Customize LLM Models with Ollama’s Modelfile?

Introduction Large Language Models (LLMs) have become increasingly accessible to developers and enthusiasts, allowing anyone to run powerful AI models locally...
Adesoji Alu
1 min read

Getting Started with NVIDIA Dynamo: A Powerful Framework for Distributed LLM Inference

In the rapidly evolving landscape of generative AI, efficiently serving large language models (LLMs) at scale remains a significant challenge. Enter...
Collabnix Team
3 min read

Running DeepSeek R1 on Azure Kubernetes Service (AKS) using Ollama

Introduction DeepSeek is an advanced open-source code language model (LLM) that has gained significant popularity in the developer community. When paired...
Adesoji Alu
4 min read

Running LLMs with TensorRT-LLM on NVIDIA Jetson Orin Nano Super

TensorRT-LLM is essentially a specialized tool that makes large language models (like ChatGPT) run much faster on NVIDIA hardware. Think of...
Ajeet Raina
15 min read

Deploying NVIDIA NIM for Generative AI Applications

NVIDIA’s NIM (Neural Inference Microservices) provides developers an efficient way to deploy optimized AI models from various sources, including community partners...
Collabnix Team
3 min read

The AI Economy: How Claude is Being Used Across Industries Based on Anthropic’s Economic Index Study

AI’s impact on work has moved from speculation to reality. A groundbreaking study from Anthropic analyzing millions of conversations with Claude...
Tanvir Kour
1 min read

Introducing AutoGen v0.4: Revolutionizing Agentic AI with Enhanced Scalability, Flexibility, and Reliability

Over the past year, Microsoft developments with AutoGen have underscored the remarkable capabilities of agentic AI and multi-agent systems. Microsoft is...
Adesoji Alu
7 min read

How vLLM and Docker are Changing the Game for LLM Deployments

Have you ever wanted to deploy a large language model (LLM) that doesn’t just work well but also works lightning-fast? Meet...
Tanvir Kour
2 min read

Exploring LLMs: Ollama, vLLM, Hugging Face, LangChain, and Open WebUI

The world of large language models (LLMs) is evolving rapidly, offering diverse tools for developers to integrate powerful AI into their...
Tanvir Kour
2 min read

Large Language Models in Vertical Industries: Revolutionizing Medical Documentation

Large Language Models (LLMs) have emerged as a groundbreaking force in artificial intelligence, demonstrating remarkable capabilities in understanding and generating human-like...
Tanvir Kour
2 min read

Exploring the Revolutionary Nemotron-4-340B-Instruct: Enhanced Instruction Following and Mathematical Reasoning

Model Overview Nemotron-4-340B-Instruct is a large language model developed by NVIDIA, designed for English-based single and multi-turn chat applications. It has...
Adesoji Alu
4 min read

Powerful RAG Techniques for AI and NLP Projects

Retrieval Augmented Generation also known as (RAG) is the process of optimizing the output of a large language model, so it...
Adesoji Alu
21 min read

What is OpenDevin and what Number 1 problem does it solve for you?

Software development is becoming increasingly complex. From managing intricate codebases to deploying applications across various platforms, developers face a multitude of...
Adesoji Alu
11 min read
Join our Discord Server
Index