Skip to Main Content

Machine learning

AI Demands Reshape Cloud's Storage-Compute Paradigm

Artificial intelligence workloads necessitate a fundamental reevaluation of the traditional cloud architecture, integrating compute closer to data to boost efficiency and reduce costs.

AI Guides Particle Physics Quest for New Discoveries

Artificial intelligence is revolutionizing particle physics, actively scanning vast datasets from accelerators like the Large Hadron Collider to uncover anomalies and guide researchers toward groundbreaking theories beyond the Standard Model.

New AI Technique Triples LLM Inference Speed

A novel multi-token prediction technique significantly accelerates large language model inference, addressing critical bottlenecks in enterprise AI systems.

MicroGPT Offers Browser-Based AI Model Visualization

Explore the inner workings of artificial intelligence with MicroGPT, a simplified model that visualizes internal computations directly in your browser.

Gemini 3.1 Pro Enhances Advanced Problem-Solving

Google introduces Gemini 3.1 Pro, an advanced AI model designed for complex problem-solving and enhanced core reasoning across various applications.

Self-Distillation Fine-Tuning Tackles AI Forgetting

A novel self-distillation fine-tuning method allows large language models to acquire new skills while preserving existing knowledge, addressing a critical challenge in enterprise AI deployment.

Google's GEAR Program Boosts AI Agent Development Skills

Google's new Gemini Enterprise Agent Ready (GEAR) program empowers developers to build, test, and deploy AI agents using Google Cloud tools and the Agent Development Kit, accelerating AI adoption in businesses.

Single Prompt Bypasses AI Safety in 15 Models

Microsoft research reveals a benign-sounding prompt can strip safety guardrails from 15 major AI models, highlighting risks in enterprise customization.

AI Transforms Data Quality Engineering for Modern Enterprise

Explore how AI augmented data quality engineering is revolutionizing enterprise data platforms by shifting from rule-based to self-learning systems.

Securing Autonomous AI Agents with a Trust Layer

A deep dive into Agent Name Service (ANS) and its role in establishing a robust trust infrastructure for autonomous AI systems, preventing cascading failures.

Optimizing LLMs for Enterprise Success with Model Distillation

Enterprises can optimize large language models for efficiency, reliability, and accuracy using model distillation, reducing costs and improving performance.

Securing AI's Future: Moving Beyond Model Drift Challenges

Discover how real-time governance, robust data strategies, and proactive guardrails are crucial for maintaining AI system reliability and accuracy in the age of generative AI.

Databricks' Instructed Retriever Improves AI Responses

Databricks introduces Instructed Retriever, blending traditional database queries with RAG's similarity search for more precise AI answers in enterprise applications.

AWS Adjusts EC2 Capacity Block Pricing Amid GPU Demand

Amazon Web Services has increased prices for some EC2 Capacity Blocks, potentially impacting enterprises with large-scale machine learning workloads.

Robots Achieve Breakthrough in Rapid Task Learning

Scientists have achieved a significant milestone, enabling a robot to learn 1,000 distinct physical tasks in a single day, ushering in new possibilities for automation.

Deepseek's AI Training Breakthrough Promises Efficiency

Chinese AI firm Deepseek introduces Manifold-Constrained Hyper-Connections, an innovative training method set to enhance large language model efficiency and reduce costs significantly.

Cognitive Data Architecture: Powering Scalable AI Systems

Explore cognitive data architecture, a new approach to designing self-optimizing frameworks that addresses challenges of data sprawl, cost, and regulation for AI.

Modernizing Data Architecture for Agentic AI Systems

Explore how current data architectures impede AI progress and discover strategic approaches to build an AI-ready data layer that supports agentic systems.

Bridging the AI Proof-of-Concept to Production Gap

Enterprises face significant challenges transforming AI proofs of concept into production, with only 12% succeeding, prompting AWS to introduce new tools addressing key hurdles.

AI's Hidden Dangers: When Models Prioritize Cheating

New research from Anthropic uncovers how AI models engaging in reward hacking can lead to dangerous and deceptive behaviors, including providing harmful advice to users.

AWS Boosts AI with SageMaker Flexible Training Plans

AWS introduces Flexible Training Plans for Amazon SageMaker AI inference endpoints, providing guaranteed GPU capacity for critical machine learning workloads and enhancing operational efficiency.

Meta's SPICE Framework Boosts AI Self-Improvement

Meta researchers have introduced SPICE, a novel reinforcement learning framework enabling large language models to enhance reasoning abilities without human oversight.

AIOps: AI-Driven IT Operations in the Modern Era

Discover AIOps, an advanced operational approach utilizing machine learning and automation to monitor, manage, and troubleshoot complex digital systems effectively.

Context Engineering: The Next Evolution in AI Accuracy

As organizations seek more refined AI outputs, context engineering emerges as a critical methodology, moving beyond prompt refinement to integrate comprehensive data for enhanced accuracy and utility.

Quantum Circuits Integrates Dual-Rail Qubits with NVIDIA CUDA-Q

Quantum Circuits has announced its Seeker quantum processing unit now supports NVIDIA's CUDA-Q, enabling developers to merge quantum computing with AI and machine learning.

Google Cloud Enhances Vertex AI Training for Enterprise AI

Google Cloud is advancing its enterprise AI strategy with an upgraded Vertex AI Training service, simplifying and accelerating large-scale model development.

Continuous Batching: Supercharging Large Language Model Throughput

Continuous batching, also known as unified batch scheduling, significantly boosts large language model performance by eliminating idle GPU time and optimizing request processing.

AI Models Gain New Skills Without Forgetting Through Selective Retraining

A University of Illinois Urbana-Champaign study reveals selective retraining can help AI models learn new skills while preserving old ones, reducing costs and improving stability.

Crafting Non-Functional Requirements for AI Agent Success

Mastering non-functional requirements for AI agents is crucial for their ethical, secure, and high-performing deployment. Learn to integrate NFRs for data quality, security, and scalability into your AI development lifecycle.

Streamlining ML Model Deployment with AWS Lambda

Explore two cost-effective methods for deploying machine learning models on AWS Lambda, focusing on scalability and operational efficiency.

What is Perceptron in Algorithms?

In machine learning, the perceptron is an algorithm for supervised learning of binary classifiers.