Seriously Testing LLMs
Explores the unique challenges of testing Generative AI and Large Language Models, contrasting it with traditional software testing approaches.
Introducing Claudiomiro, a Node.js CLI tool that enables Claude AI to autonomously complete complex coding tasks through a structured, looping workflow.
Explores four main methods for evaluating Large Language Models (LLMs), including code examples for implementing each approach from scratch.
A guide to the four main methods for evaluating Large Language Models, including code examples and practical implementation details.
A developer explains how to configure a ColdFusion JDBC connection string to use UTC time, fixing a 5-hour time discrepancy with MySQL.
A developer compares Claude Sonnet 4.5, GPT-5 Codex, and Grok Code Fast 1 for coding tasks in Cursor, testing feature generation and test creation.
Azure Local Arc Gateway is now GA, providing a centralized HTTPS egress point for Azure Local workloads, simplifying security and reducing endpoints.
Analysis of changes in Claude Code 2.0's system prompt, showing less prescriptive guidance and more trust in the Sonnet 4.5 model.
Weekly roundup of recent Azure, .NET, GitHub, and Visual Studio blog posts, including updates on AI tools, security, and development features.
Announcement of a beta release for GExperts 1.3.26, a toolset for the Delphi 13 64-bit IDE, including installation steps and known limitations.
Explains how to use Azure Policy to automatically enable Virtual Network Flow Logs across many VNets for security and troubleshooting.
A technical article describing a solution for sorting hierarchical data fields using string interning in C, focusing on maintaining original order while grouping nested structures.
A developer's evolving code methodology for using if/else vs separate if statements based on whether the logic fits on one screen.
A blog post exploring the differences between AI and ML, clarifying terminology and common misconceptions in the field.
Explains how HTMX uses 'previous' and 'next' mechanics with CSS selectors and the Node.compareDocumentPosition() method.
Analyzes C# performance benchmarks for slicing lists, comparing Skip/Take, Range operator, and GetRange methods, highlighting a common benchmarking error.
The article argues that a decline in watching romantic comedies is a cultural crisis, depriving people of valuable life lessons and role models.
Delphi Tools Manager 1.0.4 released, adding support for Delphi versions 6 through 13, including the latest IDE versions.
Explains how ToDictionaryAsync in Entity Framework Core retrieves entire database objects, impacting performance, and provides a solution using Select.
A developer's personal learning journey into the AI ecosystem, starting with an exploration of the Model Context Protocol (MCP) for connecting LLMs to APIs.