How I Try To Keep Up With The Data Tech World (A List of Data Blogs)
A data professional shares their curated list of data tech blogs and explains their return to using RSS feeds to stay current in the field.
Robin Moffatt is a Principal DevEx Engineer and seasoned conference speaker with 15+ years of experience presenting at top events like QCon, Devoxx, Kafka Summit, and Strata. He shares insights on developer experience, distributed systems, and cloud technologies through his blog, YouTube, and public talks.
491 articles from this blog
A developer troubleshoots a connection issue when using ngrok to expose a local Kafka service, ultimately discovering a DNS-related problem.
A guide to preventing the AWS CLI from clearing the screen after displaying help or command output, using environment variables like MANPAGER and AWS_PAGER.
A developer's guide to troubleshooting common pitfalls and misconfigurations when setting up and using Apache Flink SQL with JDBC connectors.
A monthly roundup of news and resources on stream processing, Apache Flink, Kafka, and data engineering from industry experts.
Overview of Apache Flink sessions at Kafka Summit London 2024, covering streaming data processing, SQL, checkpoints, and AI integration.
A guide to using Apache Flink's SQL Gateway REST API for submitting and managing SQL jobs, including practical examples with Postman and HTTPie.
Troubleshooting guide for resolving ClassNotFoundException errors in Apache Flink SQL by managing and locating the correct JAR files.
Monthly roundup of articles and resources on data streaming, covering Flink, Kafka, Debezium, and streaming SQL developments.
A hands-on guide to using different catalogs, including Apache Hive, with Flink SQL, covering installation, configuration, and practical insights.
Explains the role and types of catalogs in Apache Flink SQL, comparing them to traditional RDBMS systems and highlighting their importance in data management.
A technical guide on deploying Antora documentation to Cloudflare Pages using GitHub Actions and GitHub Apps across private repositories.
Explains how a DNS misconfiguration can lead to subdomain hijacking on GitHub Pages, allowing attackers to host spammy content on your domain.
A technical guide to solving the One Billion Row Challenge (1BRC) using SQL and DuckDB, including data loading and aggregation.
A technical guide on deploying Antora documentation using GitHub Actions with authentication for private GitHub repositories.
A monthly roundup of articles and resources on stream processing, Apache Flink, Kafka, and SQL for data engineering and real-time analytics.
A personal review and comparison of popular AI image generation tools like DALL·E 3, Midjourney, Adobe Firefly, and Ideogram for productivity.
A developer troubleshoots Hugo's live-reload feature failing on a Mac after switching from Docker Desktop to OrbStack, and finds a workaround.
Exploring the two JDBC driver options for connecting to Apache Flink: the new Flink JDBC driver and the Hive JDBC driver via the SQL Gateway.
Monthly roundup of data streaming trends, featuring Apache Iceberg, Kafka Streams, Flink deployments, and streaming SQL insights.