Getting Started Using Hadoop, Part 4: Creating Tables With Hive
A tutorial on using Apache Hive to create tables and views from data loaded into a Hadoop cluster, continuing a multi-part series.
A tutorial on using Apache Hive to create tables and views from data loaded into a Hadoop cluster, continuing a multi-part series.
Explains how to use PostgreSQL's COMMENT feature to document tables, columns, and SQL for better data understanding and maintenance.
A tutorial on using PostgreSQL's crosstab function to pivot data directly in SQL, enabling easier analysis without external tools.
A summary of upcoming technical talks on statistical computing, rare DNA variant analysis, and handling large datasets with R and SQL.
A collection of practical tips and guides for effective indexing in PostgreSQL, covering unused indexes, costs, and performance considerations.
A developer shares common SQL bad habits like using column numbers in ORDER BY, implicit joins, and lacking comments, with examples and better practices.
A guide to using PostgreSQL's array_agg function to efficiently aggregate and format data, avoiding manual application-level processing.
A developer shares his preferred method for working with PostgreSQL, advocating for the psql command-line tool and sharing key tips and configurations.
Argues that SQL databases are inherently non-scalable, questioning the effectiveness of traditional scaling techniques like caching and sharding.
A tutorial video demonstrating how to execute SQL queries within the R programming language using the 'sqldf' package for data analysis.
A developer shares his approach to writing clean, readable SQL with practical examples and best practices for structuring queries.
Explains how to use the Array datatype in PostgreSQL for storing and querying data like tags or purchase items, with practical examples.
A technical guide on reducing SQL database size by normalizing repeated data like IPs and User Agents into separate relational tables.
A reminder to cancel free tech event registrations if you can't attend, freeing up spots for others on waiting lists.
A guide to displaying generated SQL queries in the Rails 3 console by configuring ActiveRecord's logger to output to STDOUT.
Explains the GROUPING SETS operator in SQL Server for flexible data aggregation, comparing it to UNION ALL and WITH CUBE approaches.
SQL queries to analyze and identify performance bottlenecks in Oracle Data Integrator (ODI) batch jobs with many tasks.
A technical guide on optimizing SQL queries with wildcard-prefixed LIKE conditions using indexed computed columns for better performance.
A personal recap of attending SQLBits VI, covering sessions on SQL Server performance tuning, parallel query execution, and the query optimizer.
Explains how to optimize SQL query performance by avoiding functions in WHERE clauses, using date filtering as a key example.