Blog

Elfinwood Data Science

Data Science for Everyone

Elfinwood is a term used to describe stunted forests characteristic of most subalpine and alpine regions world-wide. Also referred to as krummholtz, these miniature forests occur at the upper altitudinal limits of trees.

The objective of this blog is to help improve data management and analysis in general, with examples given from the biological and environmental sciences. To this end, I’ll be presenting an introduction to data management and analysis, including using SQL and R, and a PostgreSQL database schema model with the intent of moving us towards a more integrated approach to data management. The materials presented will be equivalent to that of a graduate level course in data management.

For an overview of this blog go to the post “Elfin….what?” which includes a list of current blog posts grouped by topic.

Follow This Blog

Join 12 other followers

Note: If you’d like to follow this blog, but you don’t want to follow by subscribing through WordPress then a couple of options exist. You can “Like” the Elfinwood Data Science Blog Facebook Page or send me your email using the form on the Contacts Page and I’ll add you to the blog’s email list.

Lesson 003: Inserting Data into a Table

Introduction In the previous post I introduced PostgreSQL sequences, tables, and columns; and we used the Lesson_002 files in the learning-data-science GitHub repository to create a table for storing metadata about scientific studies. In this post, Lesson 003, we’ll insert data into the dom_study table that we created in the previous post. Since data analysisContinue reading “Lesson 003: Inserting Data into a Table”

Lesson 002: Creating Tables in PostgreSQL

Introduction In the previous post I introduced PostgreSQL schemas, and we used the Lesson_001 files in the learning-data-science GitHub repository to create a schema for organizing domain tables. In this post, Lesson 002, we’ll create a sequence and a data table, and add column comments. Since data analysis and management are best learned by doing,Continue reading “Lesson 002: Creating Tables in PostgreSQL”

Lesson 001: Creating a Schema

Introduction In the previous post we installed pgAdmin and PostgreSQL, and created a database. In this post, Lesson 001 of Learning Data Science, we’ll discuss database schemas and create a new schema in my_first_database. Since data analysis and management are best learned by doing, the objective of this and future posts is to get readersContinue reading “Lesson 001: Creating a Schema”

Creating a PostgreSQL Database

Introduction In the previous post I introduced the learning-data-science GitHub repository, defined forking a repository, and provided instructions for forking the learning-data-science repository and cloning it to your computer. In this post, we’ll install pgAdmin and PostgreSQL, and create a database. Since data analysis and management are best learned by doing, the objective of thisContinue reading “Creating a PostgreSQL Database”

Learning Data Science

Introduction In the previous post I covered branches and basic merging in Git. In this post I’ll introduce readers to a GitHub repository that we’ll be using for this and future posts to facilitate learning, and then you’ll fork the repository and clone the forked repository to your computer. Since data analysis and management areContinue reading “Learning Data Science”

Version Control: An Overview

Introduction In the last post I provided an overview of databases and data management systems. This post is the first in a 5 part series on version control. The objective of this post is to provide an overview of version control for managing code. This is in preparation for using version control in future postsContinue reading “Version Control: An Overview”

Loading…

Something went wrong. Please refresh the page and/or try again.


Follow My Blog

Get new content delivered directly to your inbox.

Join 12 other followers

Copyright © 2020, Aaron Wells

%d bloggers like this: