Blog
PRODUCTS
KEYWORDS
We adopted Bash Automated Testing System (Bats) to test the Dolt command-line. As of March 10, 2020 we are up to 473 tests, though 55 are skipped because they currently fail. The tests define desired behavior …
5 min readRead MoreDolt is Git for data. It's a SQL database that lets you branch, merge, and fork your data just like you would a Git repository. In previous blog posts we announced how you can use special system tables to q…
6 min readRead MoreHere at DoltHub, we've been working on COVID-19 data since February 5, 2020. First, we started importing John Hopkins data and then we worked on assembling the largest open, regularly-updated set of case detai…
5 min readRead MoreAs part of our effort to track data related to the Novel Coronavirus (COVID-19), we wanted to scrape a JavaScript-enabled website on Coronavirus from Hong Kong. Moreover, you'll notice that the website from Ho…
4 min readRead MoreOn Saturday, February 29, this transpired in our company chat room: Tim/Brian Google Chat Snippet A project was born. We had time series data for confirmed cases, deaths, and recoveries segmented by l…
6 min readRead MoreIn our introductory article for this series, we took a high-level look at the technology stack and architecture behind DoltHub, the online home for Dolt data repositories. In this article, we'll delve a little…
6 min readRead MoreIn this blog post I want to give an introduction to some core concepts used to implement fast querying of databases. These techniques were implemented in Dolt and produced significant performance improvements…
6 min readRead MoreDolt is Git for Data. Learn about the options for versioning data catalogs, data pipeline version tools, and version controlled databases. The Dolt database versions data and schema with full audit history, di…
10 min readRead MoreIn the first part of this two part blog I covered NOAA's "Global Hourly Surface Data" dataset and how it is modeled in Dolt. Dolt is git for data, and for this dataset we model a day of observations as a si…
7 min readRead MoreThe National Oceanic and Atmospheric Administration, NOAA, publishes weather measurements taken from stations around the world. It started in 1901 with a handful of stations, and there are more than 35,000 st…
6 min readRead MoreDolt is Git for data. We built Dolt to help teams collaborate on data sets using the forking, branching, and merging workflows that Git popularized. These workflows are what enable software engineers to co…
7 min readRead MoreIn our previous blog post we examined some freely available licensing tools for open data from Creative Commons. To briefly recap a license specifies the terms under which copyrightable material is made availa…
4 min readRead MoreIntroduction Dolt is a data format. DoltHub is a collaboration platform for data stored in the Dolt format. When sharing copyrighted content the terms of that sharing are governed by a license. In this post…
3 min readRead MoreJohn Hopkins University Center for Systems Science and Engineering began collecting, tabulating, and publishing Novel Coronavirus (COVID-19) data on January 31, 2020. We started importing this dataset into Dol…
8 min readRead MoreTowards the end of last month, we launched a totally reworked and redesigned version of DoltHub, our web application for hosting and collaborating on Dolt repositories. Now that we've had a little while to iro…
5 min readRead MoreBackground We are excited to announce the launch of our documentation site. The goal of Dolt and DoltHub is to enable developers and the data community with radically better data infrastructure. High qualit…
3 min readRead MoreHappy Valentines Day from all of us at DoltHub! You are the reason we do what we do! It you. In honor of the holiday, we want to talk about how much we love making queries faster. We're going to examin…
10 min readRead MoreDolt and DoltHub strive to be the best data distribution platform on the internet. Having documentation versioned alongside data, and a standard, easy way to read the documentation online are features we admi…
4 min readRead MoreDolt is a SQL database with Git-style versioning and distribution. The most recent releases of Dolt introduced support for SQL views that are stored as part of, and versioned along with, a Dolt repository. …
7 min readRead MoreDolt is a SQL database with Git-style versioning. In Git the unit of versioning is files. In Dolt, the unit of versioning is SQL tables. Dolt will eventually support 100% of the Git command line and 100% of My…
8 min readRead More
Tim's Weekly DoltHub Update
Stay in the loop and join the community on Discord