Blog

PRODUCTS

KEYWORDS

0 matching articles.
  1. Dolt is to DoltHub as Git is to GitHub - except with Dolt, the unit of versioning is SQL tables. Dolt also has Git-like semantics such as pull, branch and merge. By running dolt pull in a Dolt repository, you …

    4 min read
    Read More
  2. How Dolt Types Work

    UPDATED FEBRUARY 10, 2021: Updated the final table with the types that have been added to Dolt since the article was first written. When we started on Dolt, our goal was to apply Git's idea of versioning to…

    10 min read
    Read More
  3. As COVID-19 continues to affect the lives of millions of people around the world, having the most recent and accurate information is an increasingly important tool to help combat the disease. We've been tra…

    11 min read
    Read More
  4. Dolt is a SQL database with Git-style versioning. In Git the unit of versioning is files. In Dolt, the unit of versioning is SQL tables. Dolt will eventually support 100% of the Git command line and 100% of My…

    8 min read
    Read More
  5. Data CI with DoltHub Webhooks

    Dolt and DoltHub are Git and GitHub for data. The same way that GitHub enables collaboration on source code repositories in Git format, DoltHub enables collaboration on data repositories in Dolt format. A v…

    8 min read
    Read More
  6. Tracking Dolt's SQL regressions As part of our journey to make Dolt a great SQL database, we set out to track the correctness of Dolt’s SQL engine against a suite of SQL tests called the sqllogictests. The…

    6 min read
    Read More
  7. Dolt for Git Noobs

    TL;DR Dolt is a SQL database with built-in Git versioning, branching, and distribution semantics that makes collaborating on and distributing data effortless. What Git does for files, Dolt does for data. …

    14 min read
    Read More
  8. How Dolt Stores Table Data

    Dolt is Git for data. It's a SQL database that lets you clone, branch, diff, merge, and fork your data just like you can with a filesystem tree in Git. This blog post explores one of the fundamental datastr…

    8 min read
    Read More
  9. Dolt Use Cases

    Dolt is Git for data. Instead of versioning files, Dolt versions tables. DoltHub is a place on the internet to share Dolt repositories. As far as we can tell, Dolt is the only database with branches. How would…

    6 min read
    Read More
  10. Overview In this blog post, we discuss an approach for simulating an outbreak of COVID-19 in the US Congress. This is a long technical article about data sets, epidemiology, and simulation. Feel free t…

    15 min read
    Read More
  11. In the previous article in this series, we took a deep look at the overall system architecture of DoltHub, the online data community powered by the Dolt version-controlled database. In this article, we'll zoom…

    9 min read
    Read More
  12. Testing Dolt using Bats

    We adopted Bash Automated Testing System (Bats) to test the Dolt command-line. As of March 10, 2020 we are up to 473 tests, though 55 are skipped because they currently fail. The tests define desired behavior …

    5 min read
    Read More
  13. Dolt is Git for data. It's a SQL database that lets you branch, merge, and fork your data just like you would a Git repository. In previous blog posts we announced how you can use special system tables to q…

    6 min read
    Read More
  14. Here at DoltHub, we've been working on COVID-19 data since February 5, 2020. First, we started importing John Hopkins data and then we worked on assembling the largest open, regularly-updated set of case detai…

    5 min read
    Read More
  15. As part of our effort to track data related to the Novel Coronavirus (COVID-19), we wanted to scrape a JavaScript-enabled website on Coronavirus from Hong Kong. Moreover, you'll notice that the website from Ho…

    4 min read
    Read More
  16. On Saturday, February 29, this transpired in our company chat room: Tim/Brian Google Chat Snippet A project was born. We had time series data for confirmed cases, deaths, and recoveries segmented by l…

    6 min read
    Read More
  17. In our introductory article for this series, we took a high-level look at the technology stack and architecture behind DoltHub, the online home for Dolt data repositories. In this article, we'll delve a little…

    6 min read
    Read More
  18. In this blog post I want to give an introduction to some core concepts used to implement fast querying of databases. These techniques were implemented in Dolt and produced significant performance improvements…

    6 min read
    Read More
  19. So You Want Git for Data?

    Dolt is Git for Data. Learn about the options for versioning data catalogs, data pipeline version tools, and version controlled databases. The Dolt database versions data and schema with full audit history, di…

    10 min read
    Read More
  20. In the first part of this two part blog I covered NOAA's "Global Hourly Surface Data" dataset and how it is modeled in Dolt. Dolt is git for data, and for this dataset we model a day of observations as a si…

    7 min read
    Read More
  21. Tim Sehn

    Tim's Weekly DoltHub Update

    Stay in the loop and join the community on Discord