Blog

  1. Johns Hopkins University Center for Systems Science and Engineering began collecting, tabulating, and publishing Novel Coronavirus (COVID-19) data on January 31, 2020. We started importing this dataset into Dol…

    8 min read
  2. How We Built DoltHub: Introduction

    Towards the end of last month, we launched a totally reworked and redesigned version of DoltHub, our web application for hosting and collaborating on Dolt repositories. Now that we've had a little while to iro…

    5 min read
  3. Background: We are excited to announce the launch of our documentation site. The goal of Dolt and DoltHub is to equip developers and the data community with radically better data infrastructure. High qualit…

    3 min read
  4. Happy Valentine's Day from all of us at DoltHub! You are the reason we do what we do! It's you. In honor of the holiday, we want to talk about how much we love making queries faster. We're going to examin…

    10 min read
  5. LICENSE.md and README.md in Dolt

    Dolt and DoltHub strive to be the best data distribution platform on the internet. Having documentation versioned alongside data, and a standard, easy way to read the documentation online are features we admi…

    4 min read
  6. Introducing SQL VIEW Support in Dolt

    Dolt is a SQL database with Git-style versioning and distribution. The most recent releases of Dolt introduced support for SQL views that are stored as part of, and versioned along with, a Dolt repository. …

    7 min read
  7. Dolt is a SQL database with Git-style versioning. In Git, the unit of versioning is the file. In Dolt, the unit of versioning is the SQL table. Dolt will eventually support 100% of the Git command line and 100% of My…

    8 min read
  8. In a previous blog post, I showed how the history of a dataset can be queried using the Dolt history tables, and in the first part of this two-part series I covered the IRS SOI data. In this second part I use the IR…

    5 min read
  9. Every year the IRS publishes a treasure trove of data. It contains over a hundred different metrics that provide insight into the finances of American taxpayers. Even more compelling is that they provide this inf…

    6 min read
  10. Querying DoltHub Repositories with SQL

    Since its launch in 2008, GitHub has catalyzed the open source software world and accelerated the culture of software collaboration. Source control was an old idea at that point, but GitHub offered a central…

    2 min read
  11. When we started developing Dolt, our vision was to deliver Git functionality for data. Where Git versions files, Dolt versions tables. We implemented table-based diff and conflict logic and shipped the init…

    6 min read
  12. DoltHub Redesign

    Redesigning DoltHub: Dolt is a database and a data format. DoltHub is a way of hosting and collaborating on Dolt databases. We decided to redesign DoltHub to make it more user-friendly. We are excited to ann…

    5 min read
  13. A few months ago we finally settled on a good way to measure the correctness of Dolt's SQL engine: the sqllogictest package, first developed for SQLite and since used as a benchmark for lots of other datab…

    6 min read
  14. The History of Data Exchange

    IBM and General Electric invented the first databases in the early 1960s. It was only by the early 1970s that enough data had accumulated in databases that the need to transfer data between databases emerged. …

    5 min read
  15. Wikipedia is the largest and most popular general reference work on the internet, making it a powerful tool for predictive language modeling. Wikipedia releases a dump of all its articles and pages twice a mon…

    6 min read
  16. Since releasing Dolt, we have often been asked how it scales. How many rows and how many gigs can you get into a Dolt dataset before things start breaking badly? Answering this question in practice is kind…

    5 min read
  17. I have been a huge EconTalk fan for over ten years. On his podcast with Sebastian Junger, Russ Roberts brought up what he called a Chinese proverb: "No food, one problem. Have food, many problems." The wisdom of…

    3 min read
  18. ImageNet in Dolt

    ImageNet is a dataset maintained by the Stanford Vision Lab. It seems to have fallen into disrepair. The links to download the image labels are broken. We have managed to procure all four released versions of …

    4 min read
  19. Tracking Data Changes with Dolt Blame

    Ever look at some data and wonder where a particular value came from, how long it's been there, or what the reason for changing it was? This is important information, but current data storage formats don't tra…

    7 min read
  20. As we discussed in the "Where Is the Data Catalog?" blog post, Dolt is a database designed for internet-scale collaboration. There are databases with differences, history, rollback, and audit logging. We think t…

    3 min read