Blog

  1. Introducing Type Changes

    Dolt is a SQL database with Git-style versioning. With each new version of Dolt, we increase the number of supported SQL features, moving toward our goal of being a complete drop-in replacement for MySQL, whi…

    6 min read
  2. Part I: Dolt API Surface Area

    Background: When DoltHub was founded, it was called Liquidata. The goal was to bring liquidity to the data market. The founders realized that the pipes were broken: sending around CSV, JSON, and other forma…

    8 min read
  3. DoltHub is a place on the internet to share, discover, and collaborate on Dolt databases. It's a Next.js application written in TypeScript, backed by a GraphQL server that calls gRPC services written in Go…

    10 min read
  4. How it Started: For those following along, we've been working on improving Dolt's performance with the goal of making Dolt no more than 2-4 times slower than MySQL. When we set out to measure Dolt's performa…

    6 min read
  5. January Dataset Spotlight

    It's that time: our January dataset spotlight here at DoltHub. For the new folks, Dolt is a SQL database with Git-like versioning and DoltHub is a place on the internet to share Dolt databases. This monthly fe…

    2 min read
  6. Announcing DoltHub Issues

    DoltHub is a place on the internet to share and collaborate on Dolt databases. We built DoltHub because we thought it would be useful to interact with versioned SQL databases in familiar ways. For example, …

    3 min read
  7. More Hiring

    In October, we set out to hire more engineers to work on Dolt and DoltHub. Dolt is a SQL database with Git-like versioning and DoltHub is a place to share Dolt repositories. Since then, we added three engineer…

    3 min read
  8. Release notes generation for GitHub repos

    Introduction: Today we're excited to announce the open sourcing of a tool to automatically generate Markdown-formatted release notes for GitHub repositories. Dolt is using this tool to generate our releas…

    6 min read
  9. Dolt is Git for data, a SQL database with version control. We've been working hard recently on making Dolt a useful tool for Data Science (DS) practitioners and we're hoping to launch some slick integration…

    5 min read
  10. Dolt is Git for data and DoltHub is our web application that houses Dolt repositories. DoltHub consists of three separate React applications: our main Next.js app, as well as two Gatsby apps for our blog an…

    6 min read
  11. Performance Benchmarks on Pull Request

    Overview: Not long ago we wrote about measuring Dolt's performance against MySQL with the goal of improving Dolt to be no more than 2-4 times slower than MySQL. To work toward this goal, we created a cont…

    4 min read
  12. On January 1, 2021, a US law was passed requiring hospitals to publish their prices in human- and machine-readable format. We would like to assemble the best open dataset of hospital prices in the US to aid res…

    4 min read
  13. Introduction: Back in November, we announced support for uploading CSV files on DoltHub directly to Dolt repository commits. Since then, we've been quickly iterating on features for upload on the web. We…

    7 min read
  14. Introduction: Dolt stores data in a content-addressable prolly tree in order to get efficient merges and diffs. In designing the table data format, one of our goals was to make table column additions and de…

    8 min read
  15. Introduction: In November, shortly after the election, we published an analysis of Pennsylvania ballot data provided by the Pennsylvania Department of State. The purpose of the analysis was to determine i…

    19 min read
  16. We have been running the DoltHub dataset spotlight since May 2020. This is our eighth issue. The intent was to add additional exposure to Dolt datasets published on DoltHub. Publishing this blog monthly has…

    3 min read
  17. Introduction: Dolt is Git for Data. It's a SQL database that you can clone, fork, branch, and merge. Dolt's SQL engine is go-mysql-server, and today we're going to discuss how it implements join planning…

    15 min read
  18. Last Monday, we released our first data bounty, offering a share of $25,000 for wrangling US presidential precinct-level data. This blog will update you on the progress and encourage you to participate. Finally, …

    4 min read
  19. Keyless Tables in Dolt

    Dolt is a tool built for collaboration and data distribution; it's Git for Data. Git versions files, Dolt versions tables. Today, we're announcing support for keyless tables in Dolt. Strongly typed schem…

    5 min read
  20. Bounty Attribution

    On Monday we launched Bounties, a product that pays users to gather and clean data. In less than a week, our first data bounty has already shown the power of Dolt as a collaborative data platform. In that ti…

    5 min read