Redis

Redis
Original author(s)Salvatore Sanfilippo[1][2]
Developer(s)Redis[1][2]
Initial releaseFebruary 26, 2009; 15 years ago (2009-02-26)[3]
Stable release
7.4.1[4] Edit this on Wikidata / October 2, 2024; 2 months ago (October 2, 2024)
Repository
Written inC
Operating systemUnix-like[5]
Available inEnglish
TypeData structure store, key–value database
LicenseRedis Source Available License or SSPL[6]
Websiteredis.io Edit this on Wikidata

Redis (/ˈrɛdɪs/;[7][8] Remote Dictionary Server)[7] is a source-available, in-memory storage, used as a distributed, in-memory key–value database, cache and message broker, with optional durability.[9] Because it holds all data in memory and because of its design, Redis offers low-latency reads and writes, making it particularly suitable for use cases that require a cache. Redis is the most popular NoSQL database,[10][11][12] and one of the most popular databases overall.[13] Redis is used in companies like Twitter,[14][15] Airbnb,[16] Tinder,[17] Yahoo,[18] Adobe,[19] Hulu,[20] Amazon[21] and OpenAI.[22]

Redis supports different kinds of abstract data structures, such as strings, lists, maps, sets, sorted sets, HyperLogLogs, bitmaps, streams, and spatial indices.

The project was developed and maintained by Salvatore Sanfilippo, starting in 2009.[23] From 2015 until 2020, he led a project core team sponsored by Redis Labs.[24] Salvatore Sanfilippo left Redis as the maintainer in 2020.[25] In 2021 Redis Labs dropped the Labs from its name and now is known simply as "Redis".[26]

In 2018, some modules for Redis adopted the SSPL.[27] In 2024, the main Redis code switched to dual-licensed under the Redis Source Available License v2 and the Server Side Public License v1.[6]

History

Salvatore Sanfilippo, the original developer of Redis (photo taken in 2015)

The name Redis means Remote Dictionary Server.[7] The Redis project began when Salvatore Sanfilippo, nicknamed antirez, the original developer of Redis, was trying to improve the scalability of his Italian startup, developing a real-time web log analyzer. After encountering significant problems in scaling some types of workloads using traditional database systems, Sanfilippo began in 2009 to prototype a first proof of concept version of Redis in Tcl.[28] Later Sanfilippo translated that prototype to the C language and implemented the first data type, the list. After a few weeks of using the project internally with success, Sanfilippo decided to open source it, announcing the project on Hacker News. The project began to get traction, particularly among the Ruby community, with GitHub and Instagram being among the first companies adopting it.[29][30]

Sanfilippo was hired by VMware in March 2010.[31][32][33]

In May 2013, Redis was sponsored by Pivotal Software (a VMware spin-off).[34]

In June 2015, development became sponsored by Redis Labs.[35]

In August 2018, Redis Labs announced the switch to a source-available software license, Server Side Public License for some of the optional modules available for Redis,[27] while affirming that the core software will remain BSD-licensed.[36]

In October 2018, Redis 5.0 was released, introducing Redis Stream – a new data structure that allows storage of multiple fields and string values with an automatic, time-based sequence at a single key.[37]

In June 2020, Salvatore Sanfilippo stepped down as Redis' sole maintainer. Sanfilippo was succeeded by Yossi Gottlieb and Oran Agra.[38][39]

In March 2024, Redis switched to a different proprietary software license, styled as the Redis Source Available License v2, and the Server Side Public License v1, also for its core repository,[6] breaking the 2018 pledge. As a consequence, the Linux Foundation created a fork under the name of Valkey,[40] allowing community maintainers, contributors, and users to continue working on an open source version of the Redis database.

Differences from other database systems

Redis popularized the idea of a system that can be considered a store and a cache at the same time. It was designed so that data is always modified and read from the main computer memory, but also stored on disk in a format that is unsuitable for random data access. The formatted data is only reconstructed into memory once the system restarts.

Redis also provides a data model that is very unusual compared to a relational database management system (RDBMS). User commands do not describe a query to be executed by the database engine but rather specific operations that are performed on given abstract data types. Therefore data must be stored in a way which is suitable later for fast retrieval. The retrieval is done without help from the database system in form of secondary indexes, aggregations or other common features of traditional RDBMS. The Redis implementation makes heavy use of the fork system call, to duplicate the process holding the data, so that the parent process continues to serve clients while the child process writes the in-memory data to disk.

Popularity

According to monthly DB-Engines rankings, Redis is often the most popular key–value database.[10] Redis has also been ranked the #4 NoSQL database in user satisfaction and market presence based on user reviews,[41] the most popular NoSQL database in containers,[42] and the #4 Data store of 2019 by ranking website stackshare.io.[43] It was voted most loved database in the Stack Overflow Developer Survey each year from 2017 to 2021.[44]

Supported languages

Since version 2.6, Redis features server-side scripting in the language Lua.[45]

Many programming languages have Redis language bindings on the client side, including:[46] ActionScript, C, C++, C#, Chicken, Clojure, Common Lisp, Crystal, D, Dart, Delphi,[47] Elixir, Erlang, Go, Haskell, Haxe, Io, Java, Nim, JavaScript (Node.js), Julia, Lua, Objective-C, OCaml, Perl, PHP, Pure Data, Python, R,[48] Racket, Ruby, Rust, Scala, Smalltalk, Swift, and Tcl. Several client software programs exist in these languages.[46]

Data types

Redis maps keys to types of values. An important difference between Redis and other structured storage systems is that Redis supports not only strings, but also abstract data types:

  • Lists of strings
  • Sets of strings (collections of non-repeating unsorted elements)
  • Sorted sets of strings (collections of non-repeating elements ordered by a floating-point number called score)
  • Hash tables where keys and values are strings
  • HyperLogLogs used for approximated set cardinality size estimation, available since Redis 2.8.9 in April 2014.[37]
  • Stream of entries with consumer groups, allows you to store multiple fields and string values with an automatic, time-based sequence at a single key, available since Redis 5.0 in October 2018[37]
  • Geospatial data through the implementation of the geohash technique, available since Redis 3.2.[49]

The type of a value determines what operations (called commands) are available for the value. Redis supports high-level, atomic, server-side operations like intersection, union, and difference between sets and sorting of lists, sets and sorted sets.

More data types are supported based on Redis Modules API. Note, that some of them are dual licensed, and not under the BSD 3 clause:[50]

  • JSON – RedisJSON[51] implements ECMA-404 (the JavaScript Object Notation Data Interchange Standard) as a native data type.[52]
  • Search - A query engine for Redis, providing secondary indexing, full-text search, vector similarity search and aggregations.[53]
  • Time series – RedisTimeSeries[54] implements a time series data structure
  • Bloom filter, Cuckoo filter, Count–min sketch, and Top-K – RedisBloom[55] implements a set of probabilistic data structures for Redis
  • Others[56]

Former implementations include:

  • Graph – RedisGraph[57] implements a queryable property graph
    • RedisGraph has been discontinued,[58] and continued in the form of a fork called FalkorDB.[59]

Persistence

Redis typically holds the whole dataset in memory. Versions up to 2.4 could be configured to use what they refer to as virtual memory[60] in which some of the dataset is stored on disk, but this feature is deprecated. Persistence in Redis can be achieved through two different methods. First by snapshotting, where the dataset is asynchronously transferred from memory to disk at regular intervals as a binary dump, using the Redis RDB Dump File Format. Alternatively by journaling, where a record of each operation that modifies the dataset is added to an append-only file (AOF) in a background process. Redis can rewrite the append-only file in the background to avoid an indefinite growth of the journal. Journaling was introduced in version 1.1 and is generally considered the safer approach.

By default, Redis writes data to a file system at least every 2 seconds, with more or less robust options available if needed. In the case of a complete system failure on default settings, only a few seconds of data would be lost.

Replication

Redis supports master–replica replication. Data from any Redis server can replicate to any number of replicas. A replica may be a master to another replica. This allows Redis to implement a single-rooted replication tree. Redis replicas can be configured to accept writes, permitting intentional and unintentional inconsistency between instances. The publish–subscribe feature is fully implemented, so a client of a replica may subscribe to a channel and receive a full feed of messages published to the master, anywhere up the replication tree. Replication is useful for read (but not write) scalability or data redundancy.[61]

Performance

When the durability of data is not needed, the in-memory nature of Redis allows it to perform well compared to database systems that write every change to disk before considering a transaction committed.[7] Redis operates as a single process and is single-threaded or double-threaded when it rewrites the AOF (append-only file).[62] Thus, a single Redis instance cannot use parallel execution of tasks such as stored procedures.

Clustering

Redis introduced clustering in April 2015 with the release of version 3.0.[63] The cluster specification implements a subset of Redis commands: all single-key commands are available, multi-key operations (commands related to unions and intersections) are restricted to keys belonging to the same node, and commands related to database selection operations are unavailable.[64] A Redis cluster can scale up to 1,000 nodes, achieve "acceptable" write safety and to continue operations when some nodes fail.[65][66]

Use cases

Due to the nature of the database design, typical use cases are session caching, full page cache, message queue applications, leaderboards and counting among others.[67] The publish–subscribe messaging paradigm allows real-time communication between servers.

Amazon Web Services offers a managed Redis service called ElastiCache for Redis, Google offers a managed Redis service called Cloud Memorystore,[68] Microsoft offers Azure Cache for Redis in Azure,[69] and Alibaba offers ApsaraDB for Redis in Alibaba Cloud.[70]

Users

Redis is being used in companies like Twitter,[14][15][71] AirBnB,[16] Tinder,[17] Yahoo,[18] Adobe,[19] Hulu,[20] and Amazon.[21]

See also

References

  1. ^ a b Bernardi, Stefano (January 4, 2011). "An interview with Salvatore Sanfilippo, creator of Redis, working out of Sicily". EU-Startups. Menlo Media.
  2. ^ a b Haber, Itamar (July 15, 2015). "Salvatore Sanfilippo: Welcome to Redis Labs". Redis Labs.
  3. ^ "Page 7 of 7 - Redis - Google Code Archive - Long-term storage for Google Code Project Hosting". code.google.com. Retrieved 22 March 2024.
  4. ^ "Release 7.4.1". 2 October 2024. Retrieved 27 October 2024.
  5. ^ "Introduction to Redis". Redis is written in ANSI C and works in most POSIX systems like Linux, *BSD, OS X without external dependencies.
  6. ^ a b c "LICENSE.txt". GitHub. 20 March 2024.
  7. ^ a b c d "FAQ: Redis". Redis.io. Retrieved 12 February 2022.
  8. ^ "Google Groups". groups.google.com. Retrieved 25 February 2022.
  9. ^ "Redis". Redis. Retrieved 2023-07-22.
  10. ^ a b "DB-Engines Ranking - popularity ranking of key-value stores". DB-Engines.
  11. ^ Clark, Lindsay. "Redis becomes the most popular database on AWS as complex cloud application deployments surge". www.theregister.com. Retrieved 2023-07-22.
  12. ^ "Instablinks EP 07: Redis™—The Most Popular In-Memory Database Technology". Instaclustr. Retrieved 2023-07-22.
  13. ^ "DB-Engines Ranking". DB-Engines. Retrieved 2023-07-22.
  14. ^ a b Scaling Redis at Twitter, 31 August 2014, retrieved 2023-07-22
  15. ^ a b Using Redis at Scale at Twitter - by Rashmi Ramesh of Twitter - RedisConf17 -, 5 July 2017, retrieved 2023-07-22
  16. ^ a b AWS re:Invent 2018: Airbnb's Journey from Self-Managed Redis to ElastiCache for Redis (DAT319), 28 November 2018, retrieved 2023-07-22
  17. ^ a b "Building resiliency at scale at Tinder with Amazon ElastiCache | AWS Database Blog". aws.amazon.com. 2020-01-30. Retrieved 2023-07-22.
  18. ^ a b AWS re:Invent 2022 - How Yahoo cost optimizes their in-memory workloads with AWS (DAT321), 2 December 2022, retrieved 2023-07-22
  19. ^ a b AWS re:Invent 2014 | (SDD402) Amazon ElastiCache Deep Dive, 17 November 2014, retrieved 2023-07-22
  20. ^ a b "Hulu Case Study". Amazon Web Services, Inc. Retrieved 2023-07-22.
  21. ^ a b "Amazon GameOn Database Migration Case Study – Amazon Web Services (AWS)". Amazon Web Services, Inc. Retrieved 2023-07-22.
  22. ^ "Elevated API Errors". status.openai.com. Retrieved 2023-10-28.
  23. ^ "A conversation with Salvatore Sanfilippo, creator of the open-source database Redis". VentureBeat. 2016-06-20. Retrieved 2021-06-29.
  24. ^ Kepes, Ben (July 15, 2015). "Redis Labs hires the creator of Redis, Salvatore Sanfilippo". Network World. Retrieved August 30, 2015.
  25. ^ Francisco, Thomas Claburn in San. "Database maestro Antirez says arrivederci to Redis: Seems he wants an unstructured life writing code, not a structured one managing software". www.theregister.com. Retrieved 2021-06-29.
  26. ^ "Database startup Redis Labs rebrands as ... just Redis". SiliconANGLE. 2021-08-11. Retrieved 2021-08-11.
  27. ^ a b Claburn, Thomas. "Redis has a license to kill: Open-source database maker takes some code proprietary". www.theregister.com. Retrieved 2024-03-21.
  28. ^ Sanfilippo, Salvatore (April 28, 2017). "Tcl prototype of Redis". GitHub Gist. Retrieved October 8, 2018.
  29. ^ Wanstrath, Chris (November 3, 2009). "Introducing Resque". Blog. Retrieved October 8, 2018.
  30. ^ Krieger, Mike (October 31, 2011). "Storing hundreds of millions of simple key-value pairs in Redis". Instagram Engineering Blog. Retrieved October 8, 2018.
  31. ^ Shapira, Gwen (March 17, 2010). "VMware Hires Redis Key Developer – But Why?". Blog. Retrieved September 25, 2016.
  32. ^ Sanfilippo, Salvatore (March 15, 2010). "VMware: the new Redis home". Blog. Retrieved September 25, 2016.
  33. ^ Collison, Derek (March 15, 2010). "VMware: The Console: VMware hires key developer for Redis". VMware Blog. Archived from the original on March 22, 2010. Retrieved September 25, 2016.
  34. ^ Sanfilippo, Salvatore. "Redis Sponsors". Redis.io. Redis Labs. Retrieved April 11, 2019.
  35. ^ Sanfilippo, Salvatore (July 15, 2015). "Thanks Pivotal, Hello Redis Labs". <antirez>. Retrieved 2019-04-03.
  36. ^ Shoolman, Yiftach (August 22, 2018). "Redis' License is BSD and will remain BSD". Redis.io. Redis Labs. Retrieved September 30, 2024.
  37. ^ a b c "Redis 5.0 is here!". 22 October 2018.
  38. ^ "Database maestro Antirez says arrivederci to Redis: Seems he wants an unstructured life writing code, not a structured one managing software". theregister.com.
  39. ^ "The end of the Redis adventure -". antirez.com. Retrieved 2020-11-10.
  40. ^ "Linux Foundation Launches Open Source Valkey Community".
  41. ^ "Best NoSQL Databases: Fall 2015 Report from G2 Crowd". G2 Crowd. Archived from the original on 2015-08-24. Retrieved 2015-08-25.
  42. ^ "8 Surprising Facts about Real Docker Adoption". Datadog. 13 June 2018.
  43. ^ "🏆 The Top 50 Developer Tools of 2019". StackShare. Retrieved 2020-07-28.
  44. ^ "Developer Survey Results 2021: Most Loved, Dreaded, and Wanted Databases". Stack Overflow. Stack Exchange. Retrieved 2021-08-23.
  45. ^ "EVAL – Redis". redis.io.
  46. ^ a b "Redis". redis.io.
  47. ^ "Danieleteti/Delphiredisclient". GitHub. 17 September 2022.
  48. ^ Lewis, B. W. (5 July 2015). "rredis: "Redis" Key/Value Database Client". The Comprehensive R Archive Network. Retrieved 2019-04-03.
  49. ^ "Redis 3.2 Release Notes". GitHub. Retrieved 2017-03-10.
  50. ^ "Redis Licensing Overview". Redis. Retrieved 2023-09-30.
  51. ^ "RedisJSON - a JSON data type for Redis".
  52. ^ "RedisJSON - a JSON data type for Redis". redisjson.io.
  53. ^ RediSearch, RediSearch, 2023-09-30, retrieved 2023-09-30
  54. ^ "RedisTimeSeries - Time-Series data structure for Redis".
  55. ^ "RedisBloom - Probabilistic Datatypes Module for Redis".
  56. ^ "Modules". Redis. Retrieved 2023-09-30.
  57. ^ "RedisGraph - a graph database module for Redis".
  58. ^ Kogan, Lior (2023-07-05). "RedisGraph End-of-Life Announcement". Redis. Retrieved 2023-09-30.
  59. ^ FalkorDB, FalkorDB, 2023-09-29, retrieved 2023-09-30
  60. ^ "Virtual Memory". Redis.io. Retrieved April 11, 2019.
  61. ^ "Google Code Archive - Long-term storage for Google Code Project Hosting". code.google.com.
  62. ^ "Redis on the Raspberry Pi: adventures in unaligned lands - <antirez>". antirez.com.
  63. ^ "Redis 3.0 Release Notes". GitHub. Retrieved 2017-03-10.
  64. ^ "Cluster Spec". Retrieved 2017-03-10.
  65. ^ "Cluster Spec". Retrieved 2017-03-10.
  66. ^ "Cluster Tutorial". Retrieved 2017-03-10.
  67. ^ "Top 5 Redis use cases - ObjectRocket". ObjectRocket. Rackspace. 7 November 2017.
  68. ^ "Memorystore: in-memory data store". Google Cloud. Retrieved 2023-02-03.
  69. ^ "Azure Redis Cache - Redis cache cloud service - Microsoft Azure". azure.microsoft.com.
  70. ^ "ApsaraDB for Redis: A Key Value Database Service - Alibaba Cloud". www.alibabacloud.com.
  71. ^ "How Twitter Uses Redis to Scale - 105TB RAM, 39MM QPS, 10,000+ Instances - High Scalability". Highscalability.com. 8 September 2014.

Further reading