Redis for Administrators and Developers: Difference between revisions
												
				Jump to navigation
				Jump to search
				
| No edit summary | 
| (No difference) | 
Latest revision as of 02:37, 12 November 2017
THIS IS A DRAFT
This text may not be complete.
<slideshow style="nobleprog" headingmark="⌘" incmark="…" scaled="true" font="Trebuchet MS" >
- title
- Redis for Administrators and Developers
- author
- Alexander Patrakov
</slideshow>
Redis Overview ⌘
- Fast in-memory NoSQL database
- "REmote DIctionary Server"
- More than 1 000 000 GETs/sec on a typical developer's laptop, even more on servers
- Unique feature: performance guarantees for each command in Big-O notation
 
- Different from memcached
- Offers disk-based persistence
- Has more than just strings to offer
 
- Different from MongoDB
- Redis data types are closely related to fundamental data structures, not JSON
- No rich queries, just simple commands
 
Redis Releases ⌘
- Download source from http://download.redis.io/releases/
- Redis 4.0.1 released on 24-Jul-2017
- No https, no hash sums! Are they serious about security?
- There are also tarballs available from https://github.com/antirez/redis/releases, with a different hash sum!
 
- Major.Minor.Micro versioning
- Excellent backwards compatibility even between major versions
- Security fixes are provided (?) in the form of micro releases
 
- Support policy: two latest minor versions
- But actually there were no 3.0.x releases after 3.2.0
- Security implications: always use the latest version?
- Or rely on a Linux distribution to backport the security patches
- Verify that they do backport security patches!
 
- The reality may be not that sad: version 3.2.10 was released after 4.0.1, with a number of fixes
 
Redis Installation ⌘
- Use packages? Sometimes they are good enough!
- Debian/Ubuntu: apt install redis-server # from the main repository?
- CentOS: yum install redis # from EPEL
- Arch: pacman -S redis # from the community repository
- SuSE: zypper install redis # from the server:database repository
 
- Redis developers would like you to compile Redis from sources
- So that you have the latest version
 
Installation on Windows ⌘
- There was a fork by Microsoft Open Technologies
- Redis-32 (very outdated) and Redis-64 NuGet packages available
- MSI releases also available
- Latest release in 2016, based on redis-3.0 codebase
 
What's new ⌘
- Redis 4.0.x: 14-Jul-2017 - Present
- Redis Cluster is now compatible with NAT
- New replication engine
- Memory optimizations, including online defragmentation
- Background deletes
- Redis modules system
 
- Redis 3.2.x: 06-May-2016 - Present
- Redis Sentinel is now compatible with NAT (3.2.2)
- New command for efficient bit-field manipulation
- New set of commands for geo indexing
- Memory optimizations
- May or may not fully apply to versions in Linux distributions - bundled vs system jemalloc issue
 
 
- Redis 3.0.x: 01-Apr-2015 - 28-Jan-2016
- Redis 2.8.x: 22-Nov-2013 - 18-Dec-2015
Typical Package contents ⌘
- Binaries:
- /usr/bin/redis-server
- /usr/bin/redis-cli
- /usr/bin/redis-sentinel
- Symlink to redis-server
 
- /usr/bin/redis-trib
- /usr/bin/redis-benchmark
- /usr/bin/redis-check-aof
- /usr/bin/redis-check-rdb
 
- Distributions add files such as:
- /usr/lib/systemd/system/redis.service
- /etc/logrotate.d/redis
- /etc/redis.conf
 
What does what ⌘
- redis-server: the server
- Can start in standalone mode, in sentinel mode, or in cluster mode
 
- redis-cli: a command-line client
- Used by sysadmins for debugging, monitoring and reconfiguration
- Also used in training courses to make them programming-language-neutral
 
- redis-benchmark: the official way of benchmarking Redis
- redis-check-aof, redis-check-rdb: check and possibly fix the structure of Redis database
- redis-trib: a Ruby script that reconfigures a cluster
- Redis 3.0+
- Missing in Arch Linux because of unpackaged Ruby Redis client library
- The missing library can be installed via gem, and the script will then work
 
 
Configuration file ⌘
- Usually located at /etc/redis.conf or /etc/redis/redis.conf
- Extensive documentation in the form of comments
- The name of the configuration file is passed to redis-server as a parameter
- Redis can be asked to rewrite its own configuration after dynamic changes
- Here is how: redis-cli CONFIG REWRITE
- Fails in Arch: (error) ERR Rewriting config file: Read-only file system
 
- This feature is used by Redis Sentinel and Redis Cluster
- For standalone Redis servers, a read-only configuration file is better
 
- Here is how: redis-cli CONFIG REWRITE
Security notes ⌘
- Redis is insecure
- Allows full access (including CONFIG SET and CONFIG REWRITE) to everyone who can connect and knows the password
 
- Security features:
- The "bind" directive (specifies on which IP and port to accept connections)
- Defaults to 127.0.0.1 in Debian, missing elsewhere
 
- The "requirepass" directive
- Sets global password (no username)
 
 
- The "bind" directive (specifies on which IP and port to accept connections)
- Protected mode
- If activated, and there is no "bind" and no "requirepass", don't accept commands on non-local connections
 
Security notes, continued ⌘
- There is no SSL
- Run Redis on a trusted network
- Use stunnel/spiped/IPSec if absolutely needed
 
- Don't run Redis as root
- Or your root's authorized_keys will be overwritten from a hacked webapp
- Demo readily available
 
 
- Or your root's authorized_keys will be overwritten from a hacked webapp
- Don't let HTTP clients send queries to Redis- There was a browser-based demo that steals data
- Does not work anymore - Redis developers have added protections against cross-protocol scripting
 
- Set up a firewall
- Don't ignore protections (apparmor, systemd ProtectSystem) set up by distributions!
How Redis works ⌘
- At startup, loads the existing RDB or AOF file
- Populates internal data structures in memory
 
- Listens on sockets
- Handles all clients in one thread using epoll
- Attempts to read commands from clients
- Attempts to write any buffered output to clients
- Beware of clients and slaves behind slow networks - memory bloat issue
 
- Executes one command at a time, puts any output into the buffer
- Beware of slow commands - latency issue for other clients
 
 
- If configured, preiodically forks and saves its state into RDB file
Versions in distributions ⌘
- Data current as of 28-Aug-2017
- Debian Stable: 2:3.2.6-1
- No significant patches
- Cross-protocol scripting protection is missing!
 
- Debian Testing & Backports: 4:4.0.1-7 and 4:4.0.1-3~bpo9+1
- Latest version, the Debian version churn is about dropping non-deterministic tests
 
- Ubuntu 16.04 LTS: 2:3.0.6-1
- Never updated since Ubuntu release
- Is in Universe, so no bugfixes and no security support
 
- Ubuntu 17.04: 2:3.2.1-1
- So no NAT support in Sentinel
 
Versions in distributions ⌘
- Arch always has the latest Redis version
- Fedora 26: Redis 3.2.8
- CentOS: install from EPEL
- CentOS 6: Redis 2.4.10
- CentOS 7: Redis 3.2.3
 
- OpenSuSE 42.3: Redis 3.2.9-84.11
- No significant patches over vanilla 3.2.9
 
Installation from source ⌘
- No dependencies except gcc and related packages
- make -j4
- Takes 18 seconds on a Lenovo Yoga 2 Pro laptop
- Possible tweaks: OPTIMIZATION=..., MALLOC=...
- Default: OPTIMIZATION=-O2, MALLOC=jemalloc
 
- Or: make 32bit
- make test
- Needs tcl installed
- 2 minutes here, everything passes
- Debian maintainers complain that some tests are sensitive to timing or otherwise unreliable
 
- sudo make install
- Installs in /usr/local/bin
- Override: PREFIX=...
 
Situation with memory allocators ⌘
- Default on Linux: MALLOC=jemalloc
- Jemalloc-4.0.3 is bundled with redis-4.0.1
- There was an upgrade attempt to 4.4.0, which failed due to deadlocks
- Other options: libc, jemalloc, tcmalloc, tcmalloc_minimal
 
- Distributions often undo the bundling of jemalloc
- So, in Debian, redis uses the system jemalloc 3.6.0
 
Quick test without installation ⌘
- ./src/redis-server ./redis.conf
- ./redis-cli
- See example session below
- List of all commands (this training only covers a subset)
- There is also a help command that provides the same help!
 
- kill `pidof redis-server`
- Look, there is a dump.rdb file!
 
127.0.0.1:6379> set foo bar OK 127.0.0.1:6379> get foo "bar" 127.0.0.1:6379> get boom (nil) 127.0.0.1:6379> quit
Connecting to other servers/databases ⌘
- redis-cli -h host -p port
- TCP connection
 
- redis-cli -s /run/redis/redis.sock
- UNIX socket, will be faster, but can be used for local connections only
 
- redis-cli -d database
- By default, there are 16 databases, numbered from 0 (default) to 15
 
- redis-cli -a password
- The password is per-server
- There is no username
 
- There is no SSL - use stunnel or spiped
How to start and stop redis ⌘
- Start: redis-server /etc/redis.conf
- Will start listening on 127.0.0.1:6379 by default
 
- Stop: two ways to do it
- redis-cli shutdown # specify the port if you changed it
- Or just kill the main process with SIGTERM
- Both ways can fail in different cases
 
- Linux distributions typically provide an init script or a systemd unit that does it for you
Integration with systemd ⌘
- Should work out of the box if you installed from package
- Otherwise, copy-paste the unit from Arch Linux or from Debian
- Needs systemd >= 225 (released in August 2015)
 
Recommended sysctls ⌘
- vm.overcommit_memory=1
- The default is 0, can prevent a big (> RAMSIZE/2) Redis from forking
- Redis needs to fork for saving its RDB file, for rewriting AOF, and for talking to slaves
 
- net.core.somaxconn=512
- TCP backlog size
 
- Disable transparent hugepages
- They cause latency spikes when Redis forks
- Don't do this using /etc/rc.local - it would be too late!
- The preferred method is the transparent_hugepage=never kernel argument
- Another good way: two lines in /etc/tmpfiles.d/no-thp.conf:
- w /sys/kernel/mm/transparent_hugepage/enabled - - - - never
- w /sys/kernel/mm/transparent_hugepage/defrag - - - - never
 
 
What else is needed ⌘
- Your application that wants to connect to Redis!
- Libraries exist for many programming languages
- C: hiredis
- Python: redis
- Can use hiredis for speedup
 
- PHP: phpredis or Predis
- Node.js: io_redis or node_redis
 
Accessing Redis from C# ⌘
- The most popular libraries are StackExchange.Redis (open-source MIT license) and ServiceStack.Redis (commercial)
- Free trial of ServiceStack.Redis is limited to 6000 queries per hour
- Version 3.9.71 was the last truly free, but ServiceStack.Redis 3.x has been discontinued in 2014
 
- There are other Redis client libraries
 
- Free trial of ServiceStack.Redis is limited to 6000 queries per hour
- We can use StackExchange.Redis
- Available on NuGet as StackExchange.Redis or StackExchange.Redis.StrongName
- Linux/Mono users, please install StackExchange.Redis.Mono instead, or build from source
- Either use monobuild.sh, or use "Mono" configuration
- FEATURE_SOCKET_MODE_POLL is not supported, that's why
 
 
- Linux/Mono users, please install StackExchange.Redis.Mono instead, or build from source
- Reference it like this: using StackExchange.Redis; // even for Linux
 
- Available on NuGet as StackExchange.Redis or StackExchange.Redis.StrongName
Simple example: counter ⌘
Let's demonstrate the INCR command
$ redis-cli 127.0.0.1:6379> incr counter:foo (integer) 1 127.0.0.1:6379> incr counter:foo (integer) 2 127.0.0.1:6379> incr counter:foo (integer) 3 127.0.0.1:6379> incr counter:foo (integer) 4
Naming Redis keys ⌘
- Convention: use multi-part names with ":" (or sometimes "|") as a namespace separator
- "object-type:id" is a good idea
- Various admin tools (like phpRedisAdmin) enforce it
 
- Think about colons in untrusted user data that becomes part of the key name
 
- "object-type:id" is a good idea
- All bytes are OK
- But "*", "#" and "->" have a special meaning for SORT, so may be a problem
- "{" and "}" have a special meaning for Redis Cluster
 
- Redis doesn't care about UTF-8
- Again, except in the SORT command, but that's about values
 
- The empty string is also a valid key
- Too-long names (>1000 bytes) are slow
Simple example: counter (C#) ⌘
using System;
using StackExchange.Redis;
namespace RedisCounter
{
  class MainClass
  {
    public static void Main (string[] args)
    {
      IConnectionMultiplexer conn = 
          ConnectionMultiplexer.Connect ("127.0.0.1");
      IDatabase db = conn.GetDatabase (0);
      long counter = db.StringIncrement ("counter:foo");
      Console.WriteLine (String.Format("The counter is {0}", counter));
    }
  }
}
Simple example: counter (Python) ⌘
#!/usr/bin/python2
import redis
conn = redis.client.StrictRedis.from_url("redis://127.0.0.1/0")
counter = conn.incr("counter:foo")
print "The counter is", counter
The only exercise for today ⌘
- Counting votes
- You can't do it all now. After I explain each data structure, think how it can help with the progress.
 
- Users can vote (+1 or -1 or neutral) on articles
- Each user and each article are identified by a numeric ID
- Articles also have publication dates
 
- Track the score that each article has
- Prevent users from voting twice on the same article
- Or if you want to complicate things: allow them to change their mind!
 
- Find top 10 articles for a time period between 7 days ago and now
- Make sure that votes are lost (and voting is impossible) when an article is deleted
- It should be possible to delete all of the particular user's votes, too
Redis data types ⌘
- Strings
- Lists
- Sets
- Hashes
- Sorted sets
- Bitmaps (in fact, special operations on strings)
- HyperLogLogs (2.8.9+)
- Geoindexes (3.2.0+)
Redis data types, continued ⌘
- The real question: which type is right for my use case?
- Answer: most likely, a combination of several types
- Yes, redundant data - but that's also how SQL databases keep indexes
 
- Consider all possible requests that make sense now from a business standpoint
- Store data separately for each use case
- Common mistake: stored data, but don't know how to find it without the slow KEYS command
 
- Answer: most likely, a combination of several types
Strings ⌘
- Are binary-safe
- Commonly used to represent a number, a string, or just a blob of data
 
- Use these commands:
- GET key
- Returns (nil) if the key does not exist
- (nil) is different from an empty string
 
- SET key value
- APPEND key value
- Non-existing keys are treated as holding an empty string
 
 
- GET key
Redis strings in bindings ⌘
- StackExchange.Redis names for these commands:
- var res = db.StringGet(key);
- key is of type RedisKey, but you can pass a string
- Results are of the RedisValue type, see the next slides
 
- bool success = db.StringSet(key, value);
- Can fail only because of additional arguments not listed here
 
- long len = db.StringAppend(key, value);
 
- var res = db.StringGet(key);
- Pyredis just uses lowercased command names everywhere
Strings as numbers ⌘
- Use these commands:
- GET key, SET key value
- INCR key, INCRBY key by, INCRBYFLOAT key by
- DECR key, DECRBY key by, DECRBYFLOAT key by
- When incrementing or decrementing a non-existing key, it is assumed to hold 0
 
 
- StackExchange.Redis names for these commands:
- StringGet(), StringSet()
- overloaded variants of StringIncrement()
- long StringIncrement (RedisKey key, long value=1);
- double StringIncrement (RedisKey key, double value);
 
- overloaded variants of StringDecrement()
 
RedisKey type ⌘
- Accepted by almost all IDatabase methods
- Used for things that can affect server selection in a cluster scenario
 
- Can be implicitly converted from byte[] or from string
- UTF-8 representation
 
- Deals with key prefixes
- Just a way to say that the key name is a concatenation of two strings
 
- Not needed in Python: strings just work
RedisValue type ⌘
- Returned by StringGet()
- Needed in C# because of static typing
- Provides conversions to string, byte[], int
- Can be tested for (nil): bool isNil = value.IsNull;
- Supports conversions to Nullable<T> types: var value = (int?)db.StringGet("abc");
 
- Not needed in Python
- The variable can contain a string or a None as needed
 
Some more string operations ⌘
- Get string length: STRLEN key
- Returns 0 if the key doesn't exist
- C#: long StringLength(RedisKey key);
 
- Substrings can be stored and retrieved: SETRANGE key offset value, GETRANGE key start end
- In SETRANGE, the assumed length is the length of the value
- The reply is the length of the resulting string
 
- Good for arrays of fixed-size objects with O(1) access time
- C#:
- RedisValue StringSetRange(RedisKey key, long offset, RedisValue value);
- RedisValue StringGetRange(RedisKey key, long start, long end);
 
 
- In SETRANGE, the assumed length is the length of the value
Deleting, testing and renaming keys ⌘
- DEL key1 key2 key2 ...: deleted the named keys
- EXISTS key1 key2 ...: returns the number of listed keys that exist
- RENAME old new: renames the key
- Effectively deletes the new key if both keys exist
 
- C#:
- bool KeyDelete(RedisKey key);
- long KeyDelete(RedisKey[] keys); // Pitfall: empty array causes exception
- bool KeyExists(RedisKey key);
- No way to test multiple keys for existence from C# or Python in one call
 
- bool KeyRename(RedisKey key, RedisKey newKey);
 
- DEL and RENAME commands work with all data types, not only with strings
Hashes ⌘
- Contain a whole mapping of field-value pairs
- Something like Redis inside Redis, under its own name
- Design decision: objects as hashes vs json in strings
 
- There is no Redis inside Redis inside Redis
 
- Something like Redis inside Redis, under its own name
- Important commands: HSET, HGET, HSTRLEN, HEXISTS, HDEL, HINCR, HINCRBY, HINCRBYFLOAT, HDECR, HDECRBY, HDECRBYFLOAT
- You already know them - they work just like their cousins without H, but take the key as the first argument
- HSET key field value
- bool HashSet(RedisKey key, RedisValue hashField, RedisValue value);
 
- There is no HRENAME, HSETRANGE
- HKEYS returns all fields, HVALS gets all values, HGETALL gets both, HLEN counts fields
Lists ⌘
- Ordered collections of strings
- Similar to linked lists - not C++ deque!
 
- Fast Operations: O(1)
- LPUSH/RPUSH key value: prepend/append a value
- LPOP/RPOP key: remove and get the value from either end
- Return (nil) when applied to an empty list
 
- LLEN key: length of a list
 
Lists, continued ⌘
- Slow operations: O(n) where n is the number of elements to traverse
- LINSERT mylist AFTER Hello There: finds "Hello", inserts "There" after it
- LINDEX key index: reads value by index
- index works like in Python: 0 = leftmost, -1 = rightmost (and these cases are fast)
- Use sorted sets instead if fast access by index is needed
 
- LRANGE key start stop: reads multiple values (stop is inclusive, unlike in Python)
- LSET key index value: sets a particular value
- LREM key count value: remove elements equal to value
- count = 0 means all, count > 0 means some first elements, count < 0 traverses from the right
- In the worst case, traverses all elements
 
 
LTRIM ⌘
- Tricky operation: O(n), where n is the number of values to remove
- LTRIM key start stop: remove everything outside the specified range
- So O(1) if it removes at most only one element (use case: capped collection)
- RPUSH log:general "INFO: nothing has happened"
- LTRIM log:general 0 999
 
 
List commands in C# ⌘
- RedisValue ListGetByIndex(RedisKey key, long index);
- long ListInsertBefore(RedisKey key, RedisValue pivot, RedisValue value); // and After
- RedisValue ListLeftPop(RedisKey key); // and Right
- long ListLeftPush(RedisKey key, RedisValue value);
- long ListLeftPush(RedisKey key, RedisValue[] values);
- long ListLength(RedisKey key);
- RedisValue[] ListRange(RedisKey key, long start = 0, long stop = -1);
- long ListRemove(RedisKey key, RedisValue value, long count = 0);
- void ListSetByIndex(RedisKey key, long index, RedisValue value);
- void ListTrim(RedisKey key, long start, long stop);
Sets ⌘
- Like hashes, but without values
- No intrinsic ordering
 
- Important commands:
- SADD key member1 mebber2 ... : adds members to the set
- O(n) where n is the number of members to be added
 
- SREM key member1 member2 ... : removes from the set
- SCARD key : how many members are there?
- SISMEMBER key candidate : does the candidate exist in the set?
- SMEMBERS key : return all of them
- SPOP key: remove and return some random member
 
- SADD key member1 mebber2 ... : adds members to the set
Sets, continued ⌘
- Sets support bulk operations
- But they are slow: O(N + M + ...)
- SINTER set1 set2 ...: returns all members of the intersection
- SUNION set1 set2 ...: union
- SDIFF a b c d ...: elements of a that don't occur in b, c, d or ...
 
- Server-side store available
- SINTERSTORE dest set1 set2 ...
- SUNIONSTORE dest set1 set2 ...
- SDIFFSTORE dest a b c d ...
 
Sorted sets ⌘
- Like sets, but include a rank (a floating-point number) with each member
- Members are intrinsically ordered by rank (ascending)
- Members with the same rank are ordered lexicographically as arrays of bytes
 
- Insertion of an element: ZADD key score1 value1 score2 value2 ...
- O(log N) where N is the set size
 
- Can look at ranges:
- ZRANGE key start stop // by index
- ZRANGEBYLEX key min max  // by dictionary order
- works correctly only if all members in the set have the same score
 
- ZRANGEBYSCORE // by score
 
- The following commands also work:
- ZSCORE key member, ZRANK key member, ZCARD key, ZREM key member
 
Bit arrays ⌘
- Not really a new data type - just a view on strings
- Commands: GETBIT, SETBIT, BITOP [AND|OR|NOT|XOR], BITCOUNT, BITOPS
- Since Redis 3.2.0, it is possible to use a string as a collection of fields representing many small integers
- Use the BITFIELD command
- Wrapping, saturating or overflow-checked, signed or unsigned arithmetics
- Not currently available in Python or C# bindings
 
HyperLogLogs ⌘
- Task: count distinct values in incoming stream of strings
- Constraint: memory
- Cannot store them all
 
- Solution: probabilistic algorithm
- Uses ~12K bytes per the estimator
- Actually a string - no separate data type
 
- Commands: PFADD key value, PFCOUNT key
- Also: PFMERGE dst src src ...
- C# methods: HyperLogLogAdd, HyperLogLogLength, HyperLogLogMerge
Geoindexes ⌘
- Available in Redis 3.2.0+
- No separate data structure - just a special case of sorted set
 
- Not currently available in StackExchange.Redis
- Available in git version of Python redis module
- Store locations (with names and coordinates): GEOADD key lon lat name
- Locations too close (<5 degrees latitude) to the poles cannot be added
- No GEODEL, use ZREM instead
- Get them back: GEOPOS key name
 
- Calculate distances: GEODIST key name1 name2
- Find locations close to a given point: GEORADIUS key lon lat radius m|km|mi|ft
Modules ⌘
- Available in Redis 4.0.0+
- "Shared objects" that can be loaded by Redis
- Loading from config file: loadmodule /path/to/mymodule.so
- MODULE LOAD command also exists
 
- Typically written in C
- Can provide arbitrary commands and data structures
- Redis modules hub
 
- "Shared objects" that can be loaded by Redis
Modules (C#) ⌘
- var result = db.Execute("SOME COMMAND");
- That's a recent API addition, 2017-04-29
 
- The result is of RedisResult type
- You can cast it to string, int, array of strings, RedisKey, RedisValue, and many other types, as needed to correctly interpret Redis output
- Also you can test it for (nil): result.IsNull()
 
- For some "important" modules, like RediSearch, there is explicit support of commands provided by them
- So no need to run db.Execute()
 
Modules (Python) ⌘
- There is a StrictRedis.execute_command() method that can run an arbitrary command
- There is also StrictRedis.set_response_callback() that can be used so that the Redis reply is converted to the correct type
Pub/Sub ⌘
- To subscribe to channels: SUBSCRIBE ch1 ch2 ...
- No further commands allowed other than SUBSCRIBE, PSUBSCRIBE, UNSUBSCRIBE, PUNSUBSCRIBE, PING and QUIT
- PSUBSCRIBE and PUNSUBSCRIBE work on channel name patterns
 
- To publish to a channel: PUBLISH ch message
- Returns the number of clients that receive it
 
- PUBSUB is a debugging command to list channels or count subscribers
- It is possible for Redis to auto-publish certain events in keyspace
- Look for "notify-keyspace-events" in the config file
- Off by default
 
Pub/Sub unreliability ⌘
- Clients sometimes disconnect and reconnect
- Messages sent during that time are lost for them
- Solution (if lost messages are a problem):
- Keep real messages in lists
- Publish only notifications that the list has changed
- On reconnection, assume that the list has changed
 
Pub/Sub from C# ⌘
- Commands are issued on ISubscriver interface
- ISubscriber sub = conn.GetSubscriber();
- long Publish(RedisChannel channel, RedisValue message);
- void Subscribe(RedisChannel channel, Action<RedisChannel, RedisValue> handler);
- That action is just a function that accepts two arguments
- Treat them as strings or byte arrays
 
- Called from a separate thread
- Message ordering is a separate topic
- Want to process messages concurrently? conn.PreserveAsyncOrder = false;
 
 
- That action is just a function that accepts two arguments
Blocking list operations ⌘
- Useful for stacks and queues
- BLPOP, BRPOP key1 key2 ... timeout
- Try to pop a key from one of the specified lists
- Wait until something appears or until the timeout expires
- Lists are checked in the order given
- Timeout is in seconds, integer, 0 = block forever
 
- Not available in StackExchange.Redis
- And will never be, because of incompatibility with multiplexing a single connection among multiple clients
- Instead, use normal list commands for data and Pub/Sub for notifications
 
Transactions ⌘
- A way to group operations so that nothing else happens in between
- It makes sense to combine transactions with pipelining
 
- WATCH key1 key2 ...
- GET ... or other operations
- MULTI
- SET ...
- QUEUED
 
- SET ...
- QUEUED
 
- EXEC
- <actual replies for commands in the transaction>
- Or a (nil) if at least one of the watched keys has changed
 
Transactions (C#) ⌘
- StackExchange.Redis multiplexes connections
- So can't use WATCH directly - it would affect other threads
- Instead, some canned tests are available - and that's all
var tran = db.CreateTransaction();
tran.AddCondition(Condition.HashNotExists(...));
// ^^^ This is inserted between WATCH and MULTI
tran.HashSetAsync(...);
// ^^^ This goes between MULTI and EXEC
bool committed = tran.Execute();
// ^^^ if true: it was applied; if false: it was rolled back
Special cases of the SET command ⌘
- In some cases, you don't need transactions
- SETNX key value
- Sets the key to a given value only if it doesn't exist
- Alternative form: SET key value NX
- In C#: include a "when: When.NotExists" argument
 
- There is also a SET key value XX: sets only if the key already exists
- GETSET key value: reads the value and replaces it with the new one
Lua scripts ⌘
- An alternative to transactions
- In transactions, you are unable to make decisions (or use results from GET) between MULTI and EXEC
- Not a problem by itself, but with StackExchange.Redis you are also not able to make decisions between WATCH and MULTI!
 
- SCRIPT LOAD script
- persists until server restart, or SCRIPT FLUSH
 
- EVAL script numkeys key1 key2 ... arg1 arg2 ...
- EVALSHA sha1 numkeys key1 key2 ... arg1 arg2 ...
- fails if the script does not exist
 
- Lua tutorial
- Lua string library tutorial
Lua scripts, continued ⌘
- Sandboxed
- Run atomically
- Until natural termination or SCRIPT KILL or SHUTDOWN NOSAVE
- SCRIPT KILL works only if there were no writes by the script
 
- Can call commands: redis.call('GET', 'foo');
- Can access keys and arguments as KEYS[] and ARGS[]
- What's a key and what's an argument is important in the context of Redis Cluster
- Please don't access other keys
- It works for a standalone Redis, but will be a problem when you need to migrate to Cluster
 
 
Performance issues ⌘
- "Redis is sometimes slow to respond"
- Ask devs about pipelining
- Check for slow commands
- Call LATENCY DOCTOR
 
- "Redis is fat"
- Tune it, or change data structures
- E.g. hashes are more memory-efficient than collections of "unrelated" strings
 
- Or maybe there are some keys that you forget to delete?
 
- Tune it, or change data structures
- "Network is too slow"
- Often seen on WAN links
- Use VPN with compression
 
- "Cannot assign requested address"
- Reuse connections
 
Benchmarking redis ⌘
- There is an official benchmark tool: redis-benchmark
- Try simulating the real workload:
- type of commands
- number of clients
- client IP address or the usage of UNIX socket
- pipeline depth
 
- Can also repeat a custom command N times with variations
Monitoring Redis ⌘
- INFO
- Look at memory statistics
- mem_fragmentation_ratio should be between 1 and 1.5 (except on empty server)
 
- Latency problems?
- 200 µs is just expected on a gigabit LAN
- redis-cli --latency
- redis-cli --latency-dist
- Maybe it's not Redis? redis-cli --intrinsic-latency 100
- Expect up to 75 µs spikes on commodity hardware
- Does your KVM run with -realtime mlock=on?
- 3-4 ms spikes happen on AliYun
 
 
 
What do other clients do? ⌘
- redis-cli monitor
OK 1477053420.062978 [0 127.0.0.1:57626] "COMMAND" 1477053420.066968 [0 127.0.0.1:57626] "ping" 1477053442.487071 [0 127.0.0.1:57632] "COMMAND" 1477053442.490143 [0 127.0.0.1:57632] "sadd" "foo" "bar"
Monitoring slow commands ⌘
- SLOWLOG GET 10
- Prints last 10 slow commands
- ID, timestamp, time in µs, the command itself
 
- In redis.conf: slowlog-log-slower-than 100
- The unit is µs
 
- Change threshold at runtime: CONFIG SET slowlog-log-slower-than 50
Avoiding network latency ⌘
- In the previous "Counter" example, we waited for the command to finish
- The execution thread was blocked
 
- Sometimes we don't need the result
- CLIENT REPLY SKIP
 
- Sometimes it is possible to do something else while waiting for the result
- A useful trick is to send another command without waiting for the first one to finish
- It's a good idea to write the two commands to the network in one syscall
- This technique is called "Pipelining"
- This is not related to transactions
 
Special cases ⌘
- Modern versions of Redis support getting/setting multiple keys ar once
- MSET, HMSET
- Support is available in StachExchange.Redis:
- public RedisValue[] StringGet(RedisKey[] keys);
- public RedisValue[] HashGet(RedisKey key, RedisValue[] hashFields);
- public bool StringSet(KeyValuePair<RedisKey, RedisValue>[] values);
- public void HashSet(RedisKey key, HashEntry[] hashFields);
 
Avoiding network latency (C#) ⌘
- If you don't need the result:
- Specify "flags: CommandFlags.FireAndForget"
- But it doesn't use CLIENT REPLY SKIP
 
 
- Specify "flags: CommandFlags.FireAndForget"
- If you want to do something else while Redis thinks about your command:
- Use Async variants of methods
- They return a Task, so: using System.Threading.Tasks
- You can Wait() for the task later and then access its Result
- Convenient wrapper: var result = db.Wait (theTask);
 
 
 
- Use Async variants of methods
- If you want to use pipelining, call db.CreateBatch ()
- Batches only support async methods
- After calling some async methods, call batch.Execute() and wait for all tasks
 
Avoiding network latency (Python) ⌘
- Full async style programming is not possible with redis-python
- Alternatives: tornadis (beta), asyncio-redis (Python 3.3+)
 
- Pipelining is still possible
pipe = conn.pipeline(transaction=False)
pipe.set('foo', 'bar')
pipe.get('bing')
pipe.execute()
# returns a list with queued command results: [True, 'baz']
Is pipelining effective? ⌘
- How does a sysadmin know whether developers are using pipelining?
- Read their source code
- Run Wireshark on the client side
- Look for extra commands being sent between a command and its reply
- Look for multiple Redis commands in one TCP packet
 
Pipelining in reality (C#) ⌘
- StackExchange.Redis uses a single Redis connection
- Multiplexing, not pooling!
 
- Uses a separate thread to communicate with Redis and parse replies
- Uses a buffer to accept commands from other threads, interleaves them as needed
- So, some form of pipelining can happen without you writing a single line of code for that!
Commands to avoid ⌘
- KEYS pattern
- Scans all keys, returns those that match the pattern
 
- SMEMBERS bigset
- Problem: scanning of a big dataset
- Redis is single-threaded, so cannot do other work during this
 
What to do instead ⌘
- SCAN, SSCAN, HSCAN, ZSCAN
- All of them accept and return cursors
- The initial value of the cursor should be 0
- They all return a new value of the cursor and some (or maybe no) results
- If the new cursor is 0, the scan is complete
- Guarantees:
- Will return all elements that lived through the whole scan, at least once
- Will not mention any elements that were never there during the scan
 
- May return some elements twice
- May return empty slices
- C# wraps that: public IEnumerable<HashEntry> HashScan(RedisKey key);
How Redis stores data ⌘
- It chooses representations based on data sizes
- Has efficient storage for:
- Strings that represent integers
- Small HyperLogLogs
- Short lists and sets
- Sets consisting of integers
- Compressible data inside lists and sets
 
- Debug: OBJECT ENCODING key
Memory optimization ⌘
- Migrate to Redis 3.2 or later
- Use efficient data types when there is such alternative
- E.g. hashes vs "just collections of strings"
- E.g. bitfields
 
- Use short field names in hashes
- Or even migrate to lists + convention regarding what is kept at what index
 
- Tune thresholds that Redis uses to switch representations
Maybe it's a client problem? ⌘
- Sometimes clients store keys and forget about them
- OBJECT IDLETIME key
Memory buffer safeguards ⌘
- Problem: client pipelined too much
- Replies accumulate in a buffer
 
- Problem: fast client, slow slave
- Replicated commands accumulate in a buffer
 
- Problem: slow Pub/Sub subscriber
- Messages accumulate in a buffer
 
- Solution: client-output-buffer-limit
- Separate limits for normal clients, slaves and Pub/Sub
 
Other safeguards ⌘
- Hard limit on memory usage: maxmemory
- Comes with a policy what to do (evict keys, which ones, or give errors)
 
- Limit on client idle time: timeout
Persistence ⌘
- Two ways: RDB and AOF
- You can enable both
 
- RDB persistence is enabled by the "save" directive
- Redis forks and saves its contents to a file
- Uses atomic replacement so you can copy /var/lib/redis/dump.rdb, and you have a good backup!
 
- save 300 10 means save RDB to disk if there are 10 changes in 300 seconds
 
- Redis forks and saves its contents to a file
- AOF stands for Append-Only File
- Essentially a log of all commands
- Sometimes rewritten to save space
- Enabled with "appendonly yes"
- You also have to specify appendfsync always|everysec|no
 
 
Forcing Redis to persist ⌘
- SAVE
- BGSAVE
- BGREWRITEAOF
Restoring a backup ⌘
- If you enabled only RDB:
- Stop redis
- Copy dump.rdb to /var/lib/redis
- Start redis
 
- If you also enabled AOF:
- Stop Redis
- Delete AOF and temporarily disable it in the config
- Copy dump.rdb to /var/lib/redis
- Start Redis
- CONFIG SET appendonly yes # this will block for a while
 
Inspecting RDB files ⌘
- redis RDB tools (a third-party project)
- Export to JSON
- Compare dumps
- Convert to Redis protocol
- Estimate memory usage
 
More than one Redis: a big picture ⌘
- Just a bunch of Redis nodes
- E.g. one Redis for cache, one for sessions, etc.
- Distributed locking (Redlock)
 
- Partitioning/sharding
- Always done based on the key
- Old solution: twemproxy
- Now supported in Redis Cluster
 
- Replication
- No special tools needed
- Master-master replication is not available
 
- Failover
- Redis Sentinel
- Now also Redis Cluster
 
Distributed locking (or rather, leases) ⌘
- Objectives:
- Make sure that two application server nodes don't concurrently access some shared resource
- Do that using some Redis servers
- With one Redis, that's easy: SETNX
 
- Tolerate failure of a minor fraction of Redis servers
- Tolerate network partitions
- Tolerate crashing clients (i.e. auto-release the lock after a predefined timeout)
 
- Answer: Redlock
- Note: there were major criticisms
- Libraries are available for C# with StackExchange.Redis
 
Redis Replication ⌘
- Master/slave
- Asynchronous
- Disk-based and diskless modes
- Effectively, the slave asks master to dump RDB, restores it, listens to new commands
- While RDB is being dumped and downloaded, commands are buffered
- They consume memory
- Safeguard: CONFIG SET client-output-buffer-limit slave 256mb 64mb 60
- Disconnect the slave if we buffered 256 MB, or coundn't go below 64 MB for 60 sec
 
 
Redis Replication setup ⌘
- In config file: slaveof 1.2.3.4 6379
- Dynamically: SLAVEOF 1.2.3.4 6379
- Promote slave to master: SLAVEOF NO ONE
- Other relevant configuration parameters:
- slave-serve-stale-data yes
- masterauth <password>
- repl-backlog-size 1mb  # on master
- Quick replication after quick restart
 
 
Redis Sentinel ⌘
- High availability solution
- Monitors Redis servers
- Reconfigures master-slave chain if master fails
- Also functions as a way to locate masters and slaves
- SENTINEL get-master-addr-by-name mymaster
- SENTINEL master mymaster
- SENTINEL slaves mymaster
 
Redis Sentinel in practice ⌘
- You will need an odd number of sentinels
- Place them on your application servers
 
- Also some Redis servers
- Configure the master-slave replication manually first
- Place the master like this in redis-sentinel.conf:
- sentinel monitor mymaster 1.2.3.4 6379 2
- 2 = the number of sentinels that must agree that the master is down in order to fail it over
- It will discover slaves by asking the master
- It will announce itself and autodiscover other sentinels through PUB/SUB
 
- Manual failover: redis-cli -p 26379 SENTINEL failover mymaster
Redis Sentinel bugs and limitations ⌘
- Bug in postinst script in Debian backports: it hangs
- Systemd expects Redis Sentinel to daemonize and write the PID file
- Redis Sentinel is not configured to do so
- daemonize yes
- pidfile /var/run/redis/redis-sentinel.pid
- logfile /var/log/redis/redis-sentinel.log
 
 
- Unusable from C# (with StackExchange.Redis) anyway
- Some code exists in tests, though
 
- Authentication between sentinels is unsupported
- So disable protected mode
 
Redis Cluster ⌘
- Enabled in redis.conf by setting "cluster-enabled yes"
- Needs at least 3 masters
- Will shard keys among them
- There are 16384 slots that can be assigned to masters in arbitrary proportions
 
- Sharding is based on key hash
- Except when there is a {...} in the key name
- The first {...} is non-empty: hash it
- The first {} is empty: hash the entire key
 
- Will also manage slaves
Redis Cluster networking requirements ⌘
- As of Redis 3.2.x, not compatible with NAT
- The first stable version of Redis to support that will be 4.0, released as RC1 15th of October 2016
- Fixing this required updating the protocol version
 
- Needs an extra port (base_port + 10000, not configurable) for cluster bus
Redic Cluster: client side ⌘
- Clients must handle the ASK and MOVED errors (they happen if the server is not responsible for the key slot)
- It contains the correct IP and port to connect to
 
- There is also a CLUSTER SLOTS command
- Returns the full map of the cluster
- The client
 
- Clients must not issue cross-slot commands
Redis Cluster: operations ⌘
- There is a CLUSTER command
- Tedious to use - e.g. can move only one slot at a time
- Redis comes with a redis-trib script
- Needs a "redis" Ruby gem
- "gem install redis", if your distribution hasn't packaged it
 
 
