37

I recently discovered that MySQL has a "memory" engine that I wasn't aware of (most of my database work is for hobby projects so I learn what I need as I go). It seems like this option should give me drastically improved performance, so I'm wondering if there are any drawbacks that go with it. The two that I know of are:

  1. I need to have enough RAM to hold the table(s) in question.
  2. The tables are lost if the machine shuts down.

I believe #1 shouldn't be an issue since I'm using AWS EC2 and can move to an instance type with more memory if needed. I believe I can mitigate #2 by dumping back to disk as needed.

What other issues are there? Can the memory engine ever give worse performance than either MyISAM or InnoDB? I think I read something that indices are different with this engine; is this something I need to worry about?

Vérace
  • 29,825
  • 9
  • 70
  • 84
Michael McGowan
  • 789
  • 2
  • 10
  • 20

7 Answers7

30

Looking at the feature availability list at http://dev.mysql.com/doc/refman/5.1/en/memory-storage-engine.html two possible problems jump out:

  1. No transaction or FK support, meaning you will have to manage transactional integrity and referential integrity in your own code were needed (which could end up being a lot less efficient than letting the DB do this for you, though that very much depends on your app's expected behaviour patterns).
  2. Table level locking only: this could be a significant barrier to scalability if your app needs multiple concurrent writers to the same set of tables or in cases where your read operations use locks to ensure consistent data is read - in such cases a disk based table that supports much finer lock granularity will perform far better if enough of its content is currently cached in RAM.

Other than that, assuming you have enough RAM, a memory based table should be faster than a disk based one. Obviously you need to factor in taking snapshots to disk to address the issue of what happens when the server instance is reset, which is likely to completely negate the performance benefit overall if the data needs capturing often (if you can live with losing a day of data in such an instance you could just take a backup once per day, but in most cases that would not be acceptable).

An alternative might be to:

  1. Use disk based tables, but ensure that you have more than enough RAM to hold them all in RAM at any given time (and "enough RAM" might be more than you think as you need to account for any other processes on the machine, OS IO buffers/cache and so forth)
  2. Scan the entire contents (all data and index pages) of the table on each startup to preload the content into memory with SELECT * FROM <table> ORDER BY <pkey fields> for each table followed by SELECT <indexed fields> FROM <table> ORDER BY <index fields> for each index

This way all your data is in RAM, you only have to worry about I/O performance for write operations. If your app's common working set is much smaller than the whole DB (which it usually the case - in most applications most users will only be looking at the most recent data most if the time) you might be better of being more selective about how much you scan to preload into memory, allowing the rest to be loaded from disk on demand.

David Spillett
  • 32,103
  • 3
  • 49
  • 89
  • Why would you need transactional integrity for an in-memory database? If the power goes off, you are losing everything anyway. – osa Dec 06 '14 at 06:33
  • @osa: assuming it allows concurrent access rather than serialising everything, you are going to want some form of integrity management for that. – David Spillett Dec 06 '14 at 21:27
  • mysql on a ram disk is a horrible idea.
  • as soon as you have a power outage you'll have a permanent database error. Mysql since years is built so poor that it's hard to impossible to recover from a lost table, the only way to recover from a reboot would be to completely reconstruct the database in a new directory.

    1. Scanning the entire content can help with performance but only with READ performance. If you have a write-heavy table that doesn't help, it's going to clog up your disk IO anyway.
    – John Apr 04 '21 at 20:40