
Performance tuning via reduced durability #54

Open
meteficha opened this issue May 28, 2015 · 6 comments
@meteficha
Contributor

I have some test code that currently takes 582s of wall time and 51s of CPU time to complete. If I completely remove the fsync calls by editing the Unix FileIO, it takes 16s of wall time and 35s of CPU time. That's a 36-fold improvement in wall time!

I'm not suggesting we should stop calling fsync altogether, as that would rename the package to aci-state. However, many DBMSs provide knobs to trade durability for performance. For example:

  • SQLite provides three different compromises.
  • Redis allows you to never fsync, to fsync every second, or to fsync on every query.
  • PostgreSQL has many knobs: you can disable fsync entirely, or keep fsync but return query results before it completes.

I'm not proposing anything concrete that should be done, I'd just like to start a conversation about possible tradeoffs.
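To make the design space concrete, here is one way such a knob could be modeled, roughly mirroring the settings listed above. This is purely a sketch; none of these names exist in acid-state.

```haskell
-- Hypothetical durability settings; illustrative names only.
data Durability
  = FsyncEveryUpdate      -- current acid-state behaviour: full durability
  | FsyncEveryMillis Int  -- flush at most once per interval (Redis-style)
  | NoFsync               -- leave flushing entirely to the OS

-- What acknowledged data is at risk on power failure under each setting?
atRisk :: Durability -> String
atRisk FsyncEveryUpdate     = "nothing"
atRisk (FsyncEveryMillis n) = "up to the last " ++ show n ++ "ms of updates"
atRisk NoFsync              = "anything the kernel has not yet flushed"
```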

@meteficha
Contributor Author

IIUC, currently acid-state will refuse to start if the log is corrupt. So going the Redis way of fsyncing every second seems a bad idea.

I like the option of continuing user code before data is written to disk. It looked like scheduleUpdate could help me here, but it doesn't work for my use case: a query after a scheduleUpdate may not see the update at all. What I wanted was for queries to see the updated state without waiting for the update to be flushed to disk.
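The desired behaviour can be sketched with plain concurrency primitives: apply the update to the in-memory state immediately (so subsequent queries see it), and only queue the serialized event for a background writer thread to log and fsync. `Store`, `asyncUpdate`, and `query` are illustrative names, not acid-state API.

```haskell
import Control.Concurrent.Chan (Chan, newChan, writeChan)
import Control.Concurrent.MVar (MVar, newMVar, modifyMVar_, readMVar)

-- Sketch only: in-memory state plus a queue of events awaiting durability.
data Store s = Store
  { stateVar :: MVar s      -- current state, visible to queries at once
  , logChan  :: Chan String -- serialized events awaiting write + fsync
  }

asyncUpdate :: Store s -> (s -> s) -> String -> IO ()
asyncUpdate store f event = do
  modifyMVar_ (stateVar store) (return . f) -- state changes now
  writeChan (logChan store) event           -- durability happens later

query :: Store s -> IO s
query store = readMVar (stateVar store)
```

A background thread would drain `logChan`, append to the log file, and fsync in batches, so queries never wait on the disk.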

@lemmih
Member

lemmih commented May 28, 2015

We could add a 'reallyUnsafeUpdate' function which would make the result immediately available without waiting for serialization. This wouldn't work for the Remote backend, obviously. And God have mercy on your soul (and your data) if you use the function carelessly.

@meteficha
Contributor Author

Well, it's not "really unsafe"; it depends on the context.

My test code above implements a storage backend for server-side sessions. On most websites it's no big deal if the session data from the last few seconds is lost forever. However, adding hundreds of milliseconds per request due to fsync is a big no.

What about adding an option that could be set when opening the Local backend, instead of providing a different update function? The option could be named "really unsafe" without hurting the legibility of every bit of code that performs an update.

@lemmih
Member

lemmih commented May 28, 2015

Permanently losing data doesn't qualify as "really unsafe"? I hope you're not working in banking. :)

I would be willing to downgrade it from "reallyUnsafe" to just "unsafe". At first I was against adding an option to make this behavior the default, but actually that's the only approach that makes sense. An 'unsafeUpdate' function would taint your entire application, not just the thread that used it. So, we want an option for the Local backend to return results immediately without waiting for fsync, and we want a 'waitUntilMyDataIsSafe :: AcidState -> IO ()' function as well.
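One way such a wait function could work is with sequence numbers: every update gets one, and the background writer records the highest number it has fsynced. The caller blocks until everything issued before the call is on disk. None of these names exist in acid-state; this is only a sketch of the bookkeeping.

```haskell
import Control.Concurrent.MVar (MVar, newMVar, newEmptyMVar, readMVar, takeMVar)

-- Hypothetical flush bookkeeping, maintained by a background writer thread.
data FlushState = FlushState
  { issuedVar  :: MVar Integer -- highest sequence number handed out
  , flushedVar :: MVar Integer -- highest sequence number fsynced
  , flushedSig :: MVar ()      -- pulsed by the writer after each fsync
  }

-- Block until every update issued so far has been fsynced.
waitUntilMyDataIsSafe :: FlushState -> IO ()
waitUntilMyDataIsSafe fs = do
  target <- readMVar (issuedVar fs)
  let loop = do
        done <- readMVar (flushedVar fs)
        if done >= target
          then return ()                        -- our data is on disk
          else takeMVar (flushedSig fs) >> loop -- wait for the next fsync
  loop
```

With the unsafe option enabled, updates would return immediately, and callers that do care about a particular write (say, before acknowledging a payment) could call this at the point where durability matters.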

@meteficha
Contributor Author

Well, even in banking I imagine losing session data would be harmless. You could log out some logged-in users, or forget that some users had asked to be logged out. That doesn't look so bad :).

@meteficha
Contributor Author

BTW, I like your suggestion.
