More than one person on the Berkeley DB team visits Reddit daily so it's nice to see that part of what makes Reddit work is our product. MemcacheDB (built on Berkeley DB using it's high availability (HA) feature to replicate cached data between nodes) is a natural fit for your permacache (key/value cache as a database of record) and it seems like it's working fairly well.
Scalability, performance, zero-downtime are major targets for our work. We'd be happy to work with you and the engineers behind MemcacheDB to make sure you're getting everything you need from Berkeley DB. Maybe we can shrink your storage requirements by using compression? Maybe we can help scale using partitioning? Maybe we can work to reduce or eliminate the 71 minutes it took to upgrade to a new MemcacheDB layout. Berkeley DB/HA supports live migration now (even across different endian systems). MemcacheDB needs to add this as a feature, good thing we're friends with Steve Chu. While we're at it maybe we could optimize the log write thread (the trickle call) for cloud-based storage so as to avoid the lag you experienced to begin with.
We really like having a challenge, sounds like you've got one, we're here to help.