About Reddit, Lemmy and self-hosting it on Kubernetes

belidzs@fost.hu to Selfhosted@lemmy.world – 23 points –
tech.belidzs.hu
16

You are viewing a single comment

load balancing is automatic between pods thanks to Services: https://kubernetes.io/docs/concepts/services-networking/

I also use kubernetes to run my Lemmy instance. Sadly, pictrs uses their own "database" file which can only be opened by a single pod because it refuse to run if the "database" lock is already acquired by another pod, making scaling up the number of pods impossible. I wish they use postgres instead of inventing their own database. I suspect this is one of the reasons why those large Lemmy instances have difficulty scaling up their server.

This is a really interesting observation. Curious if the devs are aware that this breaks simple scalability efforts

You mean pictrs can't scale, or the other pods cannot as well? I separated lemmy-ui, the backend, and pictrs into different pods. Haven't tried scaling anything yet though, but I did notice the database issue with pictrs when RollingRestart, had to switch to Recreate.

Only pictrs that can't scale. Lemmy ui and backend seems to be stateless.

Great to hear, that will make it super easy if I start allowing users on my instance.

I saw that Lemmy container has scheduled jobs. How did you handled that? IDK I’m not sure about is Lemmy really “stateless”.

https://lemmy.world/post/920294

Right, that's a good point.

So far it's working quite well, however for a micro-sized instance it's no surprise. Worst case scenario I can do the same thing as the admins of lemmy.world did: create a dedicated scheduling pod using the same docker image as the normal ones, but exclude it from the Service's target, so it won't receive any incoming traffic.

The rest of the pods can then be dedicated to serve traffic with their scheduling functionality disabled.

Do they have a write up on their setup?