
Too many open files (regression?) #6237

Closed
koalalorenzo opened this issue Apr 19, 2019 · 45 comments · Fixed by libp2p/go-libp2p-circuit#76 or #6361
Labels
kind/bug (A bug in existing code, including security flaws), P0 (Critical: Tackled by core team ASAP), regression

Comments

@koalalorenzo
Member

koalalorenzo commented Apr 19, 2019

Version information:

go-ipfs version: 0.4.20-
Repo version: 7
System version: amd64/linux
Golang version: go1.12.4

Type: Bug(?)

Maybe a bug, maybe a misconfiguration after upgrading.

Description:

I have been hitting this issue on a set of machines of different types, with different networks and different configurations. All of the nodes provide a public writable gateway (https://siderus.io/), and all of them reported this:

Apr 19 02:33:06 usa-valley2 ipfs[3329]: 02:33:06.470 ERROR     flatfs: too many open files, retrying in 100ms flatfs.go:385
Apr 19 07:22:40 apac-tokyo1 ipfs[533]: 2019/04/19 07:22:40 http: Accept error: accept tcp4 127.0.0.1:8080: accept4: too many open files; retrying in 5ms
Apr 19 09:25:33 eu-stockholm1 ipfs[15101]: 2019/04/19 09:25:33 http: Accept error: accept tcp4 127.0.0.1:8080: accept4: too many open files; retrying in 320ms
Apr 19 09:25:33 eu-stockholm1 ipfs[15101]: 09:25:33.959 ERROR     flatfs: too many open files, retrying in 100ms flatfs.go:385

I found this issue after upgrading. The upgrade was executed automatically (no human intervention), and all of the machines are similar except for network, CPU, and memory, which vary based on average traffic. Since this version we stopped using Docker and started using our own Debian packages, but the previous tests were successful (12 hours of A/B testing). The issue appeared recently, and it seems it might be caused by a set of user requests combined with a misconfiguration (ulimit?).

Maybe related #4102 #3792 (?)

Is this a bug or should I change the limits? How can I ensure that IPFS respects that limit?

@koalalorenzo koalalorenzo changed the title from "Too many open files (regression?)" to "Too many open files (regression or bad conf?)" Apr 19, 2019
@Stebalien
Member

Could you cat /proc/$(pidof ipfs)/limits and post the output of ipfs swarm peers | wc -l? Go-ipfs automatically raises its limits, but probably not enough to be run as a public gateway directly, assuming that's what you're doing.

Note: You may also want to consider sticking an nginx reverse proxy out in front. That should reduce the number of inbound connections.
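(For readers following along: a minimal sketch of how the open-files limit mentioned here is usually checked and raised for a systemd-managed daemon. The unit name ipfs.service and the 65536 value are assumptions for illustration, not something stated in this thread.)

    # What limit did the running daemon actually get?
    grep 'Max open files' /proc/$(pidof ipfs)/limits

    # For a systemd-managed daemon, raise it with a drop-in override:
    sudo systemctl edit ipfs.service
    #   [Service]
    #   LimitNOFILE=65536
    sudo systemctl restart ipfs.service

    # For a daemon started from a shell, raise the soft limit first:
    ulimit -n 65536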

@Stebalien Stebalien added the kind/bug (A bug in existing code, including security flaws) and kind/support (A question or request for support) labels Apr 19, 2019
@koalalorenzo
Member Author

koalalorenzo commented Apr 20, 2019

Note: You may also want to consider sticking an nginx reverse proxy out in front. That should reduce the number of inbound connections.

Sorry for my wording. There is a reverse proxy and a Layer 7 load balancer in front of that for HTTP. I would never leave something as unstable as go-ipfs without an HA setup.

The HighWater and LowWater values were previously increased (with Docker and 0.4.19). Since I moved to 0.4.20 I have used the default values, only enabling the writable gateway and changing the GC and StorageMax settings.

Both of these nodes are slow or unresponsive (but come back after a while). Another node needed a forced restart because IPFS made it unresponsive, so I was not able to get its data.

Peer eu-stockholm1

Could you cat /proc/$(pidof ipfs)/limits

Limit                     Soft Limit           Hard Limit           Units
Max cpu time              unlimited            unlimited            seconds
Max file size             unlimited            unlimited            bytes
Max data size             unlimited            unlimited            bytes
Max stack size            8388608              unlimited            bytes
Max core file size        0                    unlimited            bytes
Max resident set          unlimited            unlimited            bytes
Max processes             7664                 7664                 processes
Max open files            2048                 2048                 files
Max locked memory         16777216             16777216             bytes
Max address space         unlimited            unlimited            bytes
Max file locks            unlimited            unlimited            locks
Max pending signals       7664                 7664                 signals
Max msgqueue size         819200               819200               bytes
Max nice priority         0                    0
Max realtime priority     0                    0
Max realtime timeout      unlimited            unlimited            us

The output of ipfs swarm peers | wc -l?

2410 peers

Peer apac-tokyo1

Could you cat /proc/$(pidof ipfs)/limits

Limit                     Soft Limit           Hard Limit           Units
Max cpu time              unlimited            unlimited            seconds
Max file size             unlimited            unlimited            bytes
Max data size             unlimited            unlimited            bytes
Max stack size            8388608              unlimited            bytes
Max core file size        0                    unlimited            bytes
Max resident set          unlimited            unlimited            bytes
Max processes             3700                 3700                 processes
Max open files            2048                 2048                 files
Max locked memory         16777216             16777216             bytes
Max address space         unlimited            unlimited            bytes
Max file locks            unlimited            unlimited            locks
Max pending signals       3700                 3700                 signals
Max msgqueue size         819200               819200               bytes
Max nice priority         0                    0
Max realtime priority     0                    0
Max realtime timeout      unlimited            unlimited            us

The output of ipfs swarm peers | wc -l?

698 peers

@Stebalien
Member

Stebalien commented Apr 26, 2019

eu-stockholm1 definitely has too many peers (although some of those are obviously using non-fd-consuming transports).

I have had reports that leveldb is consuming a lot of file descriptors, and this could be due to our new on-disk provider queue (which uses leveldb). Edit: this is really unlikely.

@obo20

obo20 commented Apr 26, 2019

@Stebalien I'm also running into this as of 0.4.20. We had roughly 800 peers connected to the node when I encountered the issue, with the following limits:

Max cpu time              unlimited            unlimited            seconds   
Max file size             unlimited            unlimited            bytes     
Max data size             unlimited            unlimited            bytes     
Max stack size            8388608              unlimited            bytes     
Max core file size        0                    unlimited            bytes     
Max resident set          unlimited            unlimited            bytes     
Max processes             64115                64115                processes 
Max open files            2048                 2048                 files     
Max locked memory         65536                65536                bytes     
Max address space         unlimited            unlimited            bytes     
Max file locks            unlimited            unlimited            locks     
Max pending signals       64115                64115                signals   
Max msgqueue size         819200               819200               bytes     
Max nice priority         0                    0                    
Max realtime priority     0                    0                    
Max realtime timeout      unlimited            unlimited            us  

It's probably worth noting that I had never encountered this problem before 0.4.20 (although this could be entirely coincidental, as quite a few bug reports of this already exist for earlier versions). However, it does make me wonder whether the updates you mentioned in the relay section of the 0.4.20 release notes had anything to do with this:

  • We've significantly reduced the amount of memory allocated per-peer.
  • We've fixed a bug where relays might, in rare cases, try to actively dial a
    peer to relay traffic. By default, relays only forward traffic between peers
    already connected to the relay.
  • We've fixed quite a number of performance issues that only show up when
    rapidly forming new connections. This will actually help all nodes but will
    especially help relays.

@koalalorenzo koalalorenzo changed the title from "Too many open files (regression or bad conf?)" back to "Too many open files (regression?)" Apr 26, 2019
@Stebalien
Member

The next time someone runs into this, please run sudo lsof -n -P -p $(pidof ipfs).
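(A quick way to make sense of that lsof dump, for anyone following along: group the output by its TYPE column to see whether the descriptors are mostly sockets or on-disk files. The column position assumes the standard lsof output format, and lsof also lists memory-mapped files, so the totals can overcount actual fds.)

    # count descriptors by type (REG = regular files; IPv4/IPv6/unix = sockets)
    sudo lsof -n -P -p $(pidof ipfs) | awk 'NR > 1 {print $5}' | sort | uniq -c | sort -rn

    # total, to compare against "Max open files" in /proc/$(pidof ipfs)/limits
    sudo lsof -n -P -p $(pidof ipfs) | wc -l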

@Stebalien Stebalien added the regression and P0 (Critical: Tackled by core team ASAP) labels and removed the kind/support (A question or request for support) label Apr 26, 2019
@koalalorenzo
Member Author

koalalorenzo commented Apr 27, 2019

Still running into this daily, so I can help as much as I can :)

Et voila: https://siderus.io/ipfs/QmeHWnYK5VpQvfFWV4wPgujB3PwAnNS5TR8iUWfDkghqnd

(Those are other affected nodes; if you want the same data as before, let me know.)

@hsanjuan
Contributor

I'm also observing these errors, particularly when accepting new connections. I think this causes the affected listeners to shut down.

https://ipfs.io/ipfs/QmX4A8DzvxeR1uC7xoTCTQPbynsHAD64gdsSCm1VPc8brC

I am going to try running with a much larger FD limit and see whether it hits that too.

@Stebalien
Member

@koalalorenzo what are your connection manager limits?

Both of you, does downgrading help?

@koalalorenzo
Member Author

koalalorenzo commented Apr 27, 2019

My connection manager config is the default one (never touched since I generated the config file):

"ConnMgr": {
      "Type": "basic",
      "LowWater": 600,
      "HighWater": 900,
      "GracePeriod": "20s"
}

I know that @obo20 downgraded to fix the issue. I will downgrade a percentage of the machines so that I can help debug this if needed. Let me know what I can do to help!
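(For anyone comparing their own setup: the watermarks can be read back from the config and set against the live connection count. A small sketch; the exact output layout depends on the go-ipfs version.)

    ipfs config show | grep -A 5 '"ConnMgr"'   # configured LowWater / HighWater / GracePeriod
    ipfs swarm peers | wc -l                   # live connections, to compare against HighWater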

@hsanjuan
Contributor

The FD limit was 4096 before; I set a new limit of 16k. Right now, after 24h or so, we have:

ready (5)> lsof -n -P -p $(pidof ipfs) | wc -l
cluster2.fsn.dwebops.pub : 5781
cluster3.fsn.dwebops.pub : 5326
cluster4.fsn.dwebops.pub : 5837
cluster1.fsn.dwebops.pub : 6053
cluster0.fsn.dwebops.pub : 6471

I will report back on whether the figure increases over the next hours. The ConnMgr is at:

    "ConnMgr": {
      "Type": "basic",
      "LowWater": 1500,
      "HighWater": 2000,
      "GracePeriod": "1m"
    }
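(A sketch of how such a per-node figure can be tracked over time without lsof, assuming the shell user can read the daemon's /proc entry; run it alongside the daemon and plot the log afterwards.)

    # append a timestamped open-fd count every 60 seconds
    while true; do
      echo "$(date -u +%FT%TZ) $(ls /proc/$(pidof ipfs)/fd | wc -l)"
      sleep 60
    done >> ipfs-fd-count.log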

@obo20

obo20 commented Apr 29, 2019

@hsanjuan Can you provide the specs of the machines that you're using?

I'd also be curious to know how many "root" objects you have on those machines.

@hsanjuan
Contributor

@obo20 12 cores, 64GB RAM, root objects ranging from 3000 to 6000.

@koalalorenzo
Member Author

koalalorenzo commented Apr 29, 2019

@obo20 12 cores, 64GB RAM, root objects ranging from 3000 to 6000.

@hsanjuan Have you tried on smaller machines? Something like 2 cores and 2-4GB of RAM, more in the "consumer/desktop/notebook" range of machine sizes?

@obo20

obo20 commented Apr 29, 2019

@hsanjuan Have you tried on smaller machines? Something like 2 cores and 2-4GB of RAM, more in the "consumer/desktop/notebook" range of machine sizes?

I'd like to know this as well. While resources are always important, I'm curious to know how much of a role they play here. From inspecting my nodes after they hit this issue, they haven't seemed maxed out from a resource standpoint at all.


@hsanjuan
Contributor

hsanjuan commented May 2, 2019

@hsanjuan Have you tried on smaller machines? Something like 2 cores and 2-4GB of RAM, more in the "consumer/desktop/notebook" range of machine sizes?

I'd like to know this as well. While resources are always important, I'm curious to know how much of a role they play here. From inspecting my nodes after they hit this issue, they haven't seemed maxed out from a resource standpoint at all.

I have not tried with "normal" machines (or not for long enough). I can say that the file descriptors are stable at around 5000-6000 after a couple of days on the cluster. About 60% of them are badger files and the rest are swarm connections.

@koalalorenzo
Member Author

koalalorenzo commented May 2, 2019

@hsanjuan Thx! Sadly I can't ask Orion users to use 64GB of RAM.

I will wait for a fix before upgrading Orion to v0.4.2x. I guess that might impact the user experience, if the issue persists.

On the siderus.io gateway I have found a workaround, but the problem persists in the logs. Any ETA on the fix? I will build master and see how it performs.

@hsanjuan
Contributor

hsanjuan commented May 2, 2019

@koalalorenzo The 64GB of RAM is there because those machines handle 2TB+ of repository size, but aside from some spikes, half of the RAM is unused.

The workaround here is to either increase the FD limit from the default (see the post above), reduce the ConnManager limits so there are fewer connections, or play with the badger configuration so it opens fewer files (I haven't tried this, but I think the ipfs config allows tweaking all the badger options).

I'm not sure at this point whether the increased FD usage is related to just BadgerDS, just connections, or both (maybe someone can verify against 0.4.19).
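(Putting the first two workarounds above into concrete commands, as a hedged sketch: the ConnMgr values below are arbitrary examples, and IPFS_FD_MAX is, to my understanding, the environment variable go-ipfs consults when raising its own descriptor limit; verify both against the docs for your version, and restart the daemon after config changes.)

    # reduce the connection manager watermarks
    ipfs config --json Swarm.ConnMgr.LowWater 300
    ipfs config --json Swarm.ConnMgr.HighWater 500
    ipfs config Swarm.ConnMgr.GracePeriod 20s

    # ask the daemon to raise its own fd limit above the default
    IPFS_FD_MAX=8192 ipfs daemon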

@Stebalien
Member

Fix still in progress, unfortunately. See the discussion on libp2p/go-libp2p-circuit#76.

Stebalien added a commit that referenced this issue May 21, 2019
@Stebalien
Member

@dokterbob could you try building #6361? I've been running this for a while on one of our bootstrappers and it appears to be pretty stable.

vyzo pushed a commit to libp2p/go-libp2p-circuit that referenced this issue May 22, 2019
May fix ipfs/kubo#6237

Basically,

1. We hang while closing a stream (because `Close` waits).
2. This blocks the connection manager because it assumes that close _doesn't_ wait.

This may also fix a stream leak.
Stebalien added a commit that referenced this issue May 22, 2019
* Write coalescing in yamux and mplex.
* Correctly tag relay streams.
* Reset relay streams instead of closing them. Fixes #6237.
* Trim connections in a background task instead of on-connect.
@Stebalien Stebalien reopened this May 22, 2019
@Stebalien
Member

Reopening while waiting on confirmation. The latest master should now have all the required fixes.

@koalalorenzo
Member Author

Will build & check it tomorrow! Thx for the update

@Stebalien
Member

@koalalorenzo and @obo20, please use #6368 and not master until I get some reviews on the libp2p patches and merge them into master.

TL;DR: There has been a bug in libp2p since time immemorial. We fixed it this release cycle (#6368), but then introduced a feature that triggers this bug. The branch I linked to above removes this feature.

@Stebalien
Member

The RC3 (#6237) contains the fix. Any updates?

@koalalorenzo
Member Author

3 hours of running on 30% of the fleet. Still monitoring; nothing has happened so far. I will wait 24h to be sure :)

@obo20

obo20 commented May 25, 2019

@Stebalien Running on a few of our nodes right now. Things seem to be working well so far.

@Stebalien
Member

Awesome! Let me know if anything changes. We'll probably ship a final release Monday.

@graylan0

I have been running it for the past day, as have some friends who set it up on their nodes. I average around 850-900 peers, and they average around the same. RAM usage is also much lower (about 5x less).

@Stebalien
Member

Ok. Considering this closed, let the release train roll...

@Stebalien
Member

Also, a big thank you to everyone in this thread for helping debug this!

@sanderpick
Contributor

Looking good here too! Thank you @Stebalien

@dokterbob
Contributor

For future reference (I'm about to upgrade to the rc1 now).
[attached screenshot: memory usage graph]

Memory usage is just IPFS.

@Stebalien
Member

Stebalien commented May 27, 2019

@dokterbob please upgrade to the RC3, not the RC1. The RC1 doesn't include the fix for this bug.

marten-seemann pushed a commit to libp2p/go-libp2p that referenced this issue Jul 1, 2022