
random read access may raise "file not found" #78

Open

jiangjianping opened this issue Jun 28, 2017 · 39 comments

@jiangjianping

I put some .so libs in a compressed archive and let my program load them from the archive. Sometimes the program encounters: dl-reloc.c: No such file or directory.

@hasse69
Owner

hasse69 commented Jun 28, 2017

Interesting use-case indeed. I quickly checked dl-error.c in glibc but it was not obvious what would cause this error to be thrown. Actually, the error message itself looks a bit strange too, since specific files are usually not mentioned like this. Anyway, in order to understand what is going on I would need better traces from both glibc (like an strace) and a FUSE log from rar2fs mounted in the foreground using -f.
But without being a party pooper here, random access into compressed archives may or may not work reliably, since data in such archives is streamed through libunrar. My guess is that this would work just fine if you did not compress the archive.

@jiangjianping
Author

@hasse69
You are absolutely right! 1. There has been no such error with an uncompressed archive so far. 2. When the error occurs, there are lookup and read operations on that file. Maybe this file is larger? How can I do stress testing with more debug info?

@jiangjianping
Author

@hasse69
I encountered another problem: when I tried to import a .pyc file in rarfs from a Python program, it told me RuntimeError: Bad code object in .pyc file

@hasse69
Owner

hasse69 commented Jun 29, 2017

I encountered another problem: when I tried to import a .pyc file in rarfs from a Python program, it told me RuntimeError: Bad code object in .pyc file

The problem is most likely the same, just a different signature.

The RAR compression algorithm is not made to support random access patterns. It is built on a serialized technology in the form of fixed blocks. However, there are a few things you could try to work around this, depending on the size of the files you need to extract. You most likely also need a patch in rar2fs to avoid the detection of a probable video indexing, which would force a dummy read. But for that I need to know your exact version of rar2fs and also whether you are building rar2fs yourself or picking it up from some pre-built package.

@jiangjianping
Author

@hasse69
Thank you. I downloaded rar2fs from GitHub and built it myself. Another issue is that dlopen("xxx.so") gives a "bus error". I wonder: 1. Is file seek OK? 2. Is reading an expected amount of data from a file handle OK?

@hasse69
Owner

hasse69 commented Jun 30, 2017

You can try this patch on master/HEAD.

But you need to tweak your I/O buffer for this to work. By default the history size is 50% of the I/O buffer; you should leave it at that and only focus on the actual I/O buffer itself. You need to set it to at least twice the uncompressed size of the largest file in your archive.

E.g., if your largest file (uncompressed) is 3.5 MiB you need to set your I/O buffer to 8 (--iob-size=8), which is the closest power of two. That should allow a far seek to be accepted by rar2fs. But be aware that this also means all the data needs to be extracted up to the offset you seek to. That might take time (depending on the size of the file), and whether this works or not is really up to the application using it rather than rar2fs.
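As a rough illustration, the sizing rule boils down to rounding twice the largest uncompressed file size up to the next power of two in MiB (a minimal sketch; the helper name and the example size are only illustrative):

```python
# Minimal sketch of the --iob-size rule described above (helper name and
# example size are illustrative only): round twice the largest uncompressed
# file size up to the next power of two, expressed in MiB.
def iob_size_mib(largest_file_bytes):
    needed_mib = 2 * largest_file_bytes / (1024 * 1024)
    size_mib = 1
    while size_mib < needed_mib:
        size_mib *= 2
    return size_mib

# A 3.5 MiB file needs at least 7 MiB of buffer -> --iob-size=8
print(iob_size_mib(int(3.5 * 1024 * 1024)))  # 8
```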

patch1.txt

@jiangjianping
Author

@hasse69

I will try, thank you.

@jiangjianping
Author

@hasse69
I tried; there was some progress in my environment, but I found some rar2fs processes that were defunct.

@hasse69
Owner

hasse69 commented Jun 30, 2017

I think you need to explain in more detail what you see. And also provide the command line arguments you gave to rar2fs etc.

@hasse69
Owner

hasse69 commented Jun 30, 2017

Btw, it is expected to get 'defunct' processes if your I/O buffer is large and RAR extraction to it completes before the file is actually closed by the application. My guess here is that the file is still open while you see these zombies because your application has not yet closed it.

@jiangjianping
Author

@hasse69
When processes went defunct, I/O errors were encountered: three defuncts with three I/O errors on three big files (~3 MB). Then I adjusted the I/O buffer to 8 (MiB); no more defuncts, but I got a bus error (core dumped).

@hasse69
Owner

hasse69 commented Jun 30, 2017

Again: the defunct processes are there because the application using the file has not closed it properly (the I/O error is due to the I/O buffer not being big enough).
The bus error needs to be investigated. Please use gdb to analyze the core dump and try to provide a stack trace.

@jiangjianping
Author

@hasse69
I increased the I/O buffer, the bus error disappeared, and my program can be started. My main Python program imports many modules and .so libs, and these files are not closed while it is running. I saw many rar2fs defunct subprocesses until my program exited. You mean defunct processes are normal? How can I keep the number of defuncts small, or even zero?

@jiangjianping
Author

@hasse69

I need your help. 1. How can I keep most of the files in the memory cache? 2. How can I decrease the number of defuncts? I have about 400 defunct subprocesses.

@hasse69
Owner

hasse69 commented Jun 30, 2017

Let me get back to your questions later, but first try this patch instead.

patch2.txt

@jiangjianping
Author

@hasse69
You are great! The defuncts disappeared and my program seemed to be working normally. I will do more testing, thank you again.

@jiangjianping
Author

@hasse69

How can I improve read performance/speed? Is it possible to cache all files when there is enough memory?

@hasse69
Owner

hasse69 commented Jul 1, 2017

I increased the I/O buffer, the bus error disappeared, and my program can be started.

Can you elaborate a bit on this? Increased, how? A bus error cannot just disappear, so it would be interesting to understand why you got it in the first place. Was it rar2fs that caught the bus error, or the application?

How can I decrease the number of defuncts? I have about 400 defunct subprocesses.

What I did in the last patch was set SIGCHLD so that no wait is required after a child terminates. Since, by design, only a close will wait for the child, a lack of close calls will obviously increase the number of zombies. As I said before, this is really expected, but in your particular use-case not really optimal.
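In Python terms the mechanism is roughly the following (a sketch of the idea only; the actual change lives in the rar2fs C code):

```python
# Sketch of the idea only (the real patch is in rar2fs's C code): with the
# SIGCHLD disposition set to SIG_IGN, POSIX reaps terminated children
# automatically, so they never linger as <defunct> waiting for a wait().
import os
import signal

signal.signal(signal.SIGCHLD, signal.SIG_IGN)

if os.fork() == 0:   # child: exit immediately
    os._exit(0)
# parent continues; the exited child is reaped by the kernel, no zombie
```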

How can I improve read performance/speed? Is it possible to cache all files when there is enough memory?

This is very hard to answer since I am not even sure what OS you are running!
But if we assume you are using Linux, there is currently nothing I am aware of that should affect read performance negatively. Once a part of the file has been read it should populate the page/in-memory cache, and subsequent reads should be picked from the cache rather than going through rar2fs. There is obviously a penalty for the first access(es) since it will need to extract that part of the file to the I/O buffer (unless the offsets are really small). If your application is only opening the file once, everything would be in either the I/O buffer or the page cache. The only things that would prevent the page cache from being used directly are: many small offset reads not yet placed in the page cache, direct I/O mode which bypasses the cache, or the cache being invalidated for some reason between read accesses. Neither of the two latter should apply in your case though. Why do you suspect performance is affected? To give a definitive answer here we need to know more about the access pattern your application is applying towards the files inside the compressed archive.
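To make that concrete, something like the following (assumed mount point and file name) pulls the whole file through rar2fs once and leaves it in the page cache for later readers:

```python
# Assumed mount point and file name: one full sequential read forces
# extraction into the rar2fs I/O buffer and populates the kernel page
# cache, so later reads -- even from other processes -- can be served
# from the cache without going through rar2fs again.
with open("/mnt/rar/pkg/module.so", "rb") as f:
    data = f.read()
print(len(data))
```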

There is also something you need to understand here. A large I/O buffer gives rar2fs a chance to actually extract the entire file to RAM, but it is also going to waste RAM, and possibly a lot of it too! Every open() call made will allocate a new buffer of the specified size, used or not. So if you have many files there will be RAM wasted for sure. Add to that the page cache, which will also consume memory resources. Your use-case is somewhat different from regular usage of rar2fs. Why do you keep .so files in a compressed RAR archive in the first place? I think I need to understand more about your setup and the rationale behind your use-case to better explain what the current limitations of rar2fs are, if any.

@jiangjianping
Author

@hasse69
Thanks for your great response. First, what I am testing is putting many Python files into a RAR archive in an Ubuntu Linux server environment. My purpose is to simplify Python module installation, so an upgrade is just one big archive file. Second, the bus error was raised in my testing program. After I adjusted the iobuf from 8 to 16 (one of the files is bigger than 8 MB), it disappeared. Third, regarding read performance: the Python program I am testing is multi-process, each process is also multi-threaded, and most of the modules (including .pyc and .so) it imports are located in the RAR archive. I want to confirm: 1. After the first process has finished importing modules, the Python modules in the RAR archive should have been extracted and I/O buffered, so the subsequent cloned subprocesses will import/read these files from the I/O buffer, is that so? 2. When will the files cached in the I/O buffer be swapped?

Best Regards

@hasse69
Owner

hasse69 commented Jul 2, 2017

  1. After the first process has finished importing modules, the Python modules in the RAR archive should have been extracted and I/O buffered, so the subsequent cloned subprocesses will import/read these files from the I/O buffer, is that so?

Maybe, maybe not. An I/O buffer is connected to a file descriptor returned by an open() call. Multiple calls to open() equal multiple I/O buffers. You could otherwise imagine what would happen if the I/O buffer was shared: one access could trash the buffer for others, rendering it completely useless.
Even if you "import" something from your test application it is impossible to say whether that will populate the entire I/O buffer or not. It all depends on how far into the file (offset) you read. Open on its own does not buffer anything; it is read requests that do that. But rar2fs will always fill the I/O buffer as much as possible at each read in the background, so if your buffer is big enough the entire file should be extracted almost immediately. Then comes the page cache! If you have multiple processes opening a file, each will allocate an I/O buffer, but if the page cache is already in play, multiple processes accessing the file will not even enter rar2fs. However, rar2fs has no control over the underlying cache mechanism and cannot know this, thus the I/O buffer must always be allocated even if it is potentially never used.
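Schematically (hypothetical path), the situation looks like this:

```python
# Hypothetical path: each open() makes rar2fs allocate its own I/O buffer
# for that descriptor, regardless of what earlier opens already extracted.
path = "/mnt/rar/pkg/module.so"

f1 = open(path, "rb")   # buffer #1 allocated
f2 = open(path, "rb")   # buffer #2 allocated, even if #1 holds the whole file
f1.close()              # close/RELEASE: buffer #1 freed
f2.close()              # close/RELEASE: buffer #2 freed
```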

  2. When will the files cached in the I/O buffer be swapped?

That one is simple :) When you run out of memory, the memory occupied by the I/O buffer is safe for the kernel to swap out, and there is a backing store available for it to do so. Again, this is nothing that rar2fs has control over; it is OS/MMU specific.

@hasse69
Owner

hasse69 commented Jul 2, 2017

Btw, I need to look deeper into how to publish my last few changes and if they in any way could affect other more common use cases negatively. So an official patch will not be available for a while.

@jiangjianping
Author

@hasse69
Thanks! Another question: my uncompressed RAR archive is only ~200 MB, so why does the rar2fs main process consume about 6 GB resident memory and 12 GB virtual? And when does rar2fs spawn subprocesses? (I see there are three subprocesses.)

@jiangjianping
Author

@hasse69

The rar2fs memory usage is increasing continually: 11 GB resident and 18 GB virtual.

@hasse69
Owner

hasse69 commented Jul 2, 2017 via email

@jiangjianping
Author

@hasse69
I do not understand: the total file size in the uncompressed archive is below 200 MB, so why are the I/O buffer sizes so big? The memory was not released even after my program exited.

@jiangjianping
Author

@hasse69
I counted the lsof output: there are about 180,000 entries related to rar2fs, most of them pipes. There are about 2,000 entries related to my program (python).

@hasse69
Owner

hasse69 commented Jul 2, 2017 via email

@hasse69
Owner

hasse69 commented Jul 2, 2017 via email

@jiangjianping
Author

@hasse69
I will try. Which is the iobuf release call? I checked iobuffer.c; it seems there is no release function?

@hasse69
Owner

hasse69 commented Jul 2, 2017

There is no release call on the I/O buffer. The fuse "RELEASE" is what is called in rar2fs when a file is closed. It will perform free() on the buffer.

unique: 54, opcode: RELEASE (18), nodeid: 2, insize: 64, pid: 0
release[139654955338896] flags: 0x8000
   unique: 54, success, outsize: 16

@hasse69
Owner

hasse69 commented Jul 2, 2017

You say that resident memory is not released even after your application exits? Is that also true for the lsof entries? Do an 'lsof' after mounting but before accessing anything, then compare against the number of entries after your program exits.

> lsof | grep rar2fs | wc -l

@jiangjianping
Author

@hasse69

Yes, I see. Is it possible to allocate the I/O buffer size based on the size of the individual file, rather than the fixed iobuf setting?

@hasse69
Owner

hasse69 commented Jul 2, 2017

Yes, it is possible to introduce a somewhat more clever allocation, but it would still need to be based on some specified max limit. It is not feasible to allocate simply based on the file size since it might be several gigabytes! And I do not understand how it would help in any way; it will not make the memory and open file descriptor leak go away! Did you check whether lsof also shows a leak after your application terminates?
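Such an allocation policy could look roughly like this (a hypothetical sketch, not rar2fs code):

```python
# Hypothetical sketch, not rar2fs code: size the per-file buffer from the
# uncompressed file size, rounded up to a power of two, but never beyond
# the configured --iob-size ceiling.
def buffer_bytes(file_size_bytes, iob_limit_bytes):
    size = 1
    while size < file_size_bytes:
        size *= 2
    return min(size, iob_limit_bytes)

print(buffer_bytes(3_500_000, 8 * 1024 * 1024))   # -> 4194304 (4 MiB)
```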

@jiangjianping
Author

@hasse69
I did a simple test, counting the lsof entries for my program (python) and for rar2fs.

  1. Before mounting
    rar2fs: 0, python: 40
  2. Mount the RAR archive with -d
    rar2fs: 108, python: 40
  3. Start my Python main program
    rar2fs: 311, python: 161. At this point:
    a. The Python modules are opened and read; after the read I can see RELEASE in the rar2fs console.
    b. There are many lookups besides those modules; Python will try .so and .pyc sequentially.
  4. Stop my Python main program
    rar2fs: 162, python: 40

@hasse69
Owner

hasse69 commented Jul 2, 2017 via email

@hasse69
Owner

hasse69 commented Jul 3, 2017 via email

@hasse69
Owner

hasse69 commented Jul 23, 2017

I am labeling this as an enhancement since the RAR compression algorithm is not made for random access. Using a more clever I/O buffer allocation and using the proper settings at mount time would/should however make it possible at least for smaller files.

@hasse69 hasse69 changed the title radom read access may raise "file not found" random read access may raise "file not found" Sep 22, 2017
@hasse69 hasse69 added this to To do in Backlog Oct 10, 2019
@hasse69 hasse69 moved this from To do to On hold in Backlog Nov 6, 2019
@flux242

flux242 commented Mar 25, 2020

I cloned the sources today and built rar2fs. If I try to open a PDF file inside a mounted RAR file, zathura won't open it, saying 'Document does not contain any pages'. If I copy the PDF file to a regular file system it can be opened by zathura. I believe this is the same problem.

Additionally, I put the same PDF into a zip archive, which was also FUSE-mounted using archivemount, and zathura opened it normally. So it really is rar2fs that produces this strange problem.

@hasse69
Owner

hasse69 commented Mar 25, 2020

@flux242 can you please file a new issue report, since I am not really convinced it is related to this very specific use-case. Thanks.

Also, if possible, try to attach the problematic archive so that I can try it myself.
