Sample Header Ad - 728x90

Unix & Linux Stack Exchange

Q&A for users of Linux, FreeBSD and other Unix-like operating systems

Latest Questions

3 votes
3 answers
5816 views
Regex search in PDF reader
I am using zathura, as I enjoy its minimalist approach, but I would also switch to mupdf or anything else if this would solve my problem. I need to highlight every word (in PDF and epub documents) one by one from start to finish because I can concentrate better on the text if I have some kind of mot...
I am using zathura, as I enjoy its minimalist approach, but I would also switch to mupdf or anything else if this would solve my problem. I need to highlight every word (in PDF and epub documents) one by one from start to finish because I can concentrate better on the text if I have some kind of motion in it. My approach would have been to perform a regex search that matches every word, but neither zathura nor mupdf support regex in searches. Is there a way to do this? I would try to fork zathura but to be honest I don't really want to spend that amount of time if there is another minimal Gnu/Linux compatible document viewer that does what I need. And if there is any way to use terminal tools like pdfgrep for highlighting the results in zathura that would also do the job.
luca (152 rep)
Mar 29, 2020, 03:38 PM • Last activity: Jun 6, 2025, 09:30 AM
4 votes
1 answers
447 views
How can I verify integrity of the document types?
Are there any utilities to verify integrity of common document types used in the office environment (docx, pptx, xlsx, pdf, common image, audio and video files) without using the actual applications? Contents of some documents of ours proved to be damaged due to storage issues. So, I need to sweep a...
Are there any utilities to verify integrity of common document types used in the office environment (docx, pptx, xlsx, pdf, common image, audio and video files) without using the actual applications? Contents of some documents of ours proved to be damaged due to storage issues. So, I need to sweep across the whole filesystem to verify all the files and report which of them are actually impacted. The damaged content can be anywhere in the file, so just verifying file headers - like the *file* command does - or checking the file properties (length, dates), etc., won't be good enough. I need a tool which would open the files, verify integrity of their internal structures, and report the result either on stdout or through an exit code. A solution which would do all the job and provide a final report would do as well, of course, if it already exists - it would save me a lot of scripting work. ;-) I assume random technical issues, not a deliberate tampering - so I do not need to verify the actual contents of the documents. What I need is to identify the files which would probably fail to open in their respective apps. Looking forward to receiving your suggestions.
Jasio (634 rep)
Sep 12, 2024, 11:55 AM • Last activity: Sep 12, 2024, 12:43 PM
12 votes
1 answers
5762 views
Zathura's best-fit command ('a') for documents with varying page size
I'm usually using the wonderful document viewer 'zathura' for reading PDF-files. Typically, for some documents, the first page of the document is slightly larger than the rest (for example, a photograph of a book cover). In this case the 'best-fit' function (key 'a') of zathura always adjusts the zo...
I'm usually using the wonderful document viewer 'zathura' for reading PDF-files. Typically, for some documents, the first page of the document is slightly larger than the rest (for example, a photograph of a book cover). In this case the 'best-fit' function (key 'a') of zathura always adjusts the zoom relative to the first page of the document, implying that for the rest of the document the pages are displayed smaller than they should be. This is particularly annoying when reading documents in 'double-page mode', leading to results as terrible as the following: enter image description here **My Question:** Is there any way to change this behaviour such that the 'best-fit' function adjusts the size relative to the currently viewed page?
Peter Wildemann (221 rep)
Sep 13, 2018, 07:24 PM • Last activity: Aug 12, 2024, 07:13 PM
26 votes
6 answers
21494 views
How do I convert a '.pages' file to something unix-y?
I have a file with the extension '.pages´. I _think_ that means iWork, but I'm not sure. Anyway, searching for "convert pages linux" produces a fair few false negatives! Does anyone happen to know a way to convert it to something reasonable on Linux?
I have a file with the extension '.pages´. I _think_ that means iWork, but I'm not sure. Anyway, searching for "convert pages linux" produces a fair few false negatives! Does anyone happen to know a way to convert it to something reasonable on Linux?
Andrew Stacey (2525 rep)
May 2, 2011, 07:10 PM • Last activity: Aug 1, 2024, 01:01 PM
4 votes
3 answers
913 views
Override page numbers of a djvu document
I have a djvu scan of a book. Let's consider two cases: 1. I'd like to number the pages `0, 1, 2, ...` (usage case: the cover should get be page 0) 2. I'd like to number some pages with Roman numbers and some with Arabic numbers, for example: `i, ii, iii, ..., x, 1, 2, 3, ...` (usage case: some intr...
I have a djvu scan of a book. Let's consider two cases: 1. I'd like to number the pages 0, 1, 2, ... (usage case: the cover should get be page 0) 2. I'd like to number some pages with Roman numbers and some with Arabic numbers, for example: i, ii, iii, ..., x, 1, 2, 3, ... (usage case: some introductory pages are numbered Roman in the book) Is it possible to do it on Linux?
marmistrz (2792 rep)
Jun 17, 2015, 08:14 AM • Last activity: Nov 27, 2023, 10:05 AM
0 votes
2 answers
755 views
Simple Scan crashes almost immediately upon start-up
I'm using `simple-scan` on Debian and it is now always crashing. It worked fine a few months ago. I suspect that something got upgraded (I'm running `bullseye`) and broke things. The output from `simple-scan` says: ``` simple-scan: thread-watch.c:165: avahi_threaded_poll_quit: Assertion `pthread_equ...
I'm using simple-scan on Debian and it is now always crashing. It worked fine a few months ago. I suspect that something got upgraded (I'm running bullseye) and broke things. The output from simple-scan says:
simple-scan: thread-watch.c:165: avahi_threaded_poll_quit: Assertion `pthread_equal(pthread_self(), p->thread_id)' failed.
Abort
I tried running ldd /usr/bin/simple-scan to make sure all the packages associated with all the libraries are recent, including libc6. I also tried running with simple-scan -d, but the debug output proved useless. I also tried strace /usr/sbin/simple-scan to look at all the shared libraries that were being loaded and they all had recent versions. I noticed that the libavahi shared libraries are loaded dynamically (i.e., not visible via ldd) by simple-scan. So, I made sure that those were also recent. I also tried xsane, but it has the same problem. Perhaps unrelatedly, I see the following messages in my system log file:
avahi-daemon: Too many clients, client request failed.
scanimage does not crash, but it fails in an unrelated way. I am at my wit's end and don't know what else can be done.
roberttheprogrammer (21 rep)
Oct 14, 2022, 11:19 PM • Last activity: Jan 2, 2023, 10:13 AM
4 votes
6 answers
6940 views
How to split each page of a djvu file?
In a djvu file, it has two book pages in one djvu page. I would like to split it so that one book page per djvu page. For example,![enter image description here][1] I was wondering if this can be done by some software, preferably command line utilities? Thanks and regards! PS: This is [a file][2] th...
In a djvu file, it has two book pages in one djvu page. I would like to split it so that one book page per djvu page. For example,enter image description here I was wondering if this can be done by some software, preferably command line utilities? Thanks and regards! PS: This is a file that can be used for test.
Tim (106420 rep)
Dec 6, 2011, 05:22 PM • Last activity: Apr 27, 2022, 01:42 AM
0 votes
2 answers
66 views
What is the *nix parallel to Microsoft front page (HTML page builder)?
Just as LibreOffice is the current parallel to Microsoft Office and just as GIMP is the current parallel to Adobe Photoshop, what is the *nix parallel to Microsoft front page (HTML page builder)? Can I use some LibreOffice tool for to create HTML pages in which JavaScript and/or CSS are saved **inli...
Just as LibreOffice is the current parallel to Microsoft Office and just as GIMP is the current parallel to Adobe Photoshop, what is the *nix parallel to Microsoft front page (HTML page builder)? Can I use some LibreOffice tool for to create HTML pages in which JavaScript and/or CSS are saved **inline** or **indocument**)? ## Update Preferably, something with a "drag and drop" mechanics but not necessarily.
AFAIK, WYSIWIG lacks drag and drop mechanics (such as those of Gutenberg/Elementor).
searcher (11 rep)
Apr 22, 2022, 11:46 AM • Last activity: Apr 22, 2022, 05:32 PM
-1 votes
1 answers
300 views
Task focused command-line cheat sheet for linux
There are a lot of "Linux command-line cheat sheets" on the internet. But often they only list the commands, sometimes sort and describe them. What I am looking for is something I would call a "task based" cheat sheet, where I can "ctrl+f" for *what* I want to do and find the corresponding command....
There are a lot of "Linux command-line cheat sheets" on the internet. But often they only list the commands, sometimes sort and describe them. What I am looking for is something I would call a "task based" cheat sheet, where I can "ctrl+f" for *what* I want to do and find the corresponding command. Since beforehand I don't know *how* (i.e. the command) to perform the task. Could someone provide a link or search terms? #### Explanation: When trying to do something on the command-line, I normally use google to find a solution. Depending on the complexity of the task, this takes some (unreasonably high) effort and often combining multiple solutions. Also internet access is mandatory for this to work. I afterwards write this down in a text file and attach some search terms. Expecting to find similar files on the internet, I search for: linux task OR action OR work based cheat sheet, linux howto collection common tasks. Those don't return what I look for.
Paul Smith (263 rep)
Nov 16, 2021, 10:01 PM • Last activity: Dec 14, 2021, 10:22 PM
1 votes
1 answers
2575 views
xfce4-terminal starts in ~/Desktop instead of ~
About a week ago (maybe after an update) the starting directory of xfce4-terminal changed from `~` to `~/Documents` when launched from the panel. I'm pretty sure that wasn't because of anything I have done: there are no `cd` commands in `~/.bashrc` (and that should not be necessary) and the launcher...
About a week ago (maybe after an update) the starting directory of xfce4-terminal changed from ~ to ~/Documents when launched from the panel. I'm pretty sure that wasn't because of anything I have done: there are no cd commands in ~/.bashrc (and that should not be necessary) and the launcher did not contain anything in the field 'Working Directory' (I put $HOME there just to try, but that does not work either). $ grep cd ~/.bashrc $ grep Desktop .config/xfce4/terminal/terminalrc $ I'm runnning Ubuntu 20.04, has anyone had this same thing happen in Ubuntu?
alle_meije (579 rep)
Nov 8, 2021, 12:35 PM • Last activity: Nov 8, 2021, 04:09 PM
2 votes
1 answers
2916 views
How to read cbr in linux?
I tried mcomix to read .cbr but it says archive format not supported. Calibre isn't working either, any idea on how to open .cbr files under linux ?
I tried mcomix to read .cbr but it says archive format not supported. Calibre isn't working either, any idea on how to open .cbr files under linux ?
ChiseledAbs (2301 rep)
Sep 29, 2016, 06:51 PM • Last activity: Jul 7, 2021, 08:00 PM
21 votes
6 answers
77303 views
Linux PPTX presentation Viewer
Besides Open Office, is there any solution for opening `.pptx` presentations on Linux (Debian to be more precise)?
Besides Open Office, is there any solution for opening .pptx presentations on Linux (Debian to be more precise)?
Razvan (312 rep)
Oct 12, 2012, 08:00 PM • Last activity: Jul 22, 2020, 05:48 AM
15 votes
5 answers
7239 views
OCR on Linux systems
I have always found [OCR technology][1] to be behind on open source systems. I've also watched the [Ocropus project][2] since its infancy. I've tried what I've heard is the best OCR engine available for Linux, [Tesseract][3], and have found it woefully lacking for business documents. Are there any o...
I have always found OCR technology to be behind on open source systems. I've also watched the Ocropus project since its infancy. I've tried what I've heard is the best OCR engine available for Linux, Tesseract , and have found it woefully lacking for business documents. Are there any other more promising OCR implementations? What about the even more hopeful goal for interpreting handwriting? What is possible on *nix systems in this field?
jjclarkson (2177 rep)
Aug 16, 2010, 10:27 PM • Last activity: May 10, 2020, 07:14 AM
1 votes
2 answers
1197 views
How can I batch rename text files to be the first word in each text file?
I need to rename a few thousand text files to be the first word in the text file plus the file extension (.txt) using the terminal. All the files are in a single directory, and are currently named int the pattern "xx0001.txt' through 'xx9999.txt'. The first word in each file is of this pattern: '##2...
I need to rename a few thousand text files to be the first word in the text file plus the file extension (.txt) using the terminal. All the files are in a single directory, and are currently named int the pattern "xx0001.txt' through 'xx9999.txt'. The first word in each file is of this pattern: '##221251', so maybe my issue comes from needing to comment out the pound signs. I've tried: rename File.ReadAllText('*.txt').split(' ') Which throws the error: zsh: no matches found: File.ReadAllText(*.txt).split( ) Thanks in advance for your help. I'm a true novice.
mqh5962 (13 rep)
Apr 24, 2020, 05:43 PM • Last activity: Apr 24, 2020, 07:34 PM
22 votes
7 answers
7035 views
Light, easy-to-use presentation software
I am looking for a Linux presentation software and the main features I need are: - **Light** on dependencies and resource consumption (*I use GTK+*) - Easy to use - Align & Distribute information on slide - Uses pictures The extra features I would like are: - On-click animations - .ppt Import/export...
I am looking for a Linux presentation software and the main features I need are: - **Light** on dependencies and resource consumption (*I use GTK+*) - Easy to use - Align & Distribute information on slide - Uses pictures The extra features I would like are: - On-click animations - .ppt Import/export - Use templates/themes for easily unified presentation ---------- Sliderocket looks like a presentation tool matching my needs. Unfortunately it is an online presentation editor and I am looking for an offline one. *I don't mind the software being proprietary and non-free of charge*
danjjl (395 rep)
Aug 20, 2011, 07:58 PM • Last activity: Mar 5, 2020, 07:09 AM
4 votes
1 answers
130 views
Safely handling PDFs and other ebook formats on Linux
I'm running Arch Linux and using Okular for opening PDF files and FBReader for other ebook formats (Epub, Mobi, etc.). Simply put, here's my question: Assuming some of those documents come from unreliable sources and contain malicious code what can I do to mitigate the risk of compromising the syste...
I'm running Arch Linux and using Okular for opening PDF files and FBReader for other ebook formats (Epub, Mobi, etc.). Simply put, here's my question: Assuming some of those documents come from unreliable sources and contain malicious code what can I do to mitigate the risk of compromising the system and opening it for invasion (which can be a common occurrence in this country if you even smell like someone who holds opinions the government disapprove of)? A few more specific questions: Is just opening the referred files enough to put my setup at serious risk? The user I use for this is on the sudoers list, so, if compromised, it could be used for escalation. Suppose I only open the files using a more limited user account, would that at least help? Outside of setting up a virtual machine only for reading (which wouldn't be practical for a few reasons) or using another computer just for that (same thing), is there anything I can do?
Dave (41 rep)
Sep 1, 2018, 04:53 PM • Last activity: Sep 9, 2018, 08:28 AM
4 votes
1 answers
665 views
personal document management system
I'm looking for a personal document management system that would: * track changes in documents in a configured directory tree * save new files to a content repository (thus allowing of their versioning) * offer some kind of web gui for browsing the repository - ideally it would allow tracking change...
I'm looking for a personal document management system that would: * track changes in documents in a configured directory tree * save new files to a content repository (thus allowing of their versioning) * offer some kind of web gui for browsing the repository - ideally it would allow tracking changes in documents etc The files I'd like it to support are: * microsoft word files (doc, docx, xls) * open office files * pdf files * txt files I've looked through a couple of such systems (DMS-grade) but haven't noticed them having the autonomous scanning option. What I'd like to have is a unobtrusive solution that wouldn't require users to upload documents through web gui. All the versioning, backing up stuff should be transparent to them. Only if they were actually interested in downloading the older version they would fire up the web gui. Any comments and suggestions are welcomed.
Marcin Cylke (253 rep)
Apr 5, 2012, 11:21 AM • Last activity: Mar 15, 2017, 06:30 PM
2 votes
1 answers
1298 views
Linux Document Management
I'm looking for a Linux (Freeware preferred) replacement for the Paperport Document Management Software by Nuance. Wants: Scan to PDF. OCR. Simple Indexing and Searching. Data stored locally. GUI. Don't wants: Cloud Based, MySQL. Most of what I am finding in my searches are business solutions (read...
I'm looking for a Linux (Freeware preferred) replacement for the Paperport Document Management Software by Nuance. Wants: Scan to PDF. OCR. Simple Indexing and Searching. Data stored locally. GUI. Don't wants: Cloud Based, MySQL. Most of what I am finding in my searches are business solutions (read expensive), cloud storage and systems that require a MySQL or other database server (service?). I've started using gscan2pdf. My other want, something that will let me scan from the ADF off my HP Printer (Officejet 8600).
user216171 (21 rep)
Feb 15, 2017, 11:25 AM • Last activity: Feb 15, 2017, 11:37 PM
1 votes
1 answers
438 views
figuring out location in a document using vim.
I have a large text file, part of which I am trying to edit in vim. I use / to search/grep through the exact string (it's an sgml file) and I do find it somewhere. During the search it tells me where the pattern is in relation to the length of the document (25-30% or more) but once I take out the pa...
I have a large text file, part of which I am trying to edit in vim. I use / to search/grep through the exact string (it's an sgml file) and I do find it somewhere. During the search it tells me where the pattern is in relation to the length of the document (25-30% or more) but once I take out the pattern, there is no way to tell how big the document is. Is there a way, maybe some configuration variable or something that I need for vim to parse and share with me the location in percentage form whenever I'm in the document.
shirish (12954 rep)
Jan 6, 2017, 07:09 PM • Last activity: Jan 6, 2017, 07:32 PM
5 votes
2 answers
7406 views
Edit .docx files with Emacs?
I'm starting with Emacs, and testing around I wondered if I could edit my own .docx file from there, just to explore its features and that stuff. But my surprise is huge when, having the text perfectly displayed in my screen, I can't manage to write over it. Emacs' DocView tells me that it's a read-...
I'm starting with Emacs, and testing around I wondered if I could edit my own .docx file from there, just to explore its features and that stuff. But my surprise is huge when, having the text perfectly displayed in my screen, I can't manage to write over it. Emacs' DocView tells me that it's a read-only document, and that I have to type C-c C-c in order to edit it. That displays me the display (sorry), but I can't see where can I edit it. Does somebody know anything about this issue? It's the first day I'm trying Emacs, but it seems quite shocking to me it can't deal with docx files.
xvlaze (289 rep)
Feb 9, 2016, 12:04 AM • Last activity: Dec 20, 2016, 08:37 AM
Showing page 1 of 20 total questions