How to find files containing ███ ?

If you are experiencing problems with "Everything", post here for assistance.
Post Reply
Thy Grand Voidinesss
Posts: 697
Joined: Wed Jun 01, 2022 5:01 pm

How to find files containing ███ ?

Post by Thy Grand Voidinesss »

I need to find out all files on my system that have in them rows build from the

Code: Select all

sign; so I would need to get search results containing at least

Code: Select all

███
And I would prefer to have this saved in my configuration as a Bookmark. And preferably I would also need eligible files to be indexed, so that such Bookmark would work instantly in the future; which for me would be formats: AHK, BAT, CMD, PS1, REG, TXT, VBS


Overall I expect this to be impossible to achieve with [at lest with one Bookmark and a pre-cashing feature added] because due to plain text encoding issues - because some programs show me in search results files that have rows built from the

Code: Select all

-
sign


So, can anybody help me with this task?
void
Developer
Posts: 17152
Joined: Fri Oct 16, 2009 11:31 pm

Re: How to find files containing ███ ?

Post by void »

Please try the following search:

ext:ahk;bat;cmd;ps1;reg;txt;vbs content:███


content:
Thy Grand Voidinesss
Posts: 697
Joined: Wed Jun 01, 2022 5:01 pm

Re: How to find files containing ███ ?

Post by Thy Grand Voidinesss »

Holy eee!

This works - and it took only a dozens of seconds. I cannot believe it is as simple as that. So one again Everything proofs to be a great piece of software


However there is some bug apparently - because I have created 4 TXT test files, with each of them with one of the following content:

Code: Select all

███

Code: Select all

_███

Code: Select all

_███_

Code: Select all

███_
in which the

Code: Select all

_
sign above represent an ordinary pause sign [for the purpose of visualizing the problem]


And now when I use

Bookmarks... > Organize Bookmarks > New > Search > Custom

set to such expression

Code: Select all

ext:ahk;bat;cmd;ps1;reg;txt;vbs content:"███"
then in the main window I get to see files with

Code: Select all

_███

Code: Select all

_███_

Code: Select all

███_
but not that first one with

Code: Select all

███
i.e. that only one which does not have any pause signs. [And yes, for my specific case scenario I literally do need to differentiate between files with literally
sign written 3 in times in a row with or without pauses adjusted to such short line built from blocks]
therube
Posts: 5056
Joined: Thu Sep 03, 2009 6:48 pm

Re: How to find files containing ███ ?

Post by therube »

So that is a "block" character, rather then a "-" or "_" that is "highlighed"?

So a "Full Block", Unicode Character “█” (U+2588)?
UTF-8 Block Elements


How do you go about getting such a character into a .bat (or otherwise) file?
Thy Grand Voidinesss
Posts: 697
Joined: Wed Jun 01, 2022 5:01 pm

Re: How to find files containing ███ ?

Post by Thy Grand Voidinesss »

Throughout my files / programs I use various block characters and other unusual signs. But for me, after many tests, I cam to a conclusion that this character [U+2588 as described at https://en.wikipedia.org/wiki/Block_Elements] is the optimal one to be used for building separation lines and indicators of chapter / titles in my files. In script files I use them as comments, e.g.

Code: Select all

:: █████████████████████████████████████████████████████████
:: ███ System Reset ███ message shown before execution ███ 
in a BAT file - which requires it to be saves with UTF-8 encoding


The task at hand was finding multiplication of

Code: Select all

and I already did that - thank you. But now I am just reporting what seems to be bug. In that example of mine I can adjust content of files to

Code: Select all

███

Code: Select all

a███

Code: Select all

a███a

Code: Select all

███a
and still when the filtering expression is

Code: Select all

content:"███"
then the first file with only
sign / without
a
character in it is not shown in results
therube
Posts: 5056
Joined: Thu Sep 03, 2009 6:48 pm

Re: How to find files containing ███ ?

Post by therube »

C:\TMP\BRU\spaces>ES TEST*.txt content:█

C:\TMP\BRU\spaces\test.TXT
C:\TMP\BRU\spaces\test3.TXT

In this case, test3.TXT only contains the character, █ , (& is utf8).


(Likewise, C:\TMP\BRU\spaces>ES TEST*.txt content:"█", works.
Likewise, C:\TMP\BRU\spaces>ES TEST333.txt content:"███", which contains ONLY 3 █, works.)


So it does look to be working on my end.
void
Developer
Posts: 17152
Joined: Fri Oct 16, 2009 11:31 pm

Re: How to find files containing ███ ?

Post by void »

Any other search options enabled? (What is checked under the Search menu?)

I am able to find the file that only contains ███

Maybe it's an encoding issue?..
Could you please attach the file that contains only ███
Thy Grand Voidinesss
Posts: 697
Joined: Wed Jun 01, 2022 5:01 pm

Re: How to find files containing ███ ?

Post by Thy Grand Voidinesss »

void wrote: Tue Nov 19, 2024 4:04 am Any other search options enabled? (What is checked under the Search menu?)
Match Diacritics

Turning it off did not make the issue go away

void wrote: Tue Nov 19, 2024 4:04 am Could you please attach the file that contains only ███
I have just sent you an e-mail with archive attachement which hold all my test files
void
Developer
Posts: 17152
Joined: Fri Oct 16, 2009 11:31 pm

Re: How to find files containing ███ ?

Post by void »

Thank you for sending the txt files.



1.txt in hex:

Code: Select all

882588258825
█ is stored as 88 25 (in hex)

Everything will not match █ inside this file with content:

The file is using Unicode encoding with no BOM.
There's no way for Everything to know the 1.txt file is using Unicode encoding.
Everything will treat the file as ANSI as it is valid ANSI text.



Notepad also opens the 1.txt file as:
ˆ%ˆ%ˆ%

(File is opened as ANSI)



Please store your txt files with a BOM.

-or-

Please try the following search:
binarycontent:█

binarycontent: will try to find text as ASCII, ANSI, UTF-8, UTF-16LE and UTF-16BE.
binarycontent:

-or-

If you know the file is using Unicode encoding, please try the following search:
unicodecontent:█

unicodecontent:



2.txt in hex:

Code: Select all

6100882588258825
a is stored as 61 00 (in hex)

Everything treats NULL bytes (00) as invalid ANSI text and will favor Unicode.

Notepad will do the same.
If you open the 2.txt file in Notepad you will see:
a███

(File is opened as Unicode)
Thy Grand Voidinesss
Posts: 697
Joined: Wed Jun 01, 2022 5:01 pm

Re: How to find files containing ███ ?

Post by Thy Grand Voidinesss »

Thank you for this detailed explanation


It seems that the optimal approach would be to add a Bookmark with such Custom field of Search

Code: Select all

ext:ahk;bat;cmd;ps1;reg;txt;vbs binarycontent:"███"
And that is because it is expected for a user to forget about having to create / modify / save every new file with BOM - just in case in future that file will start utilizing the
character for a line. And it is also absolutely impossible for a human to not make some sort of mistake somewhere along the line


Overall, as I just tested, the

Code: Select all

binarycontent:
version have found all of the files I had missed back then; I hope
Last edited by Thy Grand Voidinesss on Mon Dec 09, 2024 8:34 pm, edited 1 time in total.
therube
Posts: 5056
Joined: Thu Sep 03, 2009 6:48 pm

Re: How to find files containing ███ ?

Post by therube »

Everything treats NULL bytes (00) as invalid ANSI text and will favor Unicode.
How many (opening ?) bytes does Everything inspect when making that determination?
void
Developer
Posts: 17152
Joined: Fri Oct 16, 2009 11:31 pm

Re: How to find files containing ███ ?

Post by void »

The whole file.
Post Reply