added 277 characters in body

Source Link

edited May 13, 2024 at 14:29

33
6

I'm working with bash_history file containing blocks with the following format: #unixtimestamp\ncommand\n

here's sample of the bash_history file:

#1713308636
cat > ./initramfs/init << "EOF"
#!/bin/sh
/bin/sh
EOF
#1713308642
file initramfs/init
#1713308686
cpio -v -t -F init.cpio
#1713308689
cpio -v -t -F init.cpio
#1713308690
ls
#1713308691
ls

My goal is to de-duplicate blocks entirely, meaning both the timestamp and the associated commands. I've attempted using awk with !seen[$0]++awk, but this approach processes lines individually, not considering them as part of a block.

I've heard that using ignoredups prevents deduplication, but it won't work in this case (unless you retype the exact command) because the duplicate command is already there.

I'd appreciate suggestions on a more effective way to achieve this de-duplication.

EDIT: as suggested by Ed Morton on the comment, here's the expected output:

#1713308636
cat > ./initramfs/init << "EOF"
#!/bin/sh
/bin/sh
EOF
#1713308642
file initramfs/init
#1713308686
cpio -v -t -F init.cpio
#1713308690
ls

as a workaround, I add the delete functionality to this program. but I'm still open to other approaches that use existing commands.

I'm working with bash_history file containing blocks with the following format: #unixtimestamp\ncommand\n

here's sample of the bash_history file:

#1713308636
cat > ./initramfs/init << "EOF"
#!/bin/sh
/bin/sh
EOF
#1713308642
file initramfs/init
#1713308686
cpio -v -t -F init.cpio
#1713308689
cpio -v -t -F init.cpio
#1713308690
ls
#1713308691
ls

My goal is to de-duplicate blocks entirely, meaning both the timestamp and the associated commands. I've attempted using awk with !seen[$0]++, but this approach processes lines individually, not considering them as part of a block.

I've heard that using ignoredups prevents deduplication, but it won't work in this case (unless you retype the exact command) because the duplicate command is already there.

I'd appreciate suggestions on a more effective way to achieve this de-duplication.

EDIT: as suggested by Ed Morton on the comment, here's the expected output:

#1713308636
cat > ./initramfs/init << "EOF"
#!/bin/sh
/bin/sh
EOF
#1713308642
file initramfs/init
#1713308686
cpio -v -t -F init.cpio
#1713308690
ls

I'm working with bash_history file containing blocks with the following format: #unixtimestamp\ncommand\n

here's sample of the bash_history file:

#1713308636
cat > ./initramfs/init << "EOF"
#!/bin/sh
/bin/sh
EOF
#1713308642
file initramfs/init
#1713308686
cpio -v -t -F init.cpio
#1713308689
cpio -v -t -F init.cpio
#1713308690
ls
#1713308691
ls

My goal is to de-duplicate blocks entirely, meaning both the timestamp and the associated commands. I've attempted using awk, but this approach processes lines individually, not considering them as part of a block.

I've heard that using ignoredups prevents deduplication, but it won't work in this case (unless you retype the exact command) because the duplicate command is already there.

I'd appreciate suggestions on a more effective way to achieve this de-duplication.

EDIT: as suggested by Ed Morton on the comment, here's the expected output:

#1713308636
cat > ./initramfs/init << "EOF"
#!/bin/sh
/bin/sh
EOF
#1713308642
file initramfs/init
#1713308686
cpio -v -t -F init.cpio
#1713308690
ls

as a workaround, I add the delete functionality to this program. but I'm still open to other approaches that use existing commands.

added 275 characters in body

Source Link

edited May 13, 2024 at 12:44

ReYuki

33
6

I'm working with bash_history file containing blocks with the following format: #unixtimestamp\ncommand\n

here's sample of the bash_history file:

#1713308636
cat > ./initramfs/init << "EOF"
#!/bin/sh
/bin/sh
EOF
#1713308642
file initramfs/init
#1713308686
cpio -v -t -F init.cpio
#1713308689
cpio -v -t -F init.cpio | less
#1713308690
ls
#1713308691
ls

My goal is to de-duplicate blocks entirely, meaning both the timestamp and the associated commands. I've attempted using awk with !seen[$0]++, but this approach processes lines individually, not considering them as part of a block.

I've heard that using ignoredups prevents deduplication, but it won't work in this case (unless you retype the exact command) because the duplicate command is already there.

I'd appreciate suggestions on a more effective way to achieve this de-duplication.

EDIT: as suggested by Ed Morton on the comment, here's the expected output:

#1713308636
cat > ./initramfs/init << "EOF"
#!/bin/sh
/bin/sh
EOF
#1713308642
file initramfs/init
#1713308686
cpio -v -t -F init.cpio
#1713308690
ls

I'm working with bash_history file containing blocks with the following format: #unixtimestamp\ncommand\n

here's sample of the bash_history file:

#1713308636
cat > ./initramfs/init << "EOF"
#!/bin/sh
/bin/sh
EOF
#1713308642
file initramfs/init
#1713308686
cpio -v -t -F init.cpio
#1713308689
cpio -v -t -F init.cpio | less

My goal is to de-duplicate blocks entirely, meaning both the timestamp and the associated commands. I've attempted using awk with !seen[$0]++, but this approach processes lines individually, not considering them as part of a block.

I've heard that using ignoredups prevents deduplication, but it won't work in this case (unless you retype the exact command) because the duplicate command is already there.

I'd appreciate suggestions on a more effective way to achieve this de-duplication.

I'm working with bash_history file containing blocks with the following format: #unixtimestamp\ncommand\n

here's sample of the bash_history file:

#1713308636
cat > ./initramfs/init << "EOF"
#!/bin/sh
/bin/sh
EOF
#1713308642
file initramfs/init
#1713308686
cpio -v -t -F init.cpio
#1713308689
cpio -v -t -F init.cpio
#1713308690
ls
#1713308691
ls

My goal is to de-duplicate blocks entirely, meaning both the timestamp and the associated commands. I've attempted using awk with !seen[$0]++, but this approach processes lines individually, not considering them as part of a block.

I've heard that using ignoredups prevents deduplication, but it won't work in this case (unless you retype the exact command) because the duplicate command is already there.

I'd appreciate suggestions on a more effective way to achieve this de-duplication.

EDIT: as suggested by Ed Morton on the comment, here's the expected output:

#1713308636
cat > ./initramfs/init << "EOF"
#!/bin/sh
/bin/sh
EOF
#1713308642
file initramfs/init
#1713308686
cpio -v -t -F init.cpio
#1713308690
ls

added 438 characters in body; edited tags; edited title

Source Link

edited May 13, 2024 at 8:01

ReYuki

33
6

how to remove duplicatede-duplicate block (timestamp+command) from bash history?

I'm working with a textbash_history file containing blocks with the following format: #unixtimestamp\ncommand\n

here's sample of the textbash_history file:

#1713308636
cat > ./initramfs/init << "EOF"
#!/bin/sh
/bin/sh
EOF
#1713308642
file initramfs/init
#1713308686
cpio -v -t -F init.cpio
#1713308689
cpio -v -t -F init.cpio | less

My goal is to remove duplicatede-duplicate blocks entirely, meaning both the timestamp and the associated commands. I've attempted using awk with !seen[$0]++, but this approach processes lines individually, not considering them as part of a block.

I've heard that using ignoredups prevents deduplication, but it won't work in this case (unless you retype the exact command) because the duplicate command is already there.

I'd appreciate suggestions on a more effective way to achieve this de-duplication.

how to remove duplicate block?

I'm working with a text file containing blocks with the following format: #unixtimestamp\ncommand\n

here's sample of the text:

#1713308636
cat > ./initramfs/init << "EOF"
#!/bin/sh
/bin/sh
EOF
#1713308642
file initramfs/init
#1713308686
cpio -v -t -F init.cpio
#1713308689
cpio -v -t -F init.cpio | less

My goal is to remove duplicate blocks entirely, meaning both the timestamp and the associated commands. I've attempted using awk with !seen[$0]++, but this approach processes lines individually, not considering them as part of a block.

I'd appreciate suggestions on a more effective way to achieve this de-duplication.

how to de-duplicate block (timestamp+command) from bash history?

I'm working with bash_history file containing blocks with the following format: #unixtimestamp\ncommand\n

here's sample of the bash_history file:

#1713308636
cat > ./initramfs/init << "EOF"
#!/bin/sh
/bin/sh
EOF
#1713308642
file initramfs/init
#1713308686
cpio -v -t -F init.cpio
#1713308689
cpio -v -t -F init.cpio | less

My goal is to de-duplicate blocks entirely, meaning both the timestamp and the associated commands. I've attempted using awk with !seen[$0]++, but this approach processes lines individually, not considering them as part of a block.

I've heard that using ignoredups prevents deduplication, but it won't work in this case (unless you retype the exact command) because the duplicate command is already there.

I'd appreciate suggestions on a more effective way to achieve this de-duplication.

Source Link

asked May 13, 2024 at 6:43

ReYuki

33
6

Loading

Stack Exchange Network

Return to Question

how to remove duplicatede-duplicate block (timestamp+command) from bash history?

how to remove duplicate block?

how to de-duplicate block (timestamp+command) from bash history?