Shell Script Help on file reading


Recommended Posts

Hi,

I need a little help on file reading. I have a file and its contents are like

123456789123456789123456789

987654321987654321987654321

231297301283120381290382133

Now I need script to format this file like

A B C D E F G H I J K L M

123 456 789 123 456 789 12 3 4 56 7 8 9

987 654 321 987 654 321 98 7 6 54 3 2 1

231 297 301 283 120 381 29 0 3 82 1 3 3

i.e the first line shows the differentiating columns and the rest of the lines following those columns.

Please help me on how this can be done.

Thank you.

Link to comment
https://www.neowin.net/forum/topic/1136228-shell-script-help-on-file-reading/
Share on other sites

Is the break down the same,

3-digits 3-digits 3-digits 3-digits 3-digits 3-digits 2-digits 1-digit 1-digit 2-digits 1-digit 1-digit?

This might be an awkward way about the problem but,

#! /bin/bash

for i in `cat filename.ext`;

do

echo ${i:0:2} ${i:3:6} ${i:7:9} ${i:10:12} ${i:13:15} ${i:16:18} ${i:19:20} ${i:21:21} ${i:22:22} ${i:23:24} ${i:25:25} ${i:26:26} >> outputfiles.ext

done;

Clearly not the cleanest way of doing it - P.s. I have not tested this as I am on an android phone lol

You are very close. This is exactly what I want. In simple terms, I want to break the characters and present them in a format like first three characters. then 4-6. then 7-12, then 8-9 and so on. The length of each line in the file remains the same and so I need to format it in different ways.

I tested your code, but the output is not coming according to the code. It would have been perfect otherwise.

Thank you :)

I tested your code, but the output is not coming according to the code. It would have been perfect otherwise.

Do you mean,

1: The wrong number of digits broke down? (Which is possible as I wrote that on an Andriod phone and I cannot see the whole block of code.)

2: The IFS the internal field separate is not triggering the line breaks? (Which is possible depending on where and what is used to create the file in the first place)

This is coming as output of your code

12 456789 891234567 23456789 56789 89

98 654321 219876543 87654321 54321 21

12 121112 123123124 12312412 12412 12

This is not consistent with the code. Your code has divided each line into 12 parts. Therefore the output should have been in 12 parts. The file has been created just by typing the lines to check how it can be broken up.

I would be interested in knowing what this is for LOL

I did the example from memory and I made a mistake in thinking is Variable:Start Reference:End Reference where in actuality it is Variable:Start Reference:Length

#

# YOUR EXAMPLE

#

[Tim.IT-8] ? cat example.txt

123456789123456789123456789

987654321987654321987654321

231297301283120381290382133

#

# SCRIPT

#

[Tim.IT-8] ? cat example.sh

#! /bin/bash

# VARIABLES

INPUTFILE=example.txt

OUTPUTFILE=example_completed.txt

# CLEAR THE SCREEN

clear

# PROCESS

for i in `cat $INPUTFILE`;

do

echo "${i:0:3} ${i:3:3} ${i:6:3} ${i:9:3} ${i:12:3} ${i:15:3} ${i:18:2} ${i:20:1} ${i:21:1} ${i:22:2} ${i:24:1} ${i:25:1} ${i:26:1}" >> $OUTPUTFILE;

done;

# CLEANUP

unset INPUTFILE;

unset OUTPUTFILE;

#

# RESULTS

#

[Tim.IT-8] ? cat example_completed.txt

123 456 789 123 456 789 12 3 4 56 7 8 9

987 654 321 987 654 321 98 7 6 54 3 2 1

231 297 301 283 120 381 29 0 3 82 1 3 3

#

# YOUR EXPECTATION

#

A B C D E F G H I J K L M

123 456 789 123 456 789 12 3 4 56 7 8 9

987 654 321 987 654 321 98 7 6 54 3 2 1

231 297 301 283 120 381 29 0 3 82 1 3 3

  • 2 months later...

Hi tim, sorry for the delayed response.

I checked your code and thanx a ton for this. It is now coming as expected. I also understood the code this time. I have a file where the lines are like

xyz12345	 12345    mprls   12345

Here the blank spaces are also character and every bit of it represents a format. e.g 1st 3 letters will represent something, 4 something else, 5-6, 7-9 and so on. It is for my own clarity, otherwise I have to debug the code manually where the debug file consists of thousands of such lines of equal width and the same format.

Thanx a lot for this. I will check this again on the raw file and if it can format the blank spaces as well.

This topic is now closed to further replies.
  • Recently Browsing   0 members

    • No registered users viewing this page.
  • Posts

    • Microsoft had to shut down 70+ GitHub repos after getting hacked, brings back some by Aditya Tiwari The self-replicating malware campaign known as Miasma took the open-source world by storm. It was reported that almost 73 Microsoft GitHub repositories were infected by the worm and had to be temporarily shut down to determine how attackers compromised projects and stuffed password-stealing malware in the code. These GitHub repos span across different organizations, including Microsoft Azure, Azure-Samples, Microsoft, and MicrosoftDocs. The malware enabled attackers to steal passwords and credentials when compromised tools were opened in popular AI coding apps, including Claude Code, Gemini CLI, VS Code, and Cursor. The security firm Cloudsmith, malware analysis site OpenSourceMalware, and 404 Media were among the first to report the hack. For background, Miasma is a variant of the Mini Shai-Hulud worm, open-sourced by the threat group TeamPCP. It started its journey by compromising a Red Hat employee's GitHub account to attack the @redhat-cloud-services npm namespace. Earlier this month, Microsoft Threat Intelligence reported that the Miasma attackers published 32 malicious packages across more than 90 versions under the @redhat-cloud-services npm scope to steal cloud credentials. The worm didn't take long to start attacking source repos directly rather than package registries. It is known to skip the npm registry entirely for several targets and plant malicious code straight into public repos like "icflorescu/mantine-datatable." The delivery approach was designed to weaponize AI coding tools. Miasma's malicious payload embedded into projects can trigger automatic code execution when the infected repo is opened in an AI coding tool or IDE. The list of affected projects includes "durabletask", a Python package compromised by TeamPCP a month earlier to deliver an information stealer designed for Linux systems. That said, Microsoft has begun restoring some repos affected by the malware campaign, The Hacker News reports. A company spokesperson stated the following: Microsoft will continue to investigate the attack. It has notified a small number of customers who may have removed their content from the affected repos. The company will reach out to customers again through established support channels "if anything further is identified that requires customer action."
    • Why is Opera doing this notification at all? They have their own extension store. They don't have to obey anything dictated by Google. Others like Brave and Vivaldi that rely on Chrome's extension store, not so much. Firefox is entirely separate as well with its own extensions store. I honestly don't understand why entire world is just insisting on Chrome. Like, why? It's a stupid fat browser with barely any functionality. But sure, it's installed on everything by default. I don't understand how people even use web that's filled with tracking garbage and ads all over the place.
    • Just for anyone reading, AdGuard (the free, standalone MV3 extension) is quite good now, a direct competitor to uBlock Origin Lite and much more built-out than it.
  • Recent Achievements

    • Week One Done
      rubentuben8 earned a badge
      Week One Done
    • Week One Done
      ARaclen earned a badge
      Week One Done
    • One Year In
      jojodbn earned a badge
      One Year In
    • One Month Later
      jojodbn earned a badge
      One Month Later
    • Week One Done
      jojodbn earned a badge
      Week One Done
  • Popular Contributors

    1. 1
      +primortal
      531
    2. 2
      PsYcHoKiLLa
      231
    3. 3
      +Edouard
      130
    4. 4
      ATLien_0
      88
    5. 5
      Steven P.
      83
  • Tell a friend

    Love Neowin? Tell a friend!