Warm tip: This article is reproduced from serverfault.com, please click

bash scripting shell

How to write a shell script to swap columns in txt file?

发布于 2020-11-28 13:14:25

I was trying to solve one of my old assignment I am literally stuck in this one Can anyone help me?

There is a file called "datafile". This file has names of some friends and their

ages. But unfortunately, the names are not in the correct format. They should be

lastname, firstname

But, by mistake they are firstname,lastname

The task of the problem is writing a shell script called fix_datafile

to correct the problem, and sort the names alphabetically. The corrected filename

is called datafile.fix .

Please make sure the original structure of the file should be kept untouched.

The following is the sample of datafile.fix file:

#personal information

#******** Name ********* ***** age *****

Alexanderovich,Franklin 47

Amber,Christine 54

Applesum,Franky 33

Attaboal,Arman 18

Balad,George 38

Balad,Sam 19

Balsamic,Shery 22

Bojack,Steven 33

Chantell,Alex 60

Doyle,Jefry 45

Farland,Pamela 40

Handerman,jimmy 23

Kashman,Jenifer 25

Kasting,Ellen 33

Lorux,Allen 29

Mathis,Johny 26

Maxter,Jefry 31

Newton,Gerisha 40

Osama,Franklin 33

Osana,Gabriel 61

Oxnard,George 20

Palomar,Frank 24

Plomer,Susan 29

Poolank,John 31

Rochester,Benjami 40

Stanock,Verona 38

Tenesik,Gabriel 29

Whelsh,Elsa 21

Questioner

kjbjjknjkbasjx

Viewed

0

Taras Khalymon 2020-11-28 22:04:01

If you can use awk (I suppose you can), than this there's a script which does what you need:

#!/bin/bash
RESULT_FILE_NAME="datafile.new"
cat datafile.fix | head -4 > datafile.new
cat datafile.fix | tail -n +5 | awk -F"[, ]" '{if(!$2){print()}else{print($2","$1, $3)}}' >> datafile.new

Passing -F"[, ]" allows awk to split columns both by , and space and all that remains is just print columns in a needed format. The downsides are that we should use if statement to preserve empty lines and file header also should be treated separately.

Another option is using sed:

cat datafile.fix | sed -E 's/([a-zA-Z]+),([a-zA-Z]+) ([0-9]+)/\2,\1 \3/g' > datafile.new

The downside is that it requires regex that is not as obvious as awk syntax.

kjbjjknjkbasjx 2020-11-28 13:45:34

What could be done by sed?

Taras Khalymon 2020-11-28 13:47:15

there will be a similar expression as awk , but using sed. It's just another test processor and it doesn't make a major difference

Taras Khalymon 2020-11-28 13:58:28

Just added a solution using sed. It works better because it preserves file structure

Raman Sailopal 2020-11-28 16:12:28

There is no need for cat and head when using awk as awk can do all this processing for you.

热门帖子

1

难道 Go 就没有好用的工作审批流框架吗

2

跟风贴自家软路由实现

3

Mac 上有什么 pdf 阅读器比较好用？

4

再开一个贴，问界发布新 M5，请各位老哥或者老车主骂醒我

5

我这辈子是不是彻底和 Airpods Pro 无缘了

6

最近对很多事情都没有感觉了

7

微信读书非付费会员有每月导入数量限制了

8

不知道从什么时候，普通 USB-C 双头线不能给 iPhone15 充电了

9

程序员想开发漂亮的个人网站是不是用 react 会比 vue 简单一些？

10

有关于上海居住证 120 积分

热门github

1

A multi-platform library for OpenGL, OpenGL ES, Vulkan, window and input

2

Dev tool that writes scalable apps from scratch while the developer oversees the implementation

3

shadcn/ui, but for Svelte. ✨

4

The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and machine learning engineers to proactively find risks in their generative AI systems.

5

Performance-portable, length-agnostic SIMD with runtime dispatch

6

ZK Credo

7

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

8

Joplin - the secure note taking and to-do app with synchronisation capabilities for Windows, macOS, Linux, Android and iOS.

9

Mamba is a new state space model architecture showing promising performance on information-dense data such as language modeling, where previous subquadratic models fall short of Transformers. It is based on the line of progress on structured state space models, with an efficient hardware-aware design and implementation in the spirit of FlashAttention.

10

This repository contains System Design resources which are useful while preparing for interviews and learning Distributed Systems

11

Curso para aprender el lenguaje de programación Python desde cero y para principiantes. 75 clases, 37 horas en vídeo, código, proyectos y grupo de chat. Fundamentos, frontend, backend, testing, IA...

12

🎓 Path to a free self-taught education in Computer Science!

13

1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java

14

A collective list of free APIs

15

📚 Freely available programming books