Warm tip: This article is reproduced from serverfault.com, please click

stata

Merge all .dta files in one folder?

发布于 2020-12-04 23:02:39

I have a folder with 36 .dta files which are all structured the same. Each one has 2 fields: RowID and value. Each file also has the same number of rows (2,500). The name of the "value" variable is unique to each file. I would like to construct a loop that loads in the first .dta file and then merges the "value" variable from each of the other 35 files. Any help will be greatly appreciated.

Here are sample data from 3 of the .dta files:

Example 1:
input int rowid_ float value_ex_1
 1 0
 2 0
 3 0
 4 1
 5 1
 6 1
 7 1
 8 1
 9 1
10 1

Example 2:
input int rowid_ float value_ex_2
 1 1
 2 0
 3 0
 4 1
 5 1
 6 0
 7 0
 8 0
 9 0
10 0

Example 3:
input int rowid_ float value_ex_3
 1 0
 2 0
 3 0
 4 0
 5 0
 6 1
 7 1
 8 0
 9 0
10 1

Questioner

Nick M.

Viewed

0

krasnapolsky 2020-12-06 20:05:37

In order to loop over all your .dta files, first make sure they are named after a logical order (i.e example_1.dta, example_2.dta, example_3.dta etc).

Then, you can load the first dataset and loop over the other ones with a forvalues loop:

cd "path/to/your/datasets"

use example_1.dta, clear

forvalues i = 2(1)35 { 
    merge 1:1 rowid_ using example_`i'.dta
    drop _merge
}

热门帖子

1

这大约是独立开发的顶流了吧， v 友们怎么看

2

V 友们今年刚需需要买车预算 20 出头极氪 007 可以入吗

3

黑群和白群的差异，如何选择？

4

急收点发票（电脑、笔记本、硬盘、nas 、 iPad 、服务器、交通、鼠标、键盘）

5

自研摄影 App 胶片拾光，五一限时促销： 1 元购买终身会员

6

现在找工作怎么这么难

7

请问三星 S24 港版、美版，大家一般在哪儿购买？有没有渠道，介绍一下。

8

收一台 apple watch s7or s8 不锈钢（银钢）45mm，预算 2k 左右

9

私自把 Sim 卡数据写入 Esim，会被喝茶吗？

10

存款大概有 1.5%的利息差，我们能怎么做。。

热门github

1

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

2

A Windows and Office activator using HWID / Ohook / KMS38 / Online KMS activation methods, with a focus on open-source code and fewer antivirus detections.

3

Get up and running with Llama 2, Mistral, Gemma, and other large language models.

4

该项目可以让你通过订阅的方式使用Cloudflare WARP+，自动获取流量。This project enables you to use Cloudflare WARP+ through subscription, automatically acquiring traffic.

5

Multi functional app to find duplicates, empty folders, similar images etc.

6

Xray panel supporting multi-protocol multi-user expire day & traffic & ip limit (Vmess & Vless & Trojan & ShadowSocks & Wireguard)

7

The Free Software Media System

8

lightweight, standalone C++ inference engine for Google's Gemma models.

9

📚 Freely available programming books

10

A collective list of free APIs

11

1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java

12

🎓 Path to a free self-taught education in Computer Science!

13

Curso para aprender el lenguaje de programación Python desde cero y para principiantes. 75 clases, 37 horas en vídeo, código, proyectos y grupo de chat. Fundamentos, frontend, backend, testing, IA...

14

This repository contains System Design resources which are useful while preparing for interviews and learning Distributed Systems

15

Mamba is a new state space model architecture showing promising performance on information-dense data such as language modeling, where previous subquadratic models fall short of Transformers. It is based on the line of progress on structured state space models, with an efficient hardware-aware design and implementation in the spirit of FlashAttention.