温馨提示:本文翻译自stackoverflow.com，查看原文请点击：grouping - Display how many rows appear by each ID when data is not a panel (R)

grouping r

grouping - 当数据不是面板（R）时，显示每个ID出现多少行

发布于 2020-03-27 11:07:43

我正在使用一个纵向数据集，该数据集在单个时间单位中每个ID具有多个行。我以前从未见过这样的情况，也找不到任何类似的问题。

在此示例中，团体借钱。每个小组由多个客户组成，每个信用额可能会在多个月内出现（数据是纵向的）。如果单个组有多个贷方，我想显示这是贷方提供的第一，第二还是第三贷方。

在下面的示例中，我想声明column Iteration。让由客户1和2组成的组1获得两笔贷款：2018年1月的Credit_ID 100和3月的Credit_ID 233。

> dt
Client  Group  Credit_ID     Crop  File_origin  Iteration
     1      1        100  2018-01      2018-01          1
     2      1        100  2018-01      2018-01          1
     1      1        100  2018-01      2018-02          1
     2      1        100  2018-01      2018-02          1
     1      1        233  2018-03      2018-03          2
     2      1        233  2018-03      2018-03          2

如何定义Iteration列？我认为关键在于每次Group和Credit_ID更改时都要关注。

我试过了：

    library(data.table)
    dt[, 1:.N, by = list(Group, Credit_ID)]

但这枚举每个组和Credit_ID的行数。

提问者

Arturo Sbr

被浏览

222

查看英文版

查看原文

tmfmnk 2019-07-03 22:22

一种dplyr可能是：

df %>%
 group_by(Group, Client) %>%
 mutate(Res = cumsum(!duplicated(Credit_ID)))

  Client Group Credit_ID Crop    File_origin Iteration   Res
   <int> <int>     <int> <chr>   <chr>           <int> <int>
1      1     1       100 2018-01 2018-01             1     1
2      2     1       100 2018-01 2018-01             1     1
3      1     1       100 2018-01 2018-02             1     1
4      2     1       100 2018-01 2018-02             1     1
5      1     1       233 2018-03 2018-03             2     2
6      2     1       233 2018-03 2018-03             2     2

或与base R：

with(df, ave(Credit_ID, Group, Client, FUN = function(x) cumsum(!duplicated(x))))

相关问题

1

过滤具有特定条件的所有列的行

2

ggplot2绘图区域内的两个轴标签

3

错误：在R中找不到函数...

4

创建加载消息，这些消息将根据 shiny 的应用程序中情节的加载时间而改变

5

热图生成R中的cut.default错误

6

r中的apply函数存在问题：仅在第一列中应用

7

R在滑动窗口时间段内创建先前事件的计数

8

使用setDT将一个数据帧中的许多列合并到另一数据帧中

9

根据 shiny dashboard 其他选项卡中的操作在选项卡中显示下载按钮

10

用奇怪的格式解析R中的日期

热门github

1

A command line tool and library for transferring data with URL syntax, supporting DICT, FILE, FTP, FTPS, GOPHER, GOPHERS, HTTP, HTTPS, IMAP, IMAPS, LDAP, LDAPS, MQTT, POP3, POP3S, RTMP, RTMPS, RTSP, SCP, SFTP, SMB, SMBS, SMTP, SMTPS, TELNET, TFTP, WS and WSS. libcurl offers a myriad of powerful features (翻译：Curl 是一个命令行工具，用于传输使用 URL 语法指定的数据。)

2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

3

Flutter makes it easy and fast to build beautiful apps for mobile and beyond (翻译：Flutter 可以轻松快速地为移动设备及其他应用构建漂亮的应用程序)

4

Powerful menu bar manager for macOS

5

Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.

6

AI coding agent, built for the terminal.

7

Tongyi DeepResearch, the Leading Open-source DeepResearch Agent

8

An AI Hedge Fund Team

9

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.

10

基于大模型和 RAG 的智能问数系统。Text-to-SQL Generation via LLMs using RAG.

11

🔥 🔥 🔥 Open Source Airtable Alternative (翻译：将任何 MySQL、PostgreSQL、SQL Server、SQLite 和 MariaDB 转换为智能电子表格。)

12

Lightweight coding agent that runs in your terminal

13

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

14

Home of the WebKit project, the browser engine used by Safari, Mail, App Store and many other applications on macOS, iOS and Linux. (翻译：WebKit 项目的主页，Safari、Mail、App Store 和 macOS、iOS 和 Linux 上的许多其他应用程序使用的浏览器引擎。)

15