温馨提示:本文翻译自stackoverflow.com，查看原文请点击：其他 - How to group by consecutive rows in a R dataframe?

r

其他 - 如何按R数据帧中的连续行分组？

发布于 2020-03-27 12:05:25

我在时间序列数据中有一个带有时间戳，类型，值列的数据框。类型指的是峰还是谷。我想要：

Group all data by consecutive types For groups of "peak" type I want to select the highest For groups if "valley" type I want to select the lowest Filter the dataframe by these highest/lowest Expectation: I would have a dataframe that alternated each row between the highest peak and lowest valley.

The only way I know how to do this is by using a for loop and then adding consecutive values into a vector and then getting the max, then shoving this in a new dataframe and so on.

For those who know python, this is what I did in that (I need to transfer my code to R though):

segmentation['min_v'] = segmentation.groupby( segmentation.pv_type.ne(segmentation.pv_type.shift()).cumsum() ).price.transform(min)
segmentation['max_p'] = segmentation.groupby( segmentation.segmentation.pv_type.ne(segmentation.pv_type.shift()).cumsum() ).price.transform(max)

EDIT

Sample data set:

types <- c('peak', 'peak', 'valley', 'peak', 'valley', 'valley', 'valley')
values <- c(1.01,   1.00,    0.4,     1.2,     0.3,      0.1,      0.2)
segmentation <- data.frame(types, values)
segmentation

expectedTypes <- c('peak', 'valley', 'peak', 'valley')
expectedValues <- c(1.00, 0.4, 1.2, 0.1 )
expectedResult <- data.frame(expectedTypes, expectedValues)
expectedResult

I dont know a better way to generate the data.

提问者

Fred Johnson

被浏览

18

查看英文版

查看原文

akrun 2019-07-04 03:18

使用时R，一种实现dplyr方式是将'pv_type'与'pv_type'的逻辑比较的累积和lag作为分组列，然后将'price' min和max'price'作为两个新列

library(dplyr)
segmentation %>%
       group_by(pv_type_group = cumsum(pv_type != lag(pv_type,
                 default = first(pv_type))) %>%
       mutate(min_v = min(price), max_p = max(price))

更新资料

在OP的示例中，预期输出为summarised，因此我们使用summarise代替mutate。另外，使用rleid（from data.table）代替逻辑累积和

library(data.table)
segmentation %>% 
    group_by(grp = rleid(types)) %>% 
    summarise(types = first(types), expectedvalues = min(values)) %>%
    ungroup %>%
    select(-grp)
# A tibble: 4 x 2
#  types  expectedvalues
# <fct>           <dbl>
#1 peak              1  
#2 valley            0.4
#3 peak              1.2
#4 valley            0.1

Fred Johnson 2019-07-04 00:19:12

％>％的作用/含义是什么？

akrun 2019-07-04 00:20:28

@ user2330270是chan运算符，它连接用于进一步处理的lhs输出

akrun 2019-07-04 01:15:47

许多人无法同时掌握两种语言。通过降低投票率，它降低了人们的价值，从而阻止人们回答代码转换问题。的确，OP没有提供可复制的示例，但是代码转换实际上并不需要

Fred Johnson 2019-07-04 02:08:27

好的，谢谢，现在测试您的答案

akrun 2019-07-04 05:10:54

@ user2330270第二个是摘要输出。在python中，您正在transform创建一个新列。另外，示例中的列名也不同

相关问题

1

过滤具有特定条件的所有列的行

2

ggplot2绘图区域内的两个轴标签

3

错误：在R中找不到函数...

4

创建加载消息，这些消息将根据 shiny 的应用程序中情节的加载时间而改变

5

热图生成R中的cut.default错误

6

r中的apply函数存在问题：仅在第一列中应用

7

R在滑动窗口时间段内创建先前事件的计数

8

使用setDT将一个数据帧中的许多列合并到另一数据帧中

9

根据 shiny dashboard 其他选项卡中的操作在选项卡中显示下载按钮

10

用奇怪的格式解析R中的日期

热门github

1

Suna - Open Source Generalist AI Agent

2

The new Windows Terminal and the original Windows console host, all in the same place! (翻译：新版Windows 终端)

3

A simple, easy to use PowerShell script to remove pre-installed apps from Windows, disable telemetry, remove Bing from Windows search as well as perform various other changes to declutter and improve your Windows experience. This script works for both Windows 10 and Windows 11.

4

AeroSpace is an i3-like tiling window manager for macOS

5

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

6

Claude can perform Web Search | Exa with MCP (Model Context Protocol)

7

Lightning-fast and Powerful Code Editor written in Rust (翻译：使用Rust编写的快速、强大的代码编辑器)

8

A GUI client for Windows, Linux and macOS, support Xray and sing-box and others (翻译：一个 V2Ray 的 Windows 客户端，支持 Xray 核心和 v2fly 核心)

9

Agent S: an open agentic framework that uses computers like a human

10

Master programming by recreating your favorite technologies from scratch. (翻译：在这个项目中，你能学会如何创造自己的各种工具，引擎，游戏，框架，库......)

11

ChatGPT DAN, Jailbreaks prompt

12

A collection of inspiring lists, manuals, cheatsheets, blogs, hacks, one-liners, cli/web tools and more. (翻译：这个存储库是我每天在工作中使用的各种材料和工具的集合。)

13

Burn is a next generation Deep Learning Framework that doesn't compromise on flexibility, efficiency and portability.

14

#1 Locally hosted web application that allows you to perform various operations on PDF files

15

Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions (翻译：包含Linux、Jenkins、AWS、SRE、Prometheus、Docker、Python、Ansible、Git、Kubernetes、Terraform、OpenStack、SQL、NoSQL、Azure、GCP、DNS、弹性、网络、虚拟化等DevOps 面试问题)