温馨提示:本文翻译自stackoverflow.com，查看原文请点击：r - Counting the number of rows for different combinations of factors

dplyr r

r - 计算不同因素组合的行数

发布于 2020-03-31 23:44:40

考虑到诸如classic之类的数据集mtcars，我想知道按不同水平的因素将观察值（=行）分开或同时进行的情况。

例如，以下代码将生成一列N，其中包含每级气缸和齿轮的观测值数量，而不是分别针对气缸和齿轮的观测值数量。

mtcars %>% dplyr::group_by(cyl, gear) %>% dplyr::summarise(N = n())

我知道可以以类似的方式获得气缸和齿轮的单独观测值，创建单独的数据框，然后将它们合并在一起。以下将生成预期的输出：

df <- mtcars %>% dplyr::group_by(cyl, gear) %>% dplyr::summarise(N = n())
df_gear <- mtcars %>% dplyr::group_by(gear) %>% dplyr::summarise(Ngear = n())
df_cyl <- mtcars %>% dplyr::group_by(cyl) %>% dplyr::summarise(Ncyl = n())
df %>% dplyr::left_join(df_cyl) %>% dplyr::left_join(df_gear)

但是我想知道是否有一种更干净的方法来生成此数据集，希望无需生成中间数据集。

提问者

elcortegano

被浏览

13

查看英文版

查看原文

H 1 2020-01-31 20:44

下面是你可能会接近这个，依靠一种方式mutate()和ave()替代group_by()和summarise()紧凑性：

library(dplyr)

mtcars %>% 
  mutate(n = ave(cyl, cyl, gear, FUN = length),
         n_cyl = ave(cyl, cyl, FUN = length),
         n_gear = ave(gear, gear, FUN = length)) %>%
  select(gear, cyl, n, n_cyl, n_gear) %>%
  distinct()

  gear cyl  n n_cyl n_gear
1    4   6  4     7     12
2    4   4  8    11     12
3    3   6  2     7     15
4    3   8 12    14     15
5    3   4  1    11     15
6    5   4  2    11      5
7    5   8  2    14      5
8    5   6  1     7      5

相关问题

1

过滤具有特定条件的所有列的行

2

ggplot2绘图区域内的两个轴标签

3

错误：在R中找不到函数...

4

创建加载消息，这些消息将根据 shiny 的应用程序中情节的加载时间而改变

5

热图生成R中的cut.default错误

6

r中的apply函数存在问题：仅在第一列中应用

7

R在滑动窗口时间段内创建先前事件的计数

8

使用setDT将一个数据帧中的许多列合并到另一数据帧中

9

根据 shiny dashboard 其他选项卡中的操作在选项卡中显示下载按钮

10

用奇怪的格式解析R中的日期

热门github

1

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface. (翻译：SkyPilot 是一个框架，可通过统一界面在任何云上轻松运行机器学习工作负载。)

2

1 min voice data can also be used to train a good TTS model! (few shot voice cloning) (翻译：1分钟的语音数据也可以用来训练一个好的TTS模型！（几张声音克隆镜头）)

3

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

4

科技爱好者周刊，每周五发布

5

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

6

Build Real-Time Knowledge Graphs for AI Agents

7

AI Notepad for back-to-back meetings. Local-first & Extensible.

8

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/ (翻译：12 节课程，开始使用生成式 AI 进行构建)

9

A one-of-a-kind resume builder that keeps your privacy in mind. Completely secure, customizable, portable, open-source and free forever. Try it out today! (翻译：Reactive Resume 是一款免费开源的简历生成器，支持定制和移植、安全、开源且永久免费。赶紧试试吧！)

10

Collection of leaked system prompts

11

A collection of inspiring lists, manuals, cheatsheets, blogs, hacks, one-liners, cli/web tools and more. (翻译：这个存储库是我每天在工作中使用的各种材料和工具的集合。)

12

Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions (翻译：包含Linux、Jenkins、AWS、SRE、Prometheus、Docker、Python、Ansible、Git、Kubernetes、Terraform、OpenStack、SQL、NoSQL、Azure、GCP、DNS、弹性、网络、虚拟化等DevOps 面试问题)

13

Find vulnerabilities, misconfigurations, secrets, SBOM in containers, Kubernetes, code repositories, clouds and more (翻译：容器映像、文件系统和 Git 存储库中的漏洞以及配置问题和硬编码机密的扫描程序)

14

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

15

AI-powered multi-agent builder