Warm tip: This article is reproduced from serverfault.com, please click

pandas python

Pandas once condition is met in column delete n rows

发布于 2020-11-28 01:04:27

I have a DataFrame that looks like df = pd.DataFrame({'col1': [.8,.9,1,1,1,.9,1,.9,.8]}).

The goal I have is once a number 1 in 'col1' is found, remove the next five rows.

Example

Expected Output

Any ideas?

Questioner

Andrew Horowitz

Viewed

0

sammywemmy 2020-11-28 13:01:25

You could use numpy.r_ to generate the integers:

position_of_1 = np.argmax(df.col1.eq(1)) # df.col1.eq(1).idxmax() not fool-proof

integers = np.r_[: position_of_1 + 1, 
                 range(position_of_1 + 6, len(df))
                 ]

df.iloc[integers]


col1
0   0.8
1   0.9
2   1.0
8   0.8

Thanks to @Ben, for the suggestion on np.argmax; it would be much better/safer to use np.argmax, for scenarios where the index are not numbers or not in proper form:

Johnson Francis 2020-11-28 02:43:49

I see it is not working with df = pd.DataFrame({'col1': [.8,.9,1,1,1,.9,1,.9,.8,.7,.6,.5,1,.4,.3,.5,.7,.9,.5,.4]})

sammywemmy 2020-11-28 02:45:52

@JohnsonFrancis. what should be the output.

Ben 2020-11-28 04:38:58

Nice, but I would change df.col1.eq(1).idxmax() to np.argmax(df.col1.eq(1)) to make this a little more bullet-proof. (Consider the case where df's index is not 0, 1, 2, ...`

sammywemmy 2020-11-28 04:59:17

thanks @Ben, I'll edit now. That's a great suggestion

Johnson Francis 2020-11-28 06:08:46

I believe, the requirement is to skip next 5 rows, when a '1' is found. So, if there are multiple '1's this code won't work. This code takes care of only the first '1' found.

热门帖子

1

这里分享一个免费的在线 PDF 总结工具： NoteGPT

2

没想到 Arc 浏览器对网络要求如此严格

3

[上海] 招中级前端开发工程师

4

澳大利亚🇦🇺归来~第一次去南半球，虽然看过很多次照片，亲临大洋路时仍觉震撼

5

Apple Watch Terminal 风格表盘

6

失业三个月，面试寥寥无几，朋友失业的也很多

7

开发了一个在线批量图片压缩网站

8

路由器批量端口映射到 NAS 的问题，求教～

9

chatgpt-4o 实时语音功能入口在哪里呀

10

出二手书，有需要的朋友可以看看。

热门github

1

A multi-platform library for OpenGL, OpenGL ES, Vulkan, window and input

2

Dev tool that writes scalable apps from scratch while the developer oversees the implementation

3

shadcn/ui, but for Svelte. ✨

4

The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and machine learning engineers to proactively find risks in their generative AI systems.

5

Performance-portable, length-agnostic SIMD with runtime dispatch

6

ZK Credo

7

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

8

Joplin - the secure note taking and to-do app with synchronisation capabilities for Windows, macOS, Linux, Android and iOS.

9

Mamba is a new state space model architecture showing promising performance on information-dense data such as language modeling, where previous subquadratic models fall short of Transformers. It is based on the line of progress on structured state space models, with an efficient hardware-aware design and implementation in the spirit of FlashAttention.

10

This repository contains System Design resources which are useful while preparing for interviews and learning Distributed Systems

11

Curso para aprender el lenguaje de programación Python desde cero y para principiantes. 75 clases, 37 horas en vídeo, código, proyectos y grupo de chat. Fundamentos, frontend, backend, testing, IA...

12

🎓 Path to a free self-taught education in Computer Science!

13

1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java

14

A collective list of free APIs

15

📚 Freely available programming books