温馨提示:本文翻译自stackoverflow.com，查看原文请点击：其他 - Remove text between [quote= and [/quote] in Python

python-3.x

其他 - 在Python中删除[quote =和[/ quote]之间的文本

发布于 2020-03-27 12:00:13

我正在读取一个应用NLP的csv文件，并且想对数据进行预处理。我从一个在线论坛收到数据，因此上面有引号。如何删除它们？举个例子;

a='[b]Re:[/b] 
[quote="xxx"] How can I do that blah blah xxx [/quote]
 Hello xxx, I will tell you how you can do it blah blah blah.'

我想要下面的表格；

一个='你好xxx，我会告诉你你怎么能做到的等等。

我想检测到[quote =“并开始删除直到看到[/ quote]的正则表达式。这可能吗？

我已经尝试过了，但是没有用。

  def quotes(text):
   return re.sub('\[([^\]=]+)(?:=[^\]]+)?\].*?\[\/\\1\]', '', text)

  data['message'] = data['message'].apply(quotes)

提问者

nurlubanu

被浏览

17

查看英文版

查看原文

nurlubanu 2019-07-04 23:43

答案其实太简单了

def quotes(text):
 return re.sub(r'\[quote.+quote\]','',text)
data['message'] = data['message'].apply(quotes)

只是。

相关问题

1

从具有特定条件的列表列表创建字典

2

使用For循环进行迭代时从列表中删除项目

3

使用numpy确定两个矩阵之间的距离

4

当我从Python中的Google搜索查询中提取链接时，我无法返回HTML链接

5

是否可以创建嵌套的类属性？

6

在PyCharm中连接到SQLite3

7

EVENT LOOP已关闭discord.py，可能出现令牌错误

8

* args返回仅包含偶数个参数的列表

9

无法获取client.command参数以通过discord.py中的键值解析API响应

10

创建一个名字为键的字典，与该键关联的全名为值

热门github

1

Fast, easy and reliable testing for anything that runs in a browser. (翻译：Cypress 是为现代网络而构建的下一代前端测试工具，用于解决开发者和 QA 工程师在测试现代应用程序时面临的关键难题)

2

An AI Hedge Fund Team

3

🤘 TT-NN operator library, and TT-Metalium low level kernel programming model.

4

Lightweight coding agent that runs in your terminal

5

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

6

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 80+ languages. (翻译：PaddleOCR旨在打造一套丰富、领先、且实用的OCR工具库，助力开发者训练出更好的模型，并应用落地。)

7

3D Reconstruction for all

8

基于大模型和 RAG 的智能问数系统。Text-to-SQL Generation via LLMs using RAG.

9

Main repository for the Linera protocol

10

AI wearables. Put it on, speak, transcribe, automatically

11

A complete computer science study plan to become a software engineer. (翻译：一个如何成为软件工程师的完整、科学的学习计划。)

12

Open-source framework for conversational voice AI agents.

13

14

Tongyi DeepResearch, the Leading Open-source DeepResearch Agent

15

Flutter makes it easy and fast to build beautiful apps for mobile and beyond (翻译：Flutter 可以轻松快速地为移动设备及其他应用构建漂亮的应用程序)