Note: this post is translated from stackoverflow.com. Original question: python - Xpath returning same result

python scrapy xpath

python - Xpath returning same result

Posted 2020-03-27 11:03:54

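I'm trying to use XPath to scrape the card names from the following website: https://www2.trollandtoad.com/buylist/?_ga=2.123753418.115346513.1562026676-1813285172.1559913561#!/M/10591, but it returns the same result every time. I need it to output every card name at that link, but it gives me the same card name over and over.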

def parse(self, response):
    # Initialize item as a GameItem (defined in items.py); will be called multiple times
    item = GameItem()
    # Extract the card category using the HTML from the website that identifies the category. Will be output before the rest of the data
    for data in response.css('tr.ng-scope'):
        item["Set"] = data.css("a.ng-binding.ng-scope::text").get()
        if item["Set"] is None:
            item["Set"] = data.css("span.ng-binding.ng-scope::text").get()
        item["Card_Name"] = data.xpath("//div/table/tbody/tr/td[contains(@class,'buylist_productname item')]/a/text()").get()

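I tried using getall(), but that didn't work either. It returns all the card names, but they are not paired correctly with the other data I'm scraping. Instead of outputting one card name with one price, and so on, it gives me all the card names together with the first card's price, and so on.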

Asked by

Tom

Views

123

gangabass 2019-07-03 22:21

You need a relative XPath:

item["Card_Name"]  = data.xpath(".//td[2]/a/text()").get()

UPDATE: fixed your XPath.
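
To illustrate why the relative form matters, here is a minimal sketch that uses the standard library's xml.etree.ElementTree as a stand-in for Scrapy selectors. The markup is hypothetical and much simpler than the real page. ElementTree does not accept a leading "//" on an element, so searching from the document root on every iteration is used here to mimic what a leading "//" does in Scrapy.

```python
import xml.etree.ElementTree as ET

# Hypothetical markup standing in for the buylist table; the real page differs.
HTML = """
<table>
  <tr class="ng-scope"><td>Set A</td><td class="buylist_productname item"><a>Card One</a></td></tr>
  <tr class="ng-scope"><td>Set A</td><td class="buylist_productname item"><a>Card Two</a></td></tr>
  <tr class="ng-scope"><td>Set B</td><td class="buylist_productname item"><a>Card Three</a></td></tr>
</table>
"""

root = ET.fromstring(HTML)
rows = root.findall(".//tr")

# Searching from the document root on every iteration always finds the first
# matching link in the whole document, regardless of the current row.
absolute_names = [root.find(".//td[2]/a").text for _ in rows]

# Searching relative to the current row finds that row's own link.
relative_names = [row.find("./td[2]/a").text for row in rows]

# Building a fresh dict per row keeps each card name paired with its own set.
items = [
    {"Set": row.find("./td[1]").text, "Card_Name": row.find("./td[2]/a").text}
    for row in rows
]

print(absolute_names)
print(relative_names)
print(items)
```

In a Scrapy spider the same idea applies: start the expression with a dot so that it is evaluated relative to the current row, and create a new item for each row rather than reusing one item across the loop.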

Tom 2019-07-03 22:18:54

I've already tried that; it just returns null for every value.

gangabass 2019-07-03 22:22:03

@Timmy Oh, your XPath is wrong. See the updated answer.
