温馨提示:本文翻译自stackoverflow.com，查看原文请点击：python - Restrict the sum of outputs in a neural network regression (Keras)

keras python tensorflow activation-function

python - 限制神经网络回归中的输出总和（Keras）

发布于 2020-03-31 23:51:57

我正在预测7个目标，这是一个值的比率，因此对于每个样本，所有预测值的总和应为1。除了softmax在输出端使用（似乎显然不正确）之外，我只是想不出其他方法来限制总和所有预测的输出为= 1 ..
谢谢您的建议。

input_x = Input(shape=(input_size,))
output = Dense(512, activation=PReLU())(input_x)
output = Dropout(0.5)(output)
output = Dense(512, activation=PReLU())(output)
output = Dropout(0.5)(output)
output = Dense(16, activation=PReLU())(output)
output = Dropout(0.3)(output)
outputs = Dense(output_size, activation='softmax')(output)
#outputs = [Dense(1, activation=PReLU())(output) for i in range(output_size)] #multioutput nn

nn = Model(inputs=input_x, outputs=outputs)
es = EarlyStopping(monitor='val_loss',min_delta=0,patience=10,verbose=1, mode='auto')
opt=Adam(lr=0.001, decay=1-0.995)
nn.compile(loss='mean_absolute_error', optimizer=opt)
history = nn.fit(X, Y, validation_data = (X_t, Y_t), epochs=100, verbose=1, callbacks=[es])

目标示例：

因此，这是一个要素的所有比率，每一行的总和= 1。
例如功能-“总计” = 100分，A = 25分，B = 25分，其他所有-10分。因此，我的7个目标比率将为0.25 / 0.25 / 0.1 / 0.1 / 0.1 / 0.1 / 0.1 / 0.1。

我需要训练和预测这样的比率，因此在将来知道“总计”时，我们可以从预测的比率中恢复点。

提问者

Oleksii

被浏览

158

查看英文版

查看原文

Tomasz Gandor 2020-02-06 06:47

我想我了解您的动机，以及为什么“ softmax不会削减它”。

这是因为softmax不能线性缩放，因此：

>>> from scipy.special import softmax
>>> softmax([1, 2, 3, 4])
array([0.0320586 , 0.08714432, 0.23688282, 0.64391426])
>>> softmax([1, 2, 3, 4]) * 10
array([0.32058603, 0.87144319, 2.36882818, 6.4391426 ])

看起来与原始数组完全不同。

不过，不要过于轻视softmax-它可以处理特殊情况，例如负值，零，预激活信号的零和...。但是，如果您希望将最终回归归一化，并期望结果为非-负数，您可以简单地将其除以和：

input_x = Input(shape=(input_size,))
output = Dense(512, activation=PReLU())(input_x)
output = Dropout(0.5)(output)
output = Dense(512, activation=PReLU())(output)
output = Dropout(0.5)(output)
output = Dense(16, activation=PReLU())(output)
output = Dropout(0.3)(output)
outputs = Dense(output_size, activation='relu')(output)
outputs = Lambda(lambda x: x / K.sum(x))(outputs)

nn = Model(inputs=input_x, outputs=outputs)

Dense当然，该层需要与激活不同的激活'softmax'（relu甚至线性都可以）。

Tomasz Gandor 2020-02-06 06:49:38

当然，为了使该体系结构有意义，训练和验证集（Y，Y_t）也应具有此属性-每行的总和应= 1。

Oleksii 2020-02-07 20:31:41

谢谢！我将尝试比较两种情况。

Oleksii 2020-02-18 16:49:12

最终回归解决方案失败-无法收敛。

相关问题

1

如何使用python cut方法创建bin，接受一个参数并返回适当的bin？

2

从具有特定条件的列表列表创建字典

3

根据行值选择列，Python，Pandas

4

在数据框中绘制零和一的计数

5

python函数。

6

在两个DataFrame之间执行大量Pandas查找的最佳方法

7

如何获取Pandas数据透视表中的列数和每列的宽度？

8

在Pandas数据框中分组时缺少所需值时显示一列

9

Python隐藏壁虱但显示壁虱标签

10

获取Entry和checkbutton值Tkinter时出现问题

热门github

1

🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge management / RAG ), Multi-Modals (Plugins/Artifacts) and Thinking. One-click FREE deployment of your private ChatGPT/ Claude / DeepSeek application. (翻译：LobeChat 是开源的高性能聊天机器人框架，支持语音合成、多模态、可扩展的（Function Call）插件系统。)

2

Collection of leaked system prompts

3

Jelly Evolution Simulator

4

Master programming by recreating your favorite technologies from scratch. (翻译：在这个项目中，你能学会如何创造自己的各种工具，引擎，游戏，框架，库......)

5

Agent S: an open agentic framework that uses computers like a human

6

An open source payments switch written in Rust to make payments fast, reliable and affordable (翻译：YOLOv8 🚀 in PyTorch > ONNX > CoreML > TFLite)

7

Python - 100天从新手到大师

8

Truly independent web browser

9

Curated list of project-based tutorials (翻译：收藏了基于项目的教程列表)

10

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/ (翻译：12 节课程，开始使用生成式 AI 进行构建)

11

ChatGPT DAN, Jailbreaks prompt

12

A quick example of how one can "synchronize" a 3d scene across multiple windows using three.js and localStorage

13

real time face swap and one-click video deepfake with only a single image