Warm tip: This article is reproduced from serverfault.com, please click

python tensorflow

TensorFlow layer that converts a 2D matrix to a vector of certain length

发布于 2020-11-28 14:44:00

I am trying to build a neural network that takes in data in form of a matrix and outputs a vector but I don't know what layers to use to perform that. My input has shape (10,4) and my desired output has shape (3,). My current model is the following :

model = tf.keras.Sequential([
    tf.keras.layers.Dense(256,activation="relu"),
    tf.keras.layers.Dense(256,activation="relu"),
    tf.keras.layers.Dense(1),
])

this at least results in a vector instead of a matrix but it has (10,) instead of (3,). I could probably find a way to reduce that to (3,) but I doubt I am doing the correct thing with this approach.

Questioner

Lukas Gradl

Viewed

0

Akshay Sehgal 2020-11-28 22:54:12

Assuming that your (10,4) is a matrix which doesn't represent a 10 length sequence (where you will need an LSTM) OR an image (where you will need a 2D CNN), you can simply flatten() the input matrix and pass it through to the next few dense layers as below.

from tensorflow.keras import layers, Model

inp = layers.Input((10,4)) #none,10,4
x = layers.Flatten()(inp)  #none,40
x = layers.Dense(256)(x)   #none,256
out = layers.Dense(3)(x)   #none,3

model = Model(inp, out)
model.summary()

Layer (type)                 Output Shape              Param #   
=================================================================
input_43 (InputLayer)        [(None, 10, 4)]           0         
_________________________________________________________________
flatten_1 (Flatten)          (None, 40)                0         
_________________________________________________________________
dense_82 (Dense)             (None, 256)               10496     
_________________________________________________________________
dense_83 (Dense)             (None, 3)                 771       
=================================================================
Total params: 11,267
Trainable params: 11,267
Non-trainable params: 0

Lukas Gradl 2020-11-28 15:12:28

I tried this, but it doesn't work for me for some reason. data.shape TensorShape([10, 4]) x = tf.keras.layers.Flatten()(data) x = tf.keras.layers.Dense(256)(x) x = tf.keras.layers.Dense(3)(x) x.shape TensorShape([10, 4])

Lukas Gradl 2020-11-28 15:29:07

it does work, my bad. Apparently the Flatten() Layer doesn't do anything to the matrix because it expects the first dimension to be the batch size. If I use model.fit(data) it will internally add the batch size as a dimension(i think) and work as you described.

热门帖子

1

出一些有意思的域名-明盘

2

为什么我的 TG 始终收不到提醒通知？

3

求助一个排查了半年没解决的 MySQL order by 子句导致索引失效的问题， 500 多万条记录的小表要查快两分钟

4

Android 开发方向：传统 View 开发 or 拥抱 Jetpack Compose

5

浙江 ISP 会屏蔽 Wireguard UDP 端口

6

高级 Java 岗位一定需要有管理经验吗

7

兄弟们，老是倒在大厂二面的怎么办？

8

最近很多小伙伴弄港卡，分享一下个人开卡经历

9

vercel 免费版 3 个指标都超了，好像我的网站已经获得了基础流量。还好套了 cloudflare 不然流量也超了。

10

最近换了真我手机，发现这系统连应用的数据都无法备份

热门github

1

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

2

A Windows and Office activator using HWID / Ohook / KMS38 / Online KMS activation methods, with a focus on open-source code and fewer antivirus detections.

3

Get up and running with Llama 2, Mistral, Gemma, and other large language models.

4

该项目可以让你通过订阅的方式使用Cloudflare WARP+，自动获取流量。This project enables you to use Cloudflare WARP+ through subscription, automatically acquiring traffic.

5

Multi functional app to find duplicates, empty folders, similar images etc.

6

Xray panel supporting multi-protocol multi-user expire day & traffic & ip limit (Vmess & Vless & Trojan & ShadowSocks & Wireguard)

7

The Free Software Media System

8

lightweight, standalone C++ inference engine for Google's Gemma models.

9

📚 Freely available programming books

10

A collective list of free APIs

11

1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java

12

🎓 Path to a free self-taught education in Computer Science!

13

Curso para aprender el lenguaje de programación Python desde cero y para principiantes. 75 clases, 37 horas en vídeo, código, proyectos y grupo de chat. Fundamentos, frontend, backend, testing, IA...

14

This repository contains System Design resources which are useful while preparing for interviews and learning Distributed Systems

15

Mamba is a new state space model architecture showing promising performance on information-dense data such as language modeling, where previous subquadratic models fall short of Transformers. It is based on the line of progress on structured state space models, with an efficient hardware-aware design and implementation in the spirit of FlashAttention.