Note: This article is reproduced from serverfault.com.

query on aws kinesis put-record through cli

Published on 2020-11-29 05:39:14

This is regarding aws kinesis put-record command through AWS CLI.

I am able to input text data using kinesis cli.

aws kinesis put-record --cli-binary-format raw-in-base64-out --stream-name NagaTZeusTestStream --partition-key 1 --data 2 --region  us-west-2

Here the data is 2.

But how can I put a CSV file in place of 2 as the data?

And how can I put a CSV file that is in S3?

For example :

aws kinesis put-record --cli-binary-format raw-in-base64-out --stream-name NagaTZeusStream --partition-key 1 --data s3://cona-sample-salesforce-data/testdata/ --region us-west-2

In this case the CSV file in the S3 bucket should be uploaded as the data record, but Kinesis treats the S3 path itself as the data string.

Any help would be appreciated. Thanks in advance.

Questioner
nagasatish chilakamarti
Adam Batkin 2020-11-29 13:47:44

Kinesis Streams lets you write opaque blobs of data. The Kinesis PutRecord API (which is what the AWS CLI kinesis put-record command calls) expects you to give it a blob of data. If the data is stored in S3, it is your responsibility to load that data to send to Kinesis.
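For the original question, one way to do that loading yourself is a two-step sketch with the CLI: copy the object down from S3, then pass the local file's contents as the record data using the fileb:// prefix (which makes the CLI read the file's raw bytes). The object key sample.csv below is hypothetical, since the question only gives the prefix s3://cona-sample-salesforce-data/testdata/; substitute your real key.

```shell
# Hypothetical object key under the question's prefix; adjust to your real key.
aws s3 cp s3://cona-sample-salesforce-data/testdata/sample.csv ./sample.csv

# fileb:// tells the CLI to read the file's raw bytes as the record data,
# so the bytes on the stream are exactly the CSV contents.
aws kinesis put-record \
  --stream-name NagaTZeusTestStream \
  --partition-key 1 \
  --data fileb://sample.csv \
  --region us-west-2
```

Keep in mind a single Kinesis record is limited to 1 MB of data, so this only works for small files; larger CSVs need to be split into multiple records or handled with the pattern described below.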

A common Kinesis pattern when working with "large" data is to put the actual data into some other storage system (S3 being a great example) and then write the "location" of that data (in this case, an S3 path) to Kinesis. With Kinesis Streams, your throughput (and costs) to/from Kinesis are directly affected by the amount of data you read/write, so sending a small pointer instead of the full payload keeps both down. This of course requires coordination between the publisher and consumer on the exact format (and semantics) of the messages. If this is the case, you should look at what format your consumers expect the message to be in.
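That pointer pattern might look like the following. The JSON shape and the s3_path field name are purely illustrative assumptions, not a standard; the only requirement is that your consumers agree on them.

```shell
# Write a small JSON "pointer" to the data instead of the data itself;
# the consumer reads this record and fetches the object from S3.
# The message shape and object key here are illustrative.
aws kinesis put-record \
  --cli-binary-format raw-in-base64-out \
  --stream-name NagaTZeusStream \
  --partition-key 1 \
  --data '{"s3_path": "s3://cona-sample-salesforce-data/testdata/sample.csv"}' \
  --region us-west-2
```

Because --cli-binary-format raw-in-base64-out is set, the CLI treats the --data string as raw text and base64-encodes it for the API, so the consumer receives the JSON string verbatim after decoding.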

But the moral of the story here is that Kinesis (and the CLI's put-record) will write exactly the bytes you give it.