I have a Cosmos DB Gremlin API account set up with 400 RU/s. If I have to run a query that needs 800 RUs, does that mean the query takes 2 seconds to execute? If I increase the throughput to 1600 RU/s, does the query execute in half a second? I am not seeing any significant change in query performance when I adjust the RUs.
As I explained in a different, but somewhat related, answer here, Request Units are allocated on a per-second basis. Here's what happens when a given query costs more than the number of Request Units available in that one-second window:
Let's say you had 400 RU/sec, and you executed a query that cost 800 RU. It would complete, but you'd have consumed two seconds' worth of budget (800 RU ÷ 400 RU/sec), so further requests would be throttled for roughly one more second. After that window passes, you wouldn't be throttled anymore.
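To make the back-of-envelope math concrete, here's a tiny sketch (my own illustrative model, not an official formula - the service's actual accounting is more nuanced) of how many seconds of budget a single query consumes, and therefore roughly how long you'd be throttled afterward:

```python
def throttle_debt_seconds(query_cost_ru, provisioned_ru_per_sec):
    """Estimate how long follow-up requests would be throttled.

    A query consumes (cost / provisioned) seconds' worth of budget;
    anything beyond the first second is roughly the throttle window.
    Illustrative only - real accounting in the service is more nuanced.
    """
    seconds_of_budget = query_cost_ru / provisioned_ru_per_sec
    return max(0.0, seconds_of_budget - 1.0)

# 800 RU query against 400 RU/s: two seconds' worth of budget,
# so roughly one extra second of throttling.
print(throttle_debt_seconds(800, 400))   # 1.0

# The same query at 1600 RU/s fits inside one second's budget: no debt.
print(throttle_debt_seconds(800, 1600))  # 0.0
```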
The speed at which a query executes does not depend on the number of RUs allocated. Whether you have 1,000 RU/sec or 100,000 RU/sec, a query runs in the same amount of time (aside from any throttling that delays it from starting). So, aside from throttling, your 800 RU query would run in a consistent amount of time, regardless of the RU count.
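When you are throttled, Cosmos DB returns a 429 ("request rate too large") with a suggested wait time, and the SDKs typically retry for you. The sketch below shows the general retry pattern with a hypothetical exception class standing in for whatever your Gremlin client actually raises - the names here are assumptions, not a real SDK API:

```python
import time

class RequestRateTooLargeError(Exception):
    """Hypothetical stand-in for the 429 error a Cosmos DB client raises."""
    def __init__(self, retry_after_ms):
        super().__init__("429: request rate too large")
        self.retry_after_ms = retry_after_ms

def execute_with_retries(operation, max_retries=5):
    """Run operation(), honoring the server-suggested back-off on 429s."""
    for _ in range(max_retries):
        try:
            return operation()
        except RequestRateTooLargeError as err:
            # Wait as long as the server suggested before retrying.
            time.sleep(err.retry_after_ms / 1000.0)
    return operation()  # final attempt; let any error propagate
```

In practice you'd rarely write this yourself - check your SDK's built-in retry policy first - but it shows why a throttled query appears "slow": the extra time is the back-off, not the execution.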
Makes sense, thank you. So if I have batch jobs to run (which need more RUs), would it be a good idea to run them during off-peak hours, to make sure customers are not throttled during regular business hours? In other words, if I am OK with some throttling during off-peak hours, can I keep my throughput at the minimum and run the expensive jobs off-peak?
@MichaelScott - honestly, how you distribute traffic is up to you. However, if I were in your position, I'd likely increase my RU capacity during peak hours and decrease it off-peak. You have complete flexibility over RU allocation - you can adjust it at any time. Just consider the cost of a few hundred extra RU: it's fairly negligible, even more so if you only raise the RU for part of each day.
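One way to implement that suggestion is a simple time-of-day schedule that a scheduled job consults before updating the container's throughput (via the portal, CLI, or SDK - whichever you use). The hours and RU values below are assumptions; tune them to your own traffic pattern:

```python
# Assumed business hours and RU targets - adjust to your actual traffic.
PEAK_HOURS = range(8, 18)   # 08:00-17:59 local time
PEAK_RU = 1600
OFF_PEAK_RU = 400

def target_throughput(hour):
    """Return the RU/s target for a given hour of day (0-23)."""
    return PEAK_RU if hour in PEAK_HOURS else OFF_PEAK_RU
```

A timer-triggered job (a cron task, an Azure Function, etc.) could call this each hour and apply the result with your management tool of choice; the scheduling logic itself is the only part sketched here.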