Warm tip: This article is reproduced from serverfault.com, please click

find julia matlab optimization search

Efficiently implementing Matlab's "Find" function in Julia

发布于 2020-12-01 23:59:20

I'm trying to implement Matlab's Find function in Julia. In Matlab, the code is

find(A==0)

where A is a very, very large n by m matrix, and where I iterate and update the above over a series of about 500 steps. In Julia, I implement the above via

[findall(x->x==0, D_tot)[j][2] for j in 1:count(x->x==0,D_tot)]

This seems to work nicely, except it goes very slow as I progress with my iteration. For example, for the first step, @time yields

0.000432 seconds (33 allocations: 3.141 KiB)

Step 25:

0.546958 seconds (40.37 k allocations: 389.997 MiB, 7.40% gc time)

Step 65:

1.765892 seconds (86.73 k allocations: 1.516 GiB, 9.63% gc time)

At each step, A remains the same size but becomes more complex, and Julia seems to have trouble finding the zeroes. Is there a better way of implementing Matlab's "Find" function than what I did above?

Questioner

Joshuah Heath

Viewed

0

Przemyslaw Szufel 2020-12-02 08:44:03

Going through the Matlab documentation I understand that you want to find

"a vector containing the linear indices of each nonzero element in array X"

and by non-zero you meant true values in Matlab's expression A==0

In that case this can be accomplished as

findall(==(0),vec(D_tot))

And a small benchmark:

D_tot=rand(0:100,1000,1000)
using BenchmarkTools

Running:

julia> @btime findall(==(0), vec($D_tot));
  615.100 μs (17 allocations: 256.80 KiB)

julia> @btime findall(iszero, vec($D_tot));
  665.799 μs (17 allocations: 256.80 KiB)

DNF 2020-12-02 00:50:30

That timing is very strange. What's going on with iszero? Have you tried several times?

Przemyslaw Szufel 2020-12-02 01:13:42

yes several times and it is strange indeed. I run @code_native and there are tiny differences in the assembly code (even the number of assembly instructions differs by 3) - so this seems not to be random.

热门帖子

1

C++新手，求助一个关于怎么使用第三方库的问题

2

关于英语学习的重要性的思考

3

出美版 iPhone 13promax， V 友 4600 吧

4

这里分享一个免费的在线 PDF 总结工具： NoteGPT

5

没想到 Arc 浏览器对网络要求如此严格

6

深陷消费主义陷阱的背后，是我空洞的灵魂

7

做了一个以凡人修仙传境界为基础的 Github 统计卡片

8

从程序员角度思考一下，为什么我的支付宝能收到其他人的 12306 购票推送消息

9

[上海] 招中级前端开发工程师

10

澳大利亚🇦🇺归来~第一次去南半球，虽然看过很多次照片，亲临大洋路时仍觉震撼

热门github

1

A multi-platform library for OpenGL, OpenGL ES, Vulkan, window and input

2

Dev tool that writes scalable apps from scratch while the developer oversees the implementation

3

shadcn/ui, but for Svelte. ✨

4

The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and machine learning engineers to proactively find risks in their generative AI systems.

5

Performance-portable, length-agnostic SIMD with runtime dispatch

6

ZK Credo

7

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

8

Joplin - the secure note taking and to-do app with synchronisation capabilities for Windows, macOS, Linux, Android and iOS.

9

Mamba is a new state space model architecture showing promising performance on information-dense data such as language modeling, where previous subquadratic models fall short of Transformers. It is based on the line of progress on structured state space models, with an efficient hardware-aware design and implementation in the spirit of FlashAttention.

10

This repository contains System Design resources which are useful while preparing for interviews and learning Distributed Systems

11

Curso para aprender el lenguaje de programación Python desde cero y para principiantes. 75 clases, 37 horas en vídeo, código, proyectos y grupo de chat. Fundamentos, frontend, backend, testing, IA...

12

🎓 Path to a free self-taught education in Computer Science!

13

1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java

14

A collective list of free APIs

15

📚 Freely available programming books