Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

2d array indexing is very slow #95

Closed
ViralBShah opened this issue Jul 7, 2011 · 9 comments
Closed

2d array indexing is very slow #95

ViralBShah opened this issue Jul 7, 2011 · 9 comments
Assignees
Labels
performance Must go faster

Comments

@ViralBShah
Copy link
Member

2d array indexing is significantly slower than matlab for us.

*** MATLAB ***

function t = mytranspose(x)
[m, n] = size(x);
t = zeros(n, m);
for i=1:n
for j=1:m
t(i,j) = x(j,i);
end
end
end

a = ones(2000,2000);
tic; mytranspose(a); toc
Elapsed time is 0.132217 seconds.

*** Julia ***

function transpose(a::Matrix)
m,n = size(a)
b = similar(a, n, m)
for i=1:m, j=1:n
b[j,i] = a[i,j]
end
return b
end

julia> x = ones(2000,2000);

julia> tic(); y = x'; toc();
elapsed time: 2.17426395416259766 sec

@ghost ghost assigned JeffBezanson Jul 7, 2011
@JeffBezanson
Copy link
Member

I would love to have a small suite of tests like this, comparing to matlab on things like a[:,i], a[a:b,c:d], etc. as well.

@JeffBezanson
Copy link
Member

Another point: x' calls ctranspose, not the transpose function here. ctranspose uses a comprehension, which is slow due to tuple-allocation overhead. Compare again with x.'. I believe the performance bug is comprehensions, not 2d indexing.

@ViralBShah
Copy link
Member Author

I did notice that the transpose performance test in perf.j is looking quite good. Also, replacing ctranspose implementation for now.

-viral

On Jul 12, 2011, at 5:03 AM, JeffBezanson wrote:

Another point: x' calls ctranspose, not the transpose function here. ctranspose uses a comprehension, which is slow due to tuple-allocation overhead. Compare again with x.'. I believe the performance bug is comprehensions, not 2d indexing.

Reply to this email directly or view it on GitHub:
#95 (comment)

@ViralBShah
Copy link
Member Author

BTW, one of the things I have been trying for a while is to find a fast implementation of transpose. I just can't believe that they don't have it in BLAS. So far, it seems that transpose in FFTW is one we can try. Wonder if Bradley Kuzsmaul at MIT has a fast implementation - or if Charles Leiserson does.

-viral

On Jul 12, 2011, at 5:03 AM, JeffBezanson wrote:

Another point: x' calls ctranspose, not the transpose function here. ctranspose uses a comprehension, which is slow due to tuple-allocation overhead. Compare again with x.'. I believe the performance bug is comprehensions, not 2d indexing.

Reply to this email directly or view it on GitHub:
#95 (comment)

@ViralBShah
Copy link
Member Author

See kernel/transpose.c in FFTW. The routines are also exposed in the library. Can't quite yet figure out what the arguments are.

void X(transpose_tiled)(R *I, INT n, INT s0, INT s1, INT vl)

-viral

On Jul 12, 2011, at 5:03 AM, JeffBezanson wrote:

Another point: x' calls ctranspose, not the transpose function here. ctranspose uses a comprehension, which is slow due to tuple-allocation overhead. Compare again with x.'. I believe the performance bug is comprehensions, not 2d indexing.

Reply to this email directly or view it on GitHub:
#95 (comment)

@StefanKarpinski
Copy link
Member

Isn't it obvious? Documentation for the win :-P

@ViralBShah
Copy link
Member Author

What? I didn't understand a word of what you said.

-viral

On Jul 12, 2011, at 7:43 AM, StefanKarpinski wrote:

Isn't it obvious? Documentation for the win :-P

Reply to this email directly or view it on GitHub:
#95 (comment)

@StefanKarpinski
Copy link
Member

No, I mean that the arguments are completely opaquely named and there's no documentation. Complete transparency fail.

@JeffBezanson
Copy link
Member

Fixed in commit 5bf4bce.
Now the two versions of transpose perform the same.

StefanKarpinski pushed a commit that referenced this issue Feb 8, 2018
Add `@compat chol(A, Val{:U})`
inkydragon pushed a commit that referenced this issue Dec 15, 2024
Stdlib: SHA
URL: https://github.com/JuliaCrypto/SHA.jl.git
Stdlib branch: master
Julia branch: master
Old commit: aaf2df6
New commit: 8fa221d
Julia version: 1.12.0-DEV
SHA version: 0.7.0(Does not match)
Bump invoked by: @inkydragon
Powered by:
[BumpStdlibs.jl](https://github.com/JuliaLang/BumpStdlibs.jl)

Diff:
JuliaCrypto/SHA.jl@aaf2df6...8fa221d

```
$ git log --oneline aaf2df6..8fa221d
8fa221d ci: update doctest config (#120)
346b359 ci: Update ci config (#115)
aba9014 Fix type mismatch for `shake/digest!` and setup x86 ci (#117)
0b76d04 Merge pull request #114 from JuliaCrypto/dependabot/github_actions/codecov/codecov-action-5
5094d9d Update .github/workflows/CI.yml
45596b1 Bump codecov/codecov-action from 4 to 5
230ab51 test: remove outdate tests (#113)
7f25aa8 rm: Duplicated const alias (#111)
aa72f73 [SHA3] Fix padding special-case (#108)
3a01401 Delete Manifest.toml (#109)
da351bb Remvoe all getproperty funcs (#99)
4eee84f Bump codecov/codecov-action from 3 to 4 (#104)
15f7dbc Bump codecov/codecov-action from 1 to 3 (#102)
860e6b9 Bump actions/checkout from 2 to 4 (#103)
8e5f0ea Add dependabot to auto update github actions (#100)
4ab324c Merge pull request #98 from fork4jl/sha512-t
a658829 SHA-512: add ref to NIST standard
11a4c73 Apply suggestions from code review
969f867 Merge pull request #97 from fingolfin/mh/Vector
b1401fb SHA-512: add NIST test
4d7091b SHA-512: add to docs
09fef9a SHA-512: test SHA-512/224, SHA-512/256
7201b74 SHA-512: impl SHA-512/224, SHA-512/256
4ab85ad Array -> Vector
8ef91b6 fixed bug in padding for shake, addes testcases for full code coverage (#95)
88e1c83 Remove non-existent property (#75)
068f85d shake128,shake256: fixed typo in export declarations (#93)
176baaa SHA3 xof shake128 and shake256  (#92)
e1af7dd Hardcode doc edit backlink
```

Co-authored-by: Dilum Aluthge <[email protected]>
DilumAluthge added a commit that referenced this issue Dec 17, 2024
Stdlib: Distributed
URL: https://github.com/JuliaLang/Distributed.jl
Stdlib branch: master
Julia branch: master
Old commit: 6c7cdb5
New commit: c613685
Julia version: 1.12.0-DEV
Distributed version: 1.11.0(Does not match)
Bump invoked by: @DilumAluthge
Powered by:
[BumpStdlibs.jl](https://github.com/JuliaLang/BumpStdlibs.jl)

Diff:
JuliaLang/Distributed.jl@6c7cdb5...c613685

```
$ git log --oneline 6c7cdb5..c613685
c613685 Merge pull request #116 from JuliaLang/ci-caching
20e2ce7 Use julia-actions/cache in CI
9c5d73a Merge pull request #112 from JuliaLang/dependabot/github_actions/codecov/codecov-action-5
ed12496 Merge pull request #107 from JamesWrigley/remotechannel-empty
010828a Update .github/workflows/ci.yml
11451a8 Bump codecov/codecov-action from 4 to 5
8b5983b Merge branch 'master' into remotechannel-empty
729ba6a Fix docstring of `@everywhere` (#110)
af89e6c Adding better docs to exeflags kwarg (#108)
8537424 Implement Base.isempty(::RemoteChannel)
6a0383b Add a wait(::[Abstract]WorkerPool) (#106)
1cd2677 Bump codecov/codecov-action from 1 to 4 (#96)
cde4078 Bump actions/cache from 1 to 4 (#98)
6c8245a Bump julia-actions/setup-julia from 1 to 2 (#97)
1ffaac8 Bump actions/checkout from 2 to 4 (#99)
8e3f849 Fix RemoteChannel iterator interface (#100)
f4aaf1b Fix markdown errors in README.md (#95)
2017da9 Merge pull request #103 from JuliaLang/sf/sigquit_instead
07389dd Use `SIGQUIT` instead of `SIGTERM`
```

Co-authored-by: Dilum Aluthge <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
performance Must go faster
Projects
None yet
Development

No branches or pull requests

3 participants