Parallella: The Most Energy Efficient Supercomputer - Ray Hightower of ROIClear

  Рет қаралды 412,708

Ray Hightower

Ray Hightower

Күн бұрын

Пікірлер
@gene4390
@gene4390 7 жыл бұрын
The most efficient computer I ever saw (I own 2 of them) made in the 1980s the Casio FX-790P. It had built in basic programming language, scientific functions, 16kb of ram, and ran at 1Mzh (very good for the early 80s) and could run for 2 years off two tiny little watch batteries! I used mine mainly in collage and wrote my own programs. I even programed several games for it. lol Almost 35+ years later I still use the FX-790P (or renamed Tandy PC-6) durable micro computer to this day.
@MatthewHolevinski
@MatthewHolevinski 7 жыл бұрын
but can it run lisp?
@kevincozens6837
@kevincozens6837 6 жыл бұрын
The parallel "hello, world" program failed. At 12:26 there are 20 responses from 16 cores. Three cores (0,1 0,3 and 2,1) never responded. Four cores responded multiple times.
@CryptoJones
@CryptoJones 6 жыл бұрын
Mr. Hightower, this motivates me to study parallelism more in-depth. Thank you for this.
@JohnVegas
@JohnVegas 8 жыл бұрын
I always enjoy your presentations. God bless!
@antonnym214
@antonnym214 8 жыл бұрын
Nice talk. Outstanding machine, and you present it very well.
@rainbowbunchie8237
@rainbowbunchie8237 8 жыл бұрын
When your electronics become obsolete, put them in a drawer and keep them forever. Electronic things are WAY too cool to throw away, no matter how old they are. =P
@pyrographic380
@pyrographic380 6 жыл бұрын
yeah
@satibel
@satibel 6 жыл бұрын
nasty ass-dildo or nasty-ass dildo? :p
@antonnym214
@antonnym214 8 жыл бұрын
Dr. Hightower, this is a very nice presentation. I like your style and how well you explain it for the layperson. It's pretty exciting to run a single module with that little solar generator. Makes me think it would be quite feasible to power a huge array of those with just a few solar panels on the roof. It could be virtually free to power the system. Lots of possibilities there, because for most installations, the challenge is covering the operating costs, as opposed to the initial expense.
@AndrewHelgeCox
@AndrewHelgeCox 8 жыл бұрын
This is quite interesting in that it is a talk given by a person who is clearly not an expert in his subjects of parallel programming, or really of anything he touches on, but it still manages to be a little bit entertaining.
@antonnym214
@antonnym214 8 жыл бұрын
45-seconds to fully boot is pretty impressive, compared to my win7 box.
@pieterrossouw8596
@pieterrossouw8596 9 жыл бұрын
1GB RAM with 18 cores, for a lot of HPC applications, that is going to be a catastrophic bottleneck. In x86 architecture compute clusters, a "golden rule" is 2GB per processing core, depending on application obviously. Sure these cores are comparatively weak, but since RAM chips are pretty inexpensive, it's a shame that for that price they didn't include at least 4GB of RAM.
@MichaelPohoreski
@MichaelPohoreski 9 жыл бұрын
+Pieter Rossouw Yup still waiting for an extremely low-cost 16 GB + 8 core, or hell, even 4GB + 4 core device. While the Raspberry Pi 2B, Banana Pi and Parallella are all "nice" SoC embedded devices the lack of 4GB+ RAM gimps theses devices from more "serious" work where our data sets are larger. :-/
@walter0bz
@walter0bz 8 жыл бұрын
+Pieter Rossouw these are 'little cores', more comparable to GPU warps or SIMD lanes. one x86 core is equivalent to several parallela cores (it might be as many as 16, depends on pipeline depth, simd, execution units, I don't know off hand), so it's still about right. The parallela concept is still worthwhile, GPUs prove more,simpler cores have higher throughput. a big-core spends huge resources figuring out parallelism from a single thread on the fly. nonetheless the board has other problems, but they have to start somewhere with this new architecture needing new software. it would be perfect for AI work IMO (dataflow)
@0xf7c8
@0xf7c8 8 жыл бұрын
+Pieter Rossouw If you see closely this is a Xilinx chip, probably a FPGA, so i would call it a mounted prototype. Its hard to put 4gigs of ram in a FPGA
@llothar68
@llothar68 8 жыл бұрын
+Pieter Rossouw The problem is not only the RAM (yes 1GB per core is important and at least 64KB cache per core) but the RAM throughput. With just one memory channel you will get almost 0 parallelism in many real world tasks. I'm not even think it's a good teaching device because of this restrictions which do not let you draw conclusions about bottlenecks when everything is a bottleneck.
@walter0bz
@walter0bz 8 жыл бұрын
it's really a forward looking experimental device, PGAS architecture would scale far better than anything else, but they didn't got the budget to build a large chip with newer process yet (the concept only makes sense when scaled up to thousands of cores). Chicken/egg situation with software
@mehmetedex
@mehmetedex 7 жыл бұрын
I can listen forever this guy. Great speech
@duderobi
@duderobi 8 жыл бұрын
3:25 dit I hear right 2 ARM (Acorn Risc Mashine) and 14 Risc cores?
@56335130
@56335130 5 жыл бұрын
xilinx zynq is a fpga soc
@ragsdale9
@ragsdale9 8 жыл бұрын
im curious if the parallella would increase wattage under high utilization.
@larrycastro7937
@larrycastro7937 8 жыл бұрын
I stumbled onto this website, and thought it was fascinating. All I know is about Moore'slaw, doubling of transistors on a microchip every eighteen months.
@ForbiddenUser403
@ForbiddenUser403 8 жыл бұрын
What we really need is a parallel platform with the individual nodes configured like hot swappable modules with the ability to plug them all into a centralized expandable location with a virtualization software solution that's able to recognize the resources of all those "blades" and utilize them, and see them as traditional PC hardware allowing the use of traditional software and OS's without the need to rewrite all application to make use of parallel processing individually..
@jgbreezer
@jgbreezer 7 жыл бұрын
Computer (software) can't yet parallelise problems for us automatically well enough, we still need to write things in a way ready for this. Its getting more and more the default way of writing things for scaling horizontally rather than vertically nowadays in the commercial world, but still not ready for low-level parallelism in a large way. Cultural change required.
@stevebez2767
@stevebez2767 7 жыл бұрын
so buy the board write the program too do nust tjat,parallel programming next stop quanta?
@neilruedlinger4851
@neilruedlinger4851 6 жыл бұрын
Sounds like a worthwhile project for a savvy start-up company?
@oldchannel6511
@oldchannel6511 9 жыл бұрын
18 cores and 1GB RAM.. Absolute savage.
@hydrochloricacid2146
@hydrochloricacid2146 8 жыл бұрын
Bottleneck FTW
@KianGurney
@KianGurney 8 жыл бұрын
+CasualMods 7 gamers, one CPU.
@0xf7c8
@0xf7c8 8 жыл бұрын
+Nerd You have no idea what you are talking about.
@oldchannel6511
@oldchannel6511 8 жыл бұрын
Yeah I do, lmao.
@0xf7c8
@0xf7c8 8 жыл бұрын
I'll put it easy for you. You have in your head the concept that this cores are even close to a modern x86 core, when this is not the case. This cores are not even as powerful as 1 single Cuda core in a gpu. A mid-range GPU has, let's say, 650 cuda cores and 2gb of ram. And with that amount of ram they have all the memory they can handle without overshooting. And gpus can easily be used as clusters and in fact they are. I'm not saying that this design is perfectly well thought and they have nothing to improve, but that 1gb of ram in this kind of device is not as crazy as you think.
@FlumenSanctiViti
@FlumenSanctiViti 6 жыл бұрын
I'm not a programmer, but... his code at 12:56 should return TRUE for input number 4?
@SudoPi
@SudoPi 9 жыл бұрын
It will be way cooler if this would be maybe about 40$ or so. 150$ is a big price to ask from consumers to purchase a SBC
@assaulth3ro911
@assaulth3ro911 9 жыл бұрын
+The Random Stuff Yeah. It is however different from a Pi, I think $75-$100 would be more fair.
@mysticvirgo9318
@mysticvirgo9318 9 жыл бұрын
+The Random Stuff will most likely get less expensive per unit as they sell more and more :)
@supercompy
@supercompy 9 жыл бұрын
+The Random Stuff They are $75 for the micro-server version and $99 for the desktop version right now on amazon. I think that is a fair price considering the number of cores.
@voyager1bg
@voyager1bg 9 жыл бұрын
+The Random Stuff not that expensive, we're talking supercomputing here... I believe such advancements are the future
@SudoPi
@SudoPi 9 жыл бұрын
Yea but if the price is 35$ like the Raspberry Pi than it would probably be more interesting to customers since not everyone would be willing to pay $150 just to tinker around but as you said not that expensive but it really depends on who is looking at the price point and for me, the 35$ price tag on the Pi 2 is cooler
@Rarius
@Rarius 8 жыл бұрын
1) Note that he compares his 18 core system with just a single core of the Mac, not with running on all four cores! 2) I coded up this algorithm in C# on my two year old PC (Intel i5-3570K!)... and even running single threaded it managed it in 6.65s... three times faster than this Parallella, and twice as fast as the Mac! 3) This is a pretty poor algorithm for finding primes... There are FAR better ones. For instance, on my PC, the sieve of Eratosthenes algorithm gets the same result in 0.38s! Better algorithms often (usually?) yield better results than throwing more hardware at a problem. While applaud the effort going into the Parallella, it needs to be significantly faster before it is worth investing in. It might be interesting to see how a stack of Raspberry Pi 3s (you can get 4 Pis with change from $150) would do with their 16 cores.
@fatkidd7782
@fatkidd7782 8 жыл бұрын
everybody needs to read this
@dialupdavid
@dialupdavid 8 жыл бұрын
This was my first thought too, no idea why in the hell anyone thought it would be a good idea to compare a single thread of a Quad core/ Eight thread system to a Dual core ARM chip with 16 Co-processing cores. Makes no logical sense too me, were they that offended in how low the performance was? To me this was nothing technical, this guy was no Engineer/Enthusiast; solely a salesman with a sales pitch.
@owatson67
@owatson67 8 жыл бұрын
Yeah but does your PC use 5 watts and did it cost $150? I haven't run this algorithmic on my PC yet but i know it would push a good time. It has a i7-6700HQ which is quad core CPU with 8 threads but I know that it would beat it but it's not the point.
@Rarius
@Rarius 8 жыл бұрын
No my PC doesn't consume 5 watts or cost $150... but neither does the Apple he compares the Parallella with. Actually, you could build a PC for less than $150 using second hand parts that would outperform the Parallela AND be much easier to program. I suspect that a $150 cluster of Raspberry Pis would give it a good run for its money too.
@dialupdavid
@dialupdavid 8 жыл бұрын
Thunder o Well, the Tegra X1 has about the same Power requirements, and a 256 Core Maxwell GPU. Anyone who's going to do parallel processing is going to be 10x better off using CUDA or openCL. Not to mention the Actual A57's in that SoC are probably faster than the entire Parallela board by a factor of 3-4.
@idhan
@idhan 8 жыл бұрын
the prime calculation program can be easily run in parallel on the mac.. assuming it has 4 logical processors.. it could run in about 3.5 seconds. That should be the real comparison.. saying that, still the Parallella is an amazing peace of hardware :-)
@altEFG
@altEFG 6 жыл бұрын
4 times vs 13 times faster in parallel
@tigerbody69
@tigerbody69 6 жыл бұрын
please make a vide and show us
@eggraf
@eggraf 8 жыл бұрын
Run it in parallel on the Mac. he only ran it serially...
@SarahC2
@SarahC2 7 жыл бұрын
3.6 seconds...
@minecraftermad
@minecraftermad 7 жыл бұрын
kek now do it on a 5W vega or ryzen based thingy (most energy efficient from what i've heard but might be wrong about ryzen)
@jvebarnes
@jvebarnes 6 жыл бұрын
2015 vs 2018 we cannot know the future
@MrGencyExit64
@MrGencyExit64 8 жыл бұрын
GPU cores are general-purpose too, they just work at peak efficiency when coupled with specialized hardware that handles scheduling, memory fetches, etc. for the sorts of patterns (i.e. 4 pixels at a time) used in rendering. You'd need A LOT more of them to achieve their kind of performance without that extra support hardware :)
@dogeeconomist4825
@dogeeconomist4825 7 жыл бұрын
I'm gonna have to start buying one of these every now and then and setting them up as an ever-growing cluster for BOINC. Much interest in future offers and capabilities as well as competing products as they emerge.
@TrueRebel
@TrueRebel 7 жыл бұрын
XcellenT info Ray... Parallella is the future of Super Computing and that audience couldn't make the math, ha ha ha ha ha ha ha ha. Congratulations Ray
@davecc0000
@davecc0000 7 жыл бұрын
Excellent presentation, understandable, great examples.
@stevebez2767
@stevebez2767 8 жыл бұрын
Well done with that,actual methodically harden course to project kickstart as well!
@jessstuart7495
@jessstuart7495 8 жыл бұрын
Any programming language or compiler developments for developing parallel software? I would think the compiler would have to know a lot about the underlying architecture to be able to produce software to efficiently allocate and manage the cores and memory.
@justy1337
@justy1337 6 жыл бұрын
Just wished that the video was in full hd.
@mike_98058
@mike_98058 8 жыл бұрын
Mr Hightower failed to demonstrate that Parallela was circa 2015 the most energy efficient supercomputer on the planet. He failed to compute the efficiency in terms of FLOPS/watt which was his initial basis of comparison.
@GeekBoy03
@GeekBoy03 7 жыл бұрын
The item is actually from 2012, but released in 2013. Four years, and nothing new from them
@neilruedlinger4851
@neilruedlinger4851 6 жыл бұрын
I did a computation based on Watts per Core. The Parallela is 18.26 times more energy efficient than the Tianha-2.
@pwnmeisterage
@pwnmeisterage 6 жыл бұрын
Now there's Epiphany-V, a 1024-core RISC SoC, and Epiphany-VI is already underway. Tianha-2 undergoes constant rotating upgrades as the Xeon/Phi cores are rescaled up and out. The presenter did explain that raw core count is not entirely meaningful in "real" (complex) problems, he was only able to demonstrate their overwhelming advantage in "optimum" (simplex) problems. The Tianha-2 was designed to be a unified distributed supercomputing platform, not an ad-hoc modular system with "limitless" expandibility - I suspect that in the real world it uses far less electrical power than the huge number of Parallella SBCs that would be needed to solve the same problems in the same time. Even energy efficiencies aren't linear, you can't just keep stacking LEGO computer modules together indefinitely, there are diminishing returns.
@monad_tcp
@monad_tcp 6 жыл бұрын
Without a new programming model you can't extract all that performance. Of course the culprit is the C language, but not everyone can program in Haskell yet.
@andrewyork3869
@andrewyork3869 6 жыл бұрын
@@monad_tcp what about ASM?
@draken68
@draken68 7 жыл бұрын
Very interesting video. What i got out of that is in metro Australia we pay $2-$2.50 in rural Australia $3-$3.50 (Australian Dollars per Watt)
@IDoThisInMySpareTime
@IDoThisInMySpareTime 9 жыл бұрын
Interesting video, on geek.com they listed the parallella as achiving around 90 GFLOPS. So if i did the math correctly there, a supercomputer with the processing power of Tianhe-2 (listed on wiki at 17.6 MW) would require a cluster of around 400k parallellas and run at around 2 MW?
@adavistheravyn573
@adavistheravyn573 7 жыл бұрын
I happen to work in the field of high-performance computing and had something similar in mind for my own numerical simulations or BOINC stuff. Before spending hundreds of Euros for RPi3 boards, I did some tests with a special version of my nbody code which is written in C, highly optimized and utilizes the OpenMP library for parallel computing. My benchmark focussed on floating-point performance with negligible RAM consumption. What are my results? Well, it's disillusioning. My benchmark task took 65 minutes on a single core of my RPi3, while a single Core i5-6500 solved the problem in 65 seconds! Using four threads, the RPi3 still took more than 18 minutes, while my Intel Core i5-6500 got that job done in 17 seconds. Conclusion: Neglecting communication overhead, I would have to come up with more than 60 RPi3 boards to get on par with a decent Core i5-6500 ... ARM might give you more FLOPS per Watt, but when it comes to pure floating-point performance, the architecture is still far behind.
@GeekBoy03
@GeekBoy03 7 жыл бұрын
ARM processors come in a very large variety, with up to 8 cores, The Raspberry Pi 3 uses a lower end ARM Cortex-53. The upper end is the Cortex-A75. But remember, ARM has low power usage as a priority. ARM is certainly getting more powerful, and some companies has started making Laptops with AMR processors. Remember, the RB Pi is just for projects, and prototyping.
@atranimecs
@atranimecs 8 жыл бұрын
It's not about the raw power of parallella, its about the performance/watt ratio and also the heat output.
@GeekBoy03
@GeekBoy03 8 жыл бұрын
Seems Parallella is fizzing out. Three years, and no new models.
@teknostatik1055
@teknostatik1055 8 жыл бұрын
Could just be gaining momentum since it's a fairly new product.
@GeekBoy03
@GeekBoy03 8 жыл бұрын
Tekno Statik Three years in technology is a very long time. It's had more than enough time to get grounded, and new models to appear.
@teknostatik1055
@teknostatik1055 8 жыл бұрын
Yeah, no... What parallela is doing is it's adding one or more dimensions for instructions to be run in parallel to each other (hence the name). Where it gets complicated is HOW the work is split because not all tasks require the same level of splitting, not all tasks can be split the same way, this example took trial and error to be split into the correct number of cores, and the program had to be re-written from serial to work in parallel.
@GeekBoy03
@GeekBoy03 8 жыл бұрын
Tekno Statik I take if you have zero understanding of product life cycles. I was referring to nothing new coming out in three years, not learning how to program the thing.
@teknostatik1055
@teknostatik1055 8 жыл бұрын
And I take it you know nothing of programming. How are we supposed to translate every program from serial into parallel? Have you any concept of the implications that go BEYOND your so-called "technology" and "product"? Without parallel programming there will be no product.
@rudde7251
@rudde7251 9 жыл бұрын
When you find primes, do you check up till sqrt rounded up or rounded down?
@MrBrew4321
@MrBrew4321 9 жыл бұрын
+Rudde down
@rudde7251
@rudde7251 9 жыл бұрын
+Brew Dain Thanks man :)
@fliptmartley
@fliptmartley 9 жыл бұрын
+Rudde, I square the prime I'm testing against and and check to see if it's larger than the candidate.
@mullermanden
@mullermanden 8 жыл бұрын
+Rudde Instead of using: for(int i=3; i
@MrBrew4321
@MrBrew4321 8 жыл бұрын
You can calculate sqrt(p) above the loop and store the result in a variable to use as the upper bound, but i*i changes each iteration so that isn't possible..
@daveb5041
@daveb5041 7 жыл бұрын
Why not put it in series with a 5w light bulb, the brightness of the bulb will show power consumption. The best way to save electricity is to make a computer that runs on vacuum tubes instead of transistors. A tube can take the place of three to five transistors so you can shrink a billion core processor down to 300 million tubes. To power it dont run it on coal, have monkeys pedaling bicycles hooked to generators. Feed them GMO bananas made by monsanto to cut down on food costs. You can also have them make copies of Shakespeare by putting a type writer in front of each one. Statistics proves that with enough time and type writers one will publish a complete work.
@frostgreen5527
@frostgreen5527 8 жыл бұрын
nice presentation, open source and small power consumption, not bad...
@stevebez2767
@stevebez2767 7 жыл бұрын
yunno the actual grounding back ground of living in a gent with some batterries you have too recharge,some windmills,some solar lights,etc to proove you can be coz of this wailing 'universal'failing that ego yelling utter liar of any exsist invites,pays,an builds club war den non men for meter maids count sell no wellys sheep shag act of 'had you all'you know you think I was???Yellow Lines,no on,no approach too know a Meter,answer door,in gets 'bill'big blue 'company guy'your all comparing exsists too pay build three arse non element teee red robe 'kill the giy'gsus vet war law own yer run into ground zero sport o non lord manger e state yells of sit yer on a stamp,get tirkey work or starve,full slave driven yer man,carzee lie sence have it learns???watt o?
@KittyKittaw
@KittyKittaw 7 жыл бұрын
Motorola - 16cores, 1985 -parallel processors, round the same time. Course it burned more power, or did it?
@erickleefeld4883
@erickleefeld4883 7 жыл бұрын
Could I use something like this to run Handbrake video compressions, and use an app on my Mac to administer it?
@RinksRides
@RinksRides 7 жыл бұрын
i think mores law is still relevant, it;s just taking a different direction. We're getting more and more powerful computing ability while the cost and power consumption can be lowered at the same time. So, if you view it from a performance per watt level then Moore's law is still relevant in that context.
@smorrow
@smorrow 7 жыл бұрын
Moore's Law proper is about number of transistors.
@DaHaiZhu
@DaHaiZhu 8 жыл бұрын
He never did say how much more energy efficient the Parallella was per core to the Chinese Supercomputer. In other words, how many petaflops per watt does the Parallella use compared to the Chinese Supercomputer???
@vinny142
@vinny142 8 жыл бұрын
@16:19 "a 150 dollar device was comparable to a $2000 mac" Well, to one core of the mac, he's onlty using one ore on the mac, not all four, which is what the $2000 costs. So really he is comparing a $500 mac to a $150 device. Loose the screen and the rest of the hardware, and the price is about the same. And even then it's only true for this particular application. Do you do much prime-number checking? I've never done it either. Parallel computing is ofcourse nothing new, back in the early 2000's companies like Industrial Light and Magic and Pixar learned very quickly that you get much more bang for you buck if you add many many many small cheap nodes, than fewer faster but more expensive nodes. Adding one 2Ghz core to a system adds 2Billion instructions a second to the system, which is the same as upgrading four cores from 2Ghz to 2.5, which is a lot more expensive than one 2Ghz core.
@jerryschull2122
@jerryschull2122 9 жыл бұрын
Seems way too pricey. The Pi2 and Pine64 are really cheap and have significant processing power, fits most project requirements.
@0xf7c8
@0xf7c8 8 жыл бұрын
+Jerry Schull Google cluster, that is what this is designed for.
@stevebez2767
@stevebez2767 7 жыл бұрын
yeah like all back too The Simpsons as some crazy kid finds 'dirty riffs in basic 'coo yells for skitso anarchy run hells exterimination repeat of give it too the keyIDzz sig moon frieds?
@reezlaw
@reezlaw 9 жыл бұрын
This video being 360p in 2015 showed that we must be actually going backwards
@chrisking7603
@chrisking7603 7 жыл бұрын
This video its presenter are quite entertaining, but I wanted: #1 a clear comparison of megaflops per megawatt-hour against currently optimised supercomputers; #2 explanation of how linearly adding parallel cores can really compensate for a limit in polynomial growth of chip density. Apart from being properly RISC, this is seemed very Transputer-ish.
@12kenbutsuri
@12kenbutsuri 4 жыл бұрын
I ordered one once, it was completely broken by the time it arrived.
@w.rustylane5650
@w.rustylane5650 7 жыл бұрын
Nice video on parallelism, for what it's worth.
@05Rudey
@05Rudey 9 жыл бұрын
I want one just to tell my mates that I've got a super computer.
@MrManerd
@MrManerd 6 жыл бұрын
Does the Parallella use ECC memory? That's all I want.
@terrance_huang
@terrance_huang 7 жыл бұрын
ditch the soft cores and do it on bare metal verilog, you can get another 10x performance
@tigerbody69
@tigerbody69 6 жыл бұрын
"Will it float?"
@djprodigalsun
@djprodigalsun 8 жыл бұрын
He is using the battery in that solar pack, why don't you tell us what the solar efficiency of that panel is..
@artlab_one
@artlab_one 7 жыл бұрын
Would love to see a Blender 3D test on this device :)
@Tommo_
@Tommo_ 8 жыл бұрын
macs are only expensive because of how compact they are. if you look at the inside of a 12 inch macbook, the whole 8gb of ram and 500gb of storage fits into about 10 by 5 cm of space. The rest is the battery. And it runs without a fan. Amazing.
@afronprime51
@afronprime51 6 жыл бұрын
Can you use them as a render farm?
@TheTurnipKing
@TheTurnipKing 6 жыл бұрын
16.21 That says far more about the overpricing of the Mac to me than anything else
@tenshi7angel
@tenshi7angel 7 жыл бұрын
The problem with Parallella, there are programs that cannot be done on multi-core or multi-system setups.
@rospotrebpozor3873
@rospotrebpozor3873 9 жыл бұрын
The problem is that program has to compute one result before it can make decision for another. parallel processing does not solve that problem.
@Thyhorrorchannel
@Thyhorrorchannel 9 жыл бұрын
+rospotreb pozor RISC .
@walter0bz
@walter0bz 8 жыл бұрын
+rospotreb poor many algorithms parallelise fine. changes the way you program and the types of work you can do. see deep-learning (which became viable due to GPUs, and its a poor use of a CPU), it can use huge parallelism across layers and deeper nets, but suffers at the moment from communication bottlenecks in clusters. the point here is parallelism with local memories and an on-chip network overcome that.
@jarisipilainen3875
@jarisipilainen3875 8 жыл бұрын
is it 18cores and 18 extra cores? if some core will broke. some intel prosessors have 9 cores but there 9 exrts to fix broken core at fly. OR you can activate them all lol 18 cores. propably they work paraller. not atleast serial lol
@maxlol0
@maxlol0 7 жыл бұрын
could be good as a linux media server or NAS. a bit weak for main computing task.
@iluan_
@iluan_ 7 жыл бұрын
It has a ZYNQ FPGA chip from Xilinx. For many applications, that's more than enough for high performance computing.
@DAVIDGREGORYKERR
@DAVIDGREGORYKERR 8 жыл бұрын
I wonder has anyone built a super computer around 16 boards containing 64 IMOS T800 Transputer’s each which equals 1024 Transputer cores that will run Linux.
@jarisipilainen3875
@jarisipilainen3875 8 жыл бұрын
if anyone intersted how fast is 5 rasbperry pi3 cluster and it not cost 180 :) but i didnt say this board not good. need more cores lol
@roschereric
@roschereric 8 жыл бұрын
Just think that at the same time, Nvidia had the 970 for less power per GFLOP already. Pair that with an ARM dual core and you are better performer
@ultraviolet.catastrophe
@ultraviolet.catastrophe 4 жыл бұрын
Why would I buy this when I can buy 6 Raspberry Pi 3 boards that would give me a total of 24 cores?
@sigmareaver680
@sigmareaver680 6 жыл бұрын
The only thing attractive here is the energy efficiency. Would it be worth crypto mining with?
@antonnym214
@antonnym214 8 жыл бұрын
18 cores for $150 is pretty spectacular, especially compared to a standard AMD or Intel CPU.
@StayMadNobodycares
@StayMadNobodycares 7 жыл бұрын
4:07 are those vents supposed to look like a brain?
@38KSW
@38KSW 8 жыл бұрын
Too bad can't find this thing any place
@IraQNid
@IraQNid 9 жыл бұрын
A fractal Parallela cluster is the real answer. But how well does it run beneficial programs such as SETI@HOME and BOINC? These are programs that seek to solve our most pressing issues of the day using idle distributed CPU and GPU cycles. That idle CPU and GPU processing prowess is then used with tiny segments of data sent to users all around the globe to analyze data. Results are then sent back to the researcher's computer centers. I used to participate on SETI@HOME, Einstein@HOME, and BOINC to help solve the mysteries of our Universe, to find a cure for cancers, and to produce better rice yields to feed more people with an improvement in how the rice is grown. You might want to research the computational power of a Titan series GPU and something called "CUDA" :)
@marcusdudley7235
@marcusdudley7235 8 жыл бұрын
I used to run BOINC too, but, although it claimed to only use idle cycles, my CPU and GPU showed much more active cooling with BOINC running and my power usage almost tripled.
@Gamepak
@Gamepak 6 жыл бұрын
cool but does it do Crysis?
@hinasamal8406
@hinasamal8406 7 жыл бұрын
Parralela supercomputing is a fantastic idea
@RobbieFPV
@RobbieFPV 8 жыл бұрын
haha I saw "hightower" and immediately expected a huge black cop.
@StefanBlurr
@StefanBlurr 8 жыл бұрын
he died a long time ago :'(
@RobbieFPV
@RobbieFPV 8 жыл бұрын
O haha yea ofcourse! I hardly play that map though. I'm more of a goldrush or dustbowl player :v
@-ColorMehJewish-
@-ColorMehJewish- 8 жыл бұрын
Ill just connect and link my RPi's thanks - least til something truly better comes out - not just scaled into one board.
@ChrisD__
@ChrisD__ 9 жыл бұрын
If I could run Blender Cycles on this, I'll take fifty.
@mutantgenepool
@mutantgenepool 7 жыл бұрын
Was thinking the same thing. xDD
@Art7220
@Art7220 6 жыл бұрын
Can it run XP or Crysis, or Bitcoin Mining? Someone always asks about Crysis.
@afronprime51
@afronprime51 6 жыл бұрын
Reading my mind
@Phoen1x883
@Phoen1x883 6 жыл бұрын
With only 1 GB of RAM, you'd be fairly limited in your scene size. In addition, rendering requires lots of high speed access to _all_ the memory, as rays need to bounce around the scene (and therefore, around memory), Just looking at the block diagram, you can see that none of the cores have direct access to a large block of memory. Unless there is some extremely fast communication bus between cores, that means long pauses in execution while data is fetched from memory. Would be nice if we could get someone familiar with Cycles internals to take a look and evaluate whether the Parallella architecture is usable for rendering. I did some quick searches, and didn't find anything solid.
@williamhart4896
@williamhart4896 8 жыл бұрын
hmm this board plus a other companies board both of them running in one device the parella in CO process and a pine a64 ln main hmm super compute in a tablet case ?
@fy7589
@fy7589 7 жыл бұрын
This is not a new idea. We already build cluster computers using super high end hardware and one chip in them is capable of the same speed as thousands of parallellas or raspberry pi's Just one chip in them. And it is much more power efficient and space friendly than building super big Pi Clusters. Instead, FPGA chips will become more popular in the future .
@j.macjordan9779
@j.macjordan9779 7 жыл бұрын
When did # of CPU cores become the sole measure of one's genitals, with zero regard for RAM? I'd think this has it backwards if anything... If my analogy fits - & I think it does - It's like a genetic freak show: a dude with 18 testicles and a centimeter long wrecker. That ain't cool...it's not getting the job done. I certainly don't want that...
@einsteinwallah2
@einsteinwallah2 4 жыл бұрын
make this in 480p or higher
@jarisipilainen3875
@jarisipilainen3875 8 жыл бұрын
are you scared to show how fast mac is on multiple threats? it was faster anyway on one lol. but yet it cost more and allmost 3 times faster core. ,ac could do it on 7 seconds and way under. your probram was only think benefit of your board lol
@Masoudy91
@Masoudy91 9 жыл бұрын
A mac with 2.4 GHz toke 14 sec. 18 (or 20?) 1GHz should add up to 18 GHz or 20 GHz? Yet it toke 18 sec? Not really familiar with computation stuff .. :(
@ToriRocksAmos
@ToriRocksAmos 9 жыл бұрын
+Yousif Tareq you can't just add the numbers up. Those are entirely different machines running different architectures.
@Masoudy91
@Masoudy91 9 жыл бұрын
+Marcel Krebs yep, so I heard.
@diskgrind3410
@diskgrind3410 9 жыл бұрын
Other than the Jobbathehut in the audience I thought it was a good speech.
@looneyburgmusic
@looneyburgmusic 8 жыл бұрын
There is a fatal flaw with "Moore's Law"... Where is it written that CPU's *must* always stay the same size, (or get smaller), while the transistor count rises? Simply increase the CPU die size, instead of only worrying about shrinking the transistors. Sure, this would cause a bottleneck for portable devices like smartphones, but for desktop/laptop computers it would not be an issue - How many customers would reject a desktop or laptop computer, (or tablet even), that is 10x more powerful, with the slight downsize that it is slightly larger, and consumes more energy?
@backflp
@backflp 8 жыл бұрын
I've been wondering about this too, why not just make the physical size of a CPU bigger, making room for more transistors? What stops them from doing this with the CPU's used in parallell computing and supercomputers? Portable devices like phones and tablets are small enough anyway, no need for a smaller processor in those.
@sdphotography4733
@sdphotography4733 8 жыл бұрын
www.newegg.com/Product/Product.aspx?Item=N82E16819117643&cm_re=i7-_-19-117-643-_-Product 10 cores, and growing.
@0xy_
@0xy_ 8 жыл бұрын
One of the main reasons that CPU and GPU die stay the same size is when you increase the size the thermal output increases and it becomes harder to disperse that heat. The bigger the die size the more power required to power it and the more heat that it creates. Think of it like a bigger car using more fuel. Now I know that's not the only reason but I'm on a phone and can't really research and verify. Also bigger CPU = more material = more money
@looneyburgmusic
@looneyburgmusic 8 жыл бұрын
Phaint Well all know these things... But would you buy a CPU that was twice as large, (and needs more cooling/power), if it offered 10x the processing power, or 50x, or 100x? There are practical applications for larger die-CPU's, where power/heat dissipation would not be an issue.... Everything is a trade-off - want a thin cellphone, you need a thin CPU. Need a single computer that can do the work of 10 computers in one box, a larger CPU die could give you that... The bottom line is CPU die size gets smaller because that is what the market has demanded since almost day one, but there is no actual reason die size *must* shrink.
@ubbgn
@ubbgn 8 жыл бұрын
Clearly u dont know shit about the business! :)
@BoggyBogdan
@BoggyBogdan 9 жыл бұрын
That's awesome Thanks for sharing
@Raven-fu1zz
@Raven-fu1zz 3 жыл бұрын
Why can't you just use a GPU to do the calculations, they have thousands of cores, and per watt you would get more performance
@Nomoreidsleft
@Nomoreidsleft 7 жыл бұрын
I don't know why he's even calling it a supercomputer. Only 16 cores, and probably doesn't even do floating point.
@SamuelBSR
@SamuelBSR 6 жыл бұрын
2015 Hahaha :)))) It's 2018 and where is parallella now?
@Petr75661
@Petr75661 9 жыл бұрын
Mobileye EyeQ4 pulls 2.5 teraflops @ 3 W. Parallela gives only 0.09 teraflops @ 5 W.
@llothar68
@llothar68 8 жыл бұрын
+jednoucelovy Yes it's all fake. In real world nothing in the ARM world beats Intel on performace/Watt (except GPU if you use matrix algorithms in single preceision).
@madgamer3974
@madgamer3974 8 жыл бұрын
cloud phone connected by internet to supercomputer = best phone ever :D
@StefanReich
@StefanReich 6 жыл бұрын
0:32 "One of my interns" - So you exploit people?
@i00Productions
@i00Productions 8 жыл бұрын
kzbin.info/www/bejne/eHm9dHaLet9-ZtU the IsPrime function seems to be incorrect
@i00Productions
@i00Productions 8 жыл бұрын
Yep checked it... Their incorrect function iterates through the 16M values in 3.356 seconds on my server theirs took 237.104 ... double checked with their incorrect number of primes (1031130) returned - see kzbin.info/www/bejne/eHm9dHaLet9-ZtU I am currently calculating the real number of primes in this... and will check back on this later as it will take much much longer...
@i00Productions
@i00Productions 8 жыл бұрын
There are actually 1151367 primes < 16M ... my server calculated this in 7.216 seconds
@andychow5509
@andychow5509 8 жыл бұрын
There are 1,031,130 primes less than or equal to 16,000,000. You made a mistake. My server calculated this in 0.029 seconds (yeah, seriously).
@andychow5509
@andychow5509 8 жыл бұрын
I found your mistake, you are looking up to 18M. There are 1,151,367 primes less than or equal to 18,000,000. My server calculated this in 0.09 seconds.
@sighpocket5
@sighpocket5 8 жыл бұрын
i00 my server brought me a beer....
@トヤマトカナワ
@トヤマトカナワ 9 жыл бұрын
the conclusion is - its a tool for work! if so, then several questions have to be asked! 1 - for what purpose is this small factor ? IMO it must be a relatively big board something like a standard server MB and with a efficient cooling solution "out of the box" (for server cases) 2 - whats a point of energy efficiency of the board it self if the power that it takes from the wall will be significantly higher ? and that brings us back to question #1 (more chips on a single board - more effective PSU you can use) fit a 100-200 of those on a single board and it will shine ! Until ... As a technology - yah seems ok. As a device - not worth it at all! 3 - comparing a single core mac results (14s) and multicore your devise (18s) is a vary accurate way to do the test !? 4 - comparing price of a consumer grade notebook (with a display, design, all those other peaces that form its price, including a brand premium) with a PSB with a 4 chips and 2 connectors in it is really a great way to compare the prices! REALY !? HUGE dislike !
@ammonlu8566
@ammonlu8566 6 жыл бұрын
superb talk thank you very much
@hasemali7799
@hasemali7799 8 жыл бұрын
Can i have or make a supercomputer which runs Microsoft Windows
@nicolek4076
@nicolek4076 8 жыл бұрын
Yes, of course. You'll have a computer that hangs and crashes faster than anyone else's. It's notable that none of the fastest computers in the world use Windows - there's a reason for that and it isn't the price of the software.
@livesimplyandhumbly
@livesimplyandhumbly 8 жыл бұрын
+Nicole K and does not even have built in SSH.
@nicolek4076
@nicolek4076 8 жыл бұрын
+jamesassbag You experience is uncharacteristic or you used an outdated version, or you used and unstable distribution, or you set it up incorrectly. I have been using Linux as my desktop for the last ten years.It's got better and better over that time. It lets me do what I need to do. It never "phones home". I can, should I want, audit the code so I know exactly what it's up to behind my back; which is precisely nothing. It's my computer and Linux lets me define how it's used. Try saying that about Windows.
@nicolek4076
@nicolek4076 8 жыл бұрын
The only part of your unreasoned rant that is remotely true is that the documentation is limited. For this you have the Internet and your search engine of choice. The whole point of it is that if you find it to be broken or unoptimised (a very rare occurrences) you can do something about it. Try that with Windows.
@livesimplyandhumbly
@livesimplyandhumbly 8 жыл бұрын
***** Currently my Windows computers can not even bring up settings since the last update.
@hanniffydinn6019
@hanniffydinn6019 6 жыл бұрын
Anyone remember transputers?
@jarisipilainen3875
@jarisipilainen3875 8 жыл бұрын
you only used 1 thread on mac lol
@jimbig3997
@jimbig3997 6 жыл бұрын
Moore's Law is on hiatus because there has been no competition to Intel until recently so they took a break.
@MasterGhostKnight
@MasterGhostKnight 6 жыл бұрын
It doesn't matter if it is a MAC or whatever. You are using 1 2.7GHz processor to do the job. Each core of the paralllella was 1GHz but you have 16 of them. Let's be generous and say that each parallella is 1/3 the speed so it would take sensibly 3x as long. But you have 16 cores, so you would expect 16/3 the processing power, or it should take you 3/16 the amount of time.. lets be generous and round it up to 1/4. The serial still took less time to finish the job. I would say that this was a massive fail!
@gregk5505
@gregk5505 7 жыл бұрын
That was 17.8 kilo watts not mega watts
@stonent
@stonent 6 жыл бұрын
17,800 kilowatts = 17.8 million watts....Look at the table again.
6 жыл бұрын
4 Orangepis for 48$ andyour have 16 CPU Cores with 1.2ghz
@SarahC2
@SarahC2 7 жыл бұрын
If the Mac used all 4 cores: 14.4 sec / 4 cores = 3.6 seconds!...... on the Parallella it takes 18.6 seconds using all 18 cores!... bit crap really.
CHIP $9 Computer
18:28
ExplainingComputers
Рет қаралды 2,8 МЛН
DON'T Use Raspberry Pis for Servers! (Use THIS)
16:23
Hardware Haven
Рет қаралды 1,2 МЛН
Мен атып көрмегенмін ! | Qalam | 5 серия
25:41
Quilt Challenge, No Skills, Just Luck#Funnyfamily #Partygames #Funny
00:32
Family Games Media
Рет қаралды 55 МЛН
Cracking Enigma in 2021 - Computerphile
21:20
Computerphile
Рет қаралды 2,5 МЛН
Build Your Own GPU Accelerated Supercomputer - NVIDIA Jetson Cluster
15:03
Beating Moore's Law: This photonic computer is 10X faster than NVIDIA GPUs using 90% less energy
17:03
John Koetsier (tech, AI, & the future)
Рет қаралды 546 М.
How an ASML Lithography Machine Moves a Wafer
16:15
Asianometry
Рет қаралды 487 М.
Asus Tinker Board
15:46
ExplainingComputers
Рет қаралды 1,1 МЛН
ODROID-XU4 Octa-Core SBC
14:50
ExplainingComputers
Рет қаралды 397 М.
NO WIRES: How the Apple Airport Changed Everything
29:36
The Serial Port
Рет қаралды 292 М.
Стыдные вопросы про Китай / вДудь
3:07:50
вДудь
Рет қаралды 2,3 МЛН
Why Thorium will be a Game-Changer in Energy
32:00
Copenhagen Atomics
Рет қаралды 284 М.
Мен атып көрмегенмін ! | Qalam | 5 серия
25:41