10 - Sorting & Aggregations (CMU Databases Systems / Fall 2019)

19,278 views

CMU Database Group

Comments: 18
@AndersonSilva-dg4mg 5 years ago
You are the best teacher on the subject of databases.
@420_gunna 4 years ago
pavlo possibly the most dangerous man in database systems
@firstlast7086 3 years ago
38:32 the best definition for aggregation I've ever heard
@galaxy_mosaic3587 2 years ago
My only constructive feedback is that it would be nice for the content to be formatted so that the full slide is visible, with the proportions adjusted so that the recorded video replay does not always overlap the lower-right corner of the slide, because important content sometimes gets hidden.
@pb25193 4 years ago
The Indian dude asks really good questions. Good job Andy handling them masterfully.
@hasan0770816268 3 years ago
7:37 agenda (sorting and aggregation)
8:17 why do we need sorting?
11:26 external merge sort
13:01 2-way external merge sort
23:52 double buffering optimization
24:27 general k-way merge sort
29:05 B-tree for sorting
37:35 aggregation
@patrykuchman5388 3 years ago
25:02 - The general external merge sort. Andy mentions that, in general, if we have B buffer pool pages, we can use B-1 pages to hold the sorted runs and 1 page for output. But wouldn't it be better to dedicate more than 1 page to the output? If B is high and we only have 1 output buffer page, then every time it fills up we have to write that page to disk and wait until the write finishes before we can proceed with merging.
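One way to reason about the trade-off raised in this question: every page given to output is a page taken from the merge fan-in, which can increase the number of passes. The sketch below is a minimal illustration, assuming the usual cost model in which pass 0 sorts chunks of B pages in memory and each later pass merges (B minus output pages) runs at a time. The function name `merge_passes` and the `output_pages` parameter are hypothetical and not from the lecture.

```python
def merge_passes(num_pages: int, buffer_pages: int, output_pages: int = 1) -> int:
    """Count the passes of an external merge sort (pass 0 plus merge passes).

    Assumption: pass 0 uses all buffer pages to build sorted runs, and every
    later pass merges (buffer_pages - output_pages) runs at a time.
    """
    fan_in = buffer_pages - output_pages
    if fan_in < 2:
        raise ValueError("need at least two input pages to merge")

    # Pass 0: ceil(num_pages / buffer_pages) sorted runs.
    runs = -(-num_pages // buffer_pages)
    passes = 1

    # Each merge pass shrinks the number of runs by a factor of fan_in.
    while runs > 1:
        runs = -(-runs // fan_in)
        passes += 1
    return passes


if __name__ == "__main__":
    for out in (1, 2, 3):
        print(out, merge_passes(108, 5, output_pages=out))
```

With 108 pages and 5 buffer pages, this model gives 4 passes with one or two output pages and 6 passes with three, so extra output buffering is not free once it starts to reduce the fan-in.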
@th3sly 5 years ago
was that guy wearing a MongoDB t-shirt? :)))
@abdelrhmanahmed1378 2 years ago
1:04:00 In the rehash phase, do we do a sequential scan over the buckets? If yes, wouldn't it be better to just do a sequential scan over the original data?
@420_gunna 4 years ago
dj drop tables last heard muttering I aint goin back as he left
@alexalexfff 3 years ago
What textbook is mentioned in the development hints slide?
@user-jl4io6rs4y 4 years ago
In the hashing approach, why do we need to rehash rather than just operate on the buckets from phase 1?
@AshishNegi1618 4 years ago
If the first page has 1k key-value pairs (with some duplicates), you need some data structure to group the values of the same key together, hence the second hash table. It may look like we could just use the first hash table and the second is redundant, but from this lecture the idea seems to be to first collect all values of a key together using the first hash, and then compute the aggregate using the second hash.
@huangwentao 3 years ago
@AshishNegi1618 In my opinion, if we use only the first hash function, we get different partitions, and the aggregation can be performed by scanning each key-value pair inside each partition. But that may require a large number of scans, which is not efficient. With the second hash function we can avoid those scans: we perform the aggregation in a single scan over the hash keys with the help of the second hash table.
@zhchen6644 3 years ago
Because you can't: the data is larger than your memory.
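The two phases discussed in this thread can be sketched roughly as follows. This is a simplified illustration, not the lecture's implementation: it assumes a SUM aggregate over (key, value) pairs, uses Python lists as stand-ins for partitions spilled to disk, and uses a `dict` in the role of the second, in-memory hash table. The names `partition` and `rehash_and_sum` are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Hashable, Iterable, List, Tuple

Row = Tuple[Hashable, int]


def partition(rows: Iterable[Row], num_partitions: int) -> List[List[Row]]:
    """Phase 1: route each row to a partition by hashing its key.

    All rows with the same key land in the same partition, but a partition
    still holds many different keys mixed together, in arrival order.
    """
    partitions: List[List[Row]] = [[] for _ in range(num_partitions)]
    for key, value in rows:
        partitions[hash(key) % num_partitions].append((key, value))
    return partitions


def rehash_and_sum(partitions: List[List[Row]]) -> Dict[Hashable, int]:
    """Phase 2: for each partition, build a temporary in-memory hash table.

    Only one partition's distinct keys are resident at a time, which is the
    point of partitioning first when the full data set exceeds memory.
    """
    result: Dict[Hashable, int] = {}
    for part in partitions:
        table: Dict[Hashable, int] = defaultdict(int)  # second hash table
        for key, value in part:
            table[key] += value  # running SUM per key
        result.update(table)     # emit this partition's groups, then discard
    return result


if __name__ == "__main__":
    rows = [("a", 1), ("b", 2), ("a", 3), ("c", 4), ("b", 5)]
    print(rehash_and_sum(partition(rows, num_partitions=2)))
```

The second table is what lets each partition be aggregated in a single scan; without it, finding all rows for a given key inside a partition would mean rescanning that partition once per key.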
@akshpatel7935 2 years ago
21:31 how does pass #3 fit entirely in main memory? Wasn't that the precondition that whole dataset can't fit in main memory?
@andypavlo 2 years ago
It doesn't fit entirely in memory. You fetch one page at a time per sorted run and then fill up the output buffer. When the output is full, you write it out to disk.
@akshpatel7935 2 years ago
@andypavlo Thanks a lot Prof. Pavlo, that clears things up. I was not expecting a response so soon on a 2-year-old lecture. And thank you for making the lectures, assignments and problem sets public.
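The answer in this thread (one page per sorted run in memory, plus an output buffer that is written out when full) can be sketched as below. This is a toy model under stated assumptions: runs are in-memory Python lists standing in for files on disk, `PAGE_SIZE` is an arbitrary tiny value, and `read_pages` and `merge_runs` are hypothetical names, not code from the course.

```python
import heapq
from typing import Iterator, List

PAGE_SIZE = 2  # records per page; tiny on purpose, purely illustrative


def read_pages(run: List[int]) -> Iterator[List[int]]:
    """Simulate fetching a sorted run from disk one page at a time."""
    for start in range(0, len(run), PAGE_SIZE):
        yield run[start:start + PAGE_SIZE]


def merge_runs(runs: List[List[int]]) -> List[List[int]]:
    """Merge sorted runs while holding only one page per run in memory."""
    readers = [read_pages(run) for run in runs]
    pages = [next(reader, []) for reader in readers]  # one resident page per run
    positions = [0] * len(runs)                       # cursor within each page

    heap = [(page[0], i) for i, page in enumerate(pages) if page]
    heapq.heapify(heap)

    output_buffer: List[int] = []
    written: List[List[int]] = []  # stand-in for output pages written to disk

    while heap:
        value, i = heapq.heappop(heap)
        output_buffer.append(value)
        if len(output_buffer) == PAGE_SIZE:  # output page full: write it out
            written.append(output_buffer)
            output_buffer = []

        positions[i] += 1
        if positions[i] == len(pages[i]):    # page consumed: fetch the next one
            pages[i] = next(readers[i], [])
            positions[i] = 0
        if pages[i]:
            heapq.heappush(heap, (pages[i][positions[i]], i))

    if output_buffer:                        # flush the final partial page
        written.append(output_buffer)
    return written


if __name__ == "__main__":
    print(merge_runs([[1, 4, 7], [2, 5], [3, 6, 8, 9]]))
```

At any moment the merge holds at most one page per run plus one output page, regardless of how long the runs are, which is why the final pass does not need the whole data set in memory.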