high_scalability high_scalability-2009 high_scalability-2009-609 knowledge-graph by maker-knowledge-mining
Source: html
Introduction: Combining GPU power with PostgreSQL. PostgreSQL is one of the world's leading open source databases, and it provides enormous flexibility as well as extensibility. One of the key features of PostgreSQL is that users can define their own procedures and functions in a wide range of programming languages. Functions make it easy to write almost any server-side code. None of this extensibility is new. So what does it have to do with scaling? Well, imagine a world where the data in your database and enormous computing power are tightly integrated. Imagine a world where data inside your database has direct access to hundreds of FPUs. Welcome to the world of CUDA, NVIDIA's way of making the power of graphics cards available to normal, high-performance applications. When it comes to complex computations, databases might very well turn out to be a bottleneck. Depending on your application it might easily happen that adding more
sentIndex sentText sentNum sentScore
1 Combining GPU power with PostgreSQL PostgreSQL is one of the world's leading Open Source databases and it provides enormous flexibility as well as extensibility. [sent-1, score-0.192]
2 One of the key features of PostgreSQL is that users can define their own procedures and functions in basically any known programming language. [sent-2, score-0.259]
3 Functions make it easy to write almost any server-side code. [sent-3, score-0.239]
4 Now, all this extensibility is basically not new. [sent-4, score-0.196]
5 Well, imagine a world where the data in your database and enormous computing power are tightly integrated. [sent-6, score-0.334]
6 Imagine a world where data inside your database has direct access to hundreds of FPUs. [sent-7, score-0.12]
7 Welcome to the world of CUDA, NVIDIA's way of making the power of graphics cards available to normal, high-performance applications. [sent-8, score-0.201]
8 When it comes to complex computations databases might very well turn out to be a bottleneck. [sent-9, score-0.072]
9 CUDA is C / C++: Basically, a CUDA program is simply a C program with some small extensions. [sent-13, score-0.136]
10 The CUDA subsystem transforms your CUDA program to normal C code which can then be compiled and linked nicely with existing code. [sent-14, score-0.401]
11 This also means that CUDA code can basically be used to work inside a PostgreSQL stored procedure easily. [sent-15, score-0.392]
12 The goal of this simplistic example is to generate a set of random numbers on the CPU, copy it to the GPU, and make the code callable from PostgreSQL. [sent-18, score-0.389]
13 h" PG_MODULE_MAGIC; /* prototypes to silence compiler */ extern Datum test_random(PG_FUNCTION_ARGS); /* define function to allocate N random values (0 - 1. [sent-25, score-0.512]
14 The test_random function can be called like this: SELECT test_random(1000); Of course, this is just a brief introduction to show how things can be done in practice. [sent-27, score-0.304]
15 A more realistic application will need more thinking and can be integrated into the database even more closely. [sent-28, score-0.042]
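Sentences 12 to 14 above outline the flow of the example: generate random numbers on the CPU, copy them to the GPU, and expose the result through a function callable from SQL. The following is a rough, GPU-free sketch of that flow in plain C, not the article's own code. The names fill_random, copy_to_device, and test_random_core are hypothetical, and copy_to_device substitutes malloc() and memcpy() for the cudaMalloc() and cudaMemcpy() calls that a CUDA build would make, so the sketch runs on any machine.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Fill buf with n pseudo-random values in [0, 1) on the CPU. */
void fill_random(double *buf, size_t n, unsigned int seed)
{
    assert(buf != NULL);
    srand(seed);
    for (size_t i = 0; i < n; i++)
        buf[i] = rand() / ((double) RAND_MAX + 1.0);
}

/*
 * Hypothetical stand-in for the host-to-device transfer. A CUDA build
 * would call cudaMalloc() and cudaMemcpy(..., cudaMemcpyHostToDevice)
 * here; this sketch uses malloc() and memcpy() so it runs without a GPU.
 * Returns a newly allocated copy, or NULL if allocation fails.
 */
double *copy_to_device(const double *host, size_t n)
{
    double *dev = malloc(n * sizeof *dev);
    if (dev != NULL)
        memcpy(dev, host, n * sizeof *dev);
    return dev;
}

/*
 * Hypothetical core of a test_random()-style function: generate n
 * values, "upload" them, and return the number transferred, or -1 on
 * invalid input or allocation failure.
 */
int test_random_core(int n)
{
    if (n <= 0)
        return -1;

    double *host = malloc((size_t) n * sizeof *host);
    if (host == NULL)
        return -1;

    fill_random(host, (size_t) n, 42u);

    double *dev = copy_to_device(host, (size_t) n);
    int ok = (dev != NULL);

    free(dev);
    free(host);
    return ok ? n : -1;
}
```

In an actual extension, a PostgreSQL wrapper registered with PG_FUNCTION_INFO_V1 would read its argument with PG_GETARG_INT32, pass it to a core function like this one, and hand the result back with PG_RETURN_INT32. That wiring is left out here because it depends on the server headers.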
wordName wordTfidf (topN-words)
[('cuda', 0.774), ('postgresql', 0.196), ('int', 0.164), ('gpu', 0.163), ('random', 0.151), ('basically', 0.146), ('include', 0.123), ('enormous', 0.122), ('function', 0.107), ('nvidia', 0.103), ('copy', 0.09), ('official', 0.087), ('allocate', 0.082), ('graphics', 0.081), ('compiled', 0.078), ('procedure', 0.077), ('computations', 0.072), ('inside', 0.07), ('power', 0.07), ('program', 0.068), ('nicely', 0.066), ('silence', 0.063), ('define', 0.063), ('datum', 0.06), ('normal', 0.056), ('stored', 0.056), ('generate', 0.055), ('void', 0.055), ('return', 0.051), ('functions', 0.05), ('simplistic', 0.05), ('extensibility', 0.05), ('world', 0.05), ('transported', 0.048), ('imagine', 0.048), ('import', 0.047), ('gpus', 0.047), ('servicesthe', 0.047), ('practically', 0.046), ('prototypes', 0.046), ('generator', 0.046), ('numbers', 0.045), ('transforms', 0.045), ('subsystem', 0.045), ('tightly', 0.044), ('matrix', 0.044), ('pointer', 0.044), ('code', 0.043), ('codes', 0.043), ('realistic', 0.042)]
simIndex simValue blogId blogTitle
same-blog 1 1.0 609 high scalability-2009-05-28-Scaling PostgreSQL using CUDA
2 0.1533265 1239 high scalability-2012-05-04-Stuff The Internet Says On Scalability For May 4, 2012
Introduction: It's HighScalability Time: Quotable quotes: Richard Feynman: Suppose that little things behave very differently than anything big @orgnet: "Data, data everywhere, but not a thought to think" -- John Allen Paulos, Mathematician @bcarlso: just throw out the word "scalability". That'll bring em out @codypo: Here are the steps to the Scalability Shuffle. 1: log everything. 2: analyze logs. 3: profile. 4: refactor. 5: repeat. @FoggSeastack: If math had been taught in a relevant way I might have been a #BigData person today. @secboffin: I know a programming joke about 10,000 mutexes, but it's a bit contentious. Twitter gets personal with Improved personalization algorithms and real-time indexing, a tale of a real-time tool chain. Earlybird is Twitter's real-time search system. Every Tweet has its URLs extracted and expanded. URL contents are fetched via SpiderDuck. Cassovary, a graph processing library, is used t
3 0.1477105 953 high scalability-2010-12-03-GPU vs CPU Smackdown : The Rise of Throughput-Oriented Architectures
Introduction: In some ways the original Amazon cloud, the one most of us still live in, was like that really cool house that when you stepped inside and saw the old green shag carpet in the living room, you knew the house hadn't been updated in a while. The network is a little slow, the processors are a bit dated, and virtualization made the house just feel smaller. It has been difficult to run high bandwidth or low latency workloads in the cloud. Bottlenecks everywhere. Not a big deal for most applications, but for many high performance applications (HPC) it was a killer. In a typical house you might just do a remodel. Upgrade a few rooms. Swap out builder quality appliances with gleaming stainless steel monsters. But Amazon has a big lot, instead of remodeling they simply keep adding on entire new wings, kind of like the Winchester Mystery House of computing. The first new wing added was a CPU based HPC system featuring blazingly fast Nehalem chips, virtualization replaced by a close t
4 0.099872403 73 high scalability-2007-08-23-Postgresql on high availability websites?
Introduction: I was looking at the pingdom infrastructure matrix (http://royal.pingdom.com/royalfiles/0702_infrastructure_matrix.pdf) and I saw that no sites are using Postgresql, and then I searched through highscalability.com and saw very few mentions of postgresql. Are there any examples of high-traffic sites that use postgresql? Does anyone have any experience with it? I'm having trouble finding good, recent studies of postgres (and postgres compared w/ mysql) online.
5 0.085074604 1523 high scalability-2013-09-27-Stuff The Internet Says On Scalability For September 27, 2013
Introduction: Hey, it's HighScalability time: (The WINLAB at Rutgers, with software defined radios tied into GENI.) 384 cores & 32TB of RAM: Oracle's SPARC M6 Quotable Quotes: @jennyinc: 2003: "I replaced you with a set of very small shell scripts." 2013: "I replaced your scripts with a six-figure enterprise DevOps platform." @tomdale: OH: “Redis is so fast, why don’t we replace RAM with Redis?” @petrillic: OH "Promises/futures are the one-night stands of architectural constructs" nice #strangeloop @TwitterEng: "Java and Scala let Twitter readily share and modify its enormous codebase across a team of hundreds of developers." Lots of juicy numbers revealed at Structure:Europe: Netflix streams 114,000 years of video every month; Custom build Netflix boxes for its content-delivery network that contain between 100 and 150 terabytes of stor
6 0.079808824 936 high scalability-2010-11-09-Facebook Uses Non-Stored Procedures to Update Social Graphs
7 0.076745234 1475 high scalability-2013-06-13-Busting 4 Modern Hardware Myths - Are Memory, HDDs, and SSDs Really Random Access?
8 0.076534092 297 high scalability-2008-04-05-Skype Plans for PostgreSQL to Scale to 1 Billion Users
9 0.073782496 933 high scalability-2010-11-01-Hot Trend: Move Behavior to Data for a New Interactive Application Architecture
10 0.064959824 1036 high scalability-2011-05-06-Stuff The Internet Says On Scalability For May 6th, 2011
12 0.064357921 750 high scalability-2009-12-16-Building Super Scalable Systems: Blade Runner Meets Autonomic Computing in the Ambient Cloud
13 0.063883439 538 high scalability-2009-03-16-Are Cloud Based Memory Architectures the Next Big Thing?
14 0.063499384 230 high scalability-2008-01-29-Speed up (Oracle) database code with result caching
15 0.063474223 1621 high scalability-2014-03-28-Stuff The Internet Says On Scalability For March 28th, 2014
16 0.061919704 661 high scalability-2009-07-25-Latency is Everywhere and it Costs You Sales - How to Crush it
17 0.060398594 1369 high scalability-2012-12-10-Switch your databases to Flash storage. Now. Or you're doing it wrong.
18 0.059948877 1186 high scalability-2012-02-02-The Data-Scope Project - 6PB storage, 500GBytes-sec sequential IO, 20M IOPS, 130TFlops
19 0.05869199 1567 high scalability-2013-12-20-Stuff The Internet Says On Scalability For December 20th, 2013
20 0.057457566 1509 high scalability-2013-08-30-Stuff The Internet Says On Scalability For August 30, 2013
topicId topicWeight
[(0, 0.102), (1, 0.054), (2, -0.002), (3, 0.004), (4, -0.003), (5, 0.028), (6, 0.016), (7, 0.019), (8, -0.028), (9, -0.007), (10, 0.003), (11, -0.008), (12, 0.0), (13, 0.008), (14, 0.015), (15, -0.013), (16, -0.015), (17, 0.006), (18, 0.001), (19, 0.02), (20, -0.017), (21, -0.021), (22, -0.018), (23, 0.025), (24, -0.002), (25, 0.005), (26, -0.0), (27, -0.022), (28, 0.011), (29, -0.008), (30, -0.044), (31, -0.014), (32, -0.016), (33, 0.013), (34, 0.025), (35, 0.013), (36, 0.019), (37, 0.006), (38, -0.017), (39, 0.018), (40, -0.031), (41, -0.033), (42, -0.001), (43, 0.003), (44, 0.02), (45, -0.0), (46, 0.03), (47, -0.029), (48, -0.009), (49, 0.026)]
simIndex simValue blogId blogTitle
same-blog 1 0.93198448 609 high scalability-2009-05-28-Scaling PostgreSQL using CUDA
2 0.74726993 1304 high scalability-2012-08-14-MemSQL Architecture - The Fast (MVCC, InMem, LockFree, CodeGen) and Familiar (SQL)
Introduction: This is an interview with MemSQL cofounders Eric Frenkiel and Nikita Shamgunov, in which they try to answer critics by going into more depth about their technology. MemSQL ruffled a few feathers with their claim of being the fastest database in the world. According to their benchmarks MemSQL can execute 200K TPS on an EC2 Quadruple Extra Large and on a 64 core machine they can push 1.2 million transactions a second. Benchmarks are always a dark mirror, so make of them what you will, but the target market for MemSQL is clear: projects looking for something both fast and familiar. Fast as in a novel design using a combination of technologies like MVCC, code generation, lock-free data structures, skip lists, and in-memory execution. Familiar as in SQL and nothing but SQL. The only interface to MemSQL is SQL. It’s right to point out MemSQL gets a boost by being a first release. Only a limited subset of SQL is supported, neither rep
3 0.72118127 823 high scalability-2010-05-05-How will memristors change everything?
Introduction: A non-random sample of my tech friends shows that not many have heard of memristors (though I do suspect vote tampering). I'd read a little about memristors in 2008 when the initial hubbub about the existence of memristors was raised. I, however, immediately filed them into that comforting conceptual bucket of potentially revolutionary technologies I didn't have to worry about because like most wondertech, nothing would ever come of it. Wrong. After watching Finding the Missing Memristor by R. Stanley Williams I've had to change my mind. Memristors have gone from "maybe never" to holy cow this could happen soon and it could change everything. Let's assume for the sake of dreaming memristors do prove out. How will we design systems when we have access to a new material that is two orders of magnitude more efficient from a power perspective than traditional transistor technologies, contains multiple petabits (1 petabit = 128TB) of persistent storage, and can be reconfigured t
4 0.71955395 1436 high scalability-2013-04-05-Stuff The Internet Says On Scalability For April 5, 2013
Introduction: Hey, it's HighScalability time: (Dr. Who Scaling Up the Shard) 50 sextillion: # of earth-like planets in universe; 100,000: stars Quotable Quotes: @petdance: "I wish I had enough money to run Oracle instead of Postgres." "Why do you want to do that?" "I don't, I just wish I had enough money to." @JBossMike: Java is old. Java is verbose. Java is boring. Java is dead… Java is FAST. @old_sound: We need a "shrink conf" for when scaling is not what we actually need. Carsten Puls: At first, customers want to get going. Understanding what's going on under the hood isn't that important. As grows, want more control and go under the hood. Managing that balance through lifecycle is important. @rbranson: What does almost every memcache library do during a multi-get when 1 out of 10 boxes times out? F*cking whole thing fails. < Reminded me of
5 0.71478975 1572 high scalability-2014-01-03-Stuff The Internet Says On Scalability For January 3rd, 2014
Introduction: Hey, it's HighScalability time, can you handle the truth? Should software architectures include parasites? They increase diversity and complexity in the food web. 10 Million: classic hockey stick growth pattern for GitHub repositories Quotable Quotes: Seymour Cray: A supercomputer is a device for turning compute-bound problems into IO-bound problems. Robert Sapolsky: And why is self-organization so beautiful to my atheistic self? Because if complex, adaptive systems don’t require a blue print, they don’t require a blue print maker. If they don’t require lightning bolts, they don’t require Someone hurtling lightning bolts. @swardley: Asked for a history of PaaS? From memory, public launch - Zimki ('06), BungeeLabs ('06), Heroku ('07), GAE ('08), CloudFoundry ('11) ... @neil_conway: If you're designing scalable systems, you should understand backpressure and build mechanisms to support it. Scott Aaronson ...the
6 0.70338321 1246 high scalability-2012-05-16-Big List of 20 Common Bottlenecks
7 0.70279545 1063 high scalability-2011-06-17-Stuff The Internet Says On Scalability For June 17, 2011
8 0.70064479 1509 high scalability-2013-08-30-Stuff The Internet Says On Scalability For August 30, 2013
9 0.69457263 1475 high scalability-2013-06-13-Busting 4 Modern Hardware Myths - Are Memory, HDDs, and SSDs Really Random Access?
10 0.69359934 817 high scalability-2010-04-29-Product: SciDB - A Science-Oriented DBMS at 100 Petabytes
11 0.6935755 538 high scalability-2009-03-16-Are Cloud Based Memory Architectures the Next Big Thing?
12 0.69283313 1650 high scalability-2014-05-19-A Short On How the Wayback Machine Stores More Pages than Stars in the Milky Way
13 0.69253165 1487 high scalability-2013-07-05-Stuff The Internet Says On Scalability For July 5, 2013
14 0.69157869 1567 high scalability-2013-12-20-Stuff The Internet Says On Scalability For December 20th, 2013
15 0.69072455 1607 high scalability-2014-03-07-Stuff The Internet Says On Scalability For March 7th, 2014
17 0.68799996 1637 high scalability-2014-04-25-Stuff The Internet Says On Scalability For April 25th, 2014
18 0.68684918 1553 high scalability-2013-11-25-How To Make an Infinitely Scalable Relational Database Management System (RDBMS)
19 0.68519622 1096 high scalability-2011-08-10-LevelDB - Fast and Lightweight Key-Value Database From the Authors of MapReduce and BigTable
20 0.68438023 1581 high scalability-2014-01-17-Stuff The Internet Says On Scalability For January 17th, 2014
topicId topicWeight
[(1, 0.173), (2, 0.184), (10, 0.052), (26, 0.013), (30, 0.02), (47, 0.017), (49, 0.012), (51, 0.049), (59, 0.172), (61, 0.109), (79, 0.025), (94, 0.063)]
simIndex simValue blogId blogTitle
same-blog 1 0.90616852 609 high scalability-2009-05-28-Scaling PostgreSQL using CUDA
2 0.88925833 1314 high scalability-2012-08-30-Dramatically Improving Performance by Debugging Brutally Complex Prolems
Introduction: Debugging complex problems is 90% persistence and 50% cool tools. Brendan Gregg in 10 Performance Wins tells a fascinating story of how a team at Joyent solved some weird and challenging performance issues deep in the OS. It took lots of effort, DTrace, Flame Graphs, USE Method, and writing custom tools when necessary. Here's a quick summary of the solved cases: Monitoring. 1000x improvement. An application blocked while paging anonymous memory back in. It was also blocked during file system fsync() calls. The application was misconfigured and sometimes briefly exceeded available memory, getting paged out. Riak. 2x improvement. The Erlang VM used half the CPU count it was supposed to, so CPUs remained unused. Fix was a configuration change. MySQL. 380x improvement. Reads were slow. Cause was correlated writes. Fix was to tune the cache flush interval on the storage controller. Various. 2800x improvement. Large systems calls to getvmusage() could take a few sec
3 0.88907623 656 high scalability-2009-07-16-Scalable Web Architectures and Application State
Introduction: In this article we follow a hypothetical programmer, Damian, on his quest to make his web application scalable. Read the full article on Bytepawn
4 0.88091916 1536 high scalability-2013-10-23-Strategy: Use Linux Taskset to Pin Processes or Let the OS Schedule It?
Introduction: This question comes from Ulysses on an interesting thread from the Mechanical Sympathy news group, especially given how multiple processors are now the norm: Ulysses: On an 8xCPU Linux instance, is it at all advantageous to use the Linux taskset command to pin an 8xJVM process set (co-ordinated as a www.infinispan.org distributed cache/data grid) to a specific CPU affinity set (i.e. pin JVM0 process to CPU 0, JVM1 process to CPU 1, ...., JVM7 process to CPU 7) vs. just letting the Linux OS use its default mechanism for provisioning the 8xJVM process set to the available CPUs? In an effort to seek an optimal point (in the full event space), what are the conceptual trade-offs in considering "searching" each permutation of provisioning an 8xJVM process set to an 8xCPU set via taskset? Given taskset is the key to the question, it would help to have a definition: Used to set or retrieve the CPU affinity of a running process given its PID or to launch a new COMMAND with
5 0.87969166 1323 high scalability-2012-09-15-4 Reasons Facebook Dumped HTML5 and Went Native
Introduction: Facebook made quite a splash when they released their native iOS app, not because of their app per se, but because of their conclusion that their biggest mistake was betting on HTML5, so they had to go native. As you might imagine this was a bit like telling a Great White Shark that its bark is worse than its bite. A common refrain was Facebook simply had made a bad HTML5 site, not that HTML5 itself is bad, as plenty of other vendors have made slick well performing mobile sites. An interesting and relevant conversation given the rising butt kickery of mobile. But we were lacking details. Now we aren't. If you were wondering just why Facebook ditched HTML5, Tobie Langel in Perf Feedback - What's slowing down Mobile Facebook, lists out the reasons: Tooling / Developer APIs. Most importantly, the lack of tooling to track down memory problems. Scrolling performance. Scrolling must be fast and smooth and full featured. It's not. GPU. A clunky API and black box ap
6 0.87489229 1281 high scalability-2012-07-11-FictionPress: Publishing 6 Million Works of Fiction on the Web
7 0.87193304 1582 high scalability-2014-01-20-8 Ways Stardog Made its Database Insanely Scalable
8 0.86312628 1264 high scalability-2012-06-15-Cloud Bursting between AWS and Rackspace
9 0.84603608 1405 high scalability-2013-02-13-7 Sensible and 1 Really Surprising Way EVE Online Scales to Play Huge Games
10 0.84177411 1218 high scalability-2012-03-29-Strategy: Exploit Processor Affinity for High and Predictable Performance
11 0.84031224 850 high scalability-2010-06-30-Paper: GraphLab: A New Framework For Parallel Machine Learning
12 0.83827007 1579 high scalability-2014-01-14-SharePoint VPS solution
13 0.83384907 1634 high scalability-2014-04-18-Stuff The Internet Says On Scalability For April 18th, 2014
14 0.82559413 1038 high scalability-2011-05-11-Troubleshooting response time problems – why you cannot trust your system metrics
15 0.82532269 1329 high scalability-2012-09-26-WordPress.com Serves 70,000 req-sec and over 15 Gbit-sec of Traffic using NGINX
16 0.82175297 1123 high scalability-2011-09-23-The Real News is Not that Facebook Serves Up 1 Trillion Pages a Month…
17 0.82164043 276 high scalability-2008-03-15-New Website Design Considerations
18 0.82132798 787 high scalability-2010-03-03-Hot Scalability Links for March 3, 2010
19 0.82098055 1087 high scalability-2011-07-26-Web 2.0 Killed the Middleware Star
20 0.81967521 427 high scalability-2008-10-22-Server load balancing architectures, Part 2: Application-level load balancing