
{ Tag Archives } hadoop

Building Voldemort read-only stores with Hadoop

A well-known lesson in scalability is that writes are 40x more expensive than reads. If your application becomes write-intensive, as is easily the case when you are dealing with a sufficiently large number of users, you will be in trouble if you don’t design to scale. For example, if you are using MySQL, [...]


Building a terabyte-scale data cycle at LinkedIn with Hadoop and Project Voldemort

Many of LinkedIn’s products are critically dependent on computationally intensive data mining algorithms. Examples include modules like People You May Know and Viewers of This Profile Also Viewed, and much of the job-matching functionality that we give to people who post jobs on the site. To support these data-intensive products we have [...]
