![]() |
|
|||||||||||||||||||||||||
| Ravi Pandya software | nanotechnology | economics |
ARCHIVES
ABOUT ME
DISCLAIMER
|
|||||||||||||||||||||||||
|
Mon 14 Apr 2008
Supple:
automatically generating user interfaces, by Krzysztof Gajos and
Dan Weld 05:48 # Sun 20 Jan 2008 Economics of Software as a Service Delivering Software as a Service - McKinsey Quarterly Two interesting data points: While SaaS companies have lower operating margins than packaged
software companies overall, if you restrict the comparison to
comparably-sized packaged software companies, the numbers look
essentially identical: However, if you look at the customer cost side, there's a huge
advantage to SaaS: 16:05 #
Multistage Collaboration in CACHE: The Bayes Community Model 15:41 #
A
Logic of Filesystems
Computation Spreading: Employing Hardware Migration to Specialize CMP
Cores On-the-fly A
Predictive Model for Transcriptional Control of Physiology in a Free
Living Cell Forever
Minus a Day? Some Theory and Empirics of Optimal Copyright 08:33 # Fri 23 Nov 2007 My colleague Eric Northup has mentioned these a few times, and I'm glad I looked them up. The Tornado OS (from my alma mater, U of T) and its successor K42 (at IBM Research) use a fine-grained object-oriented approach to all operating system structures (processes, memory regions, etc.), with built-in clustering for replicated instances across processors. This reduces lock contention and increases cache locality by operating on the per-processor instance as much as possible. Since objects are generally expected to be local, it can optimize for this case, and track cross-processor operations as a special case. There are some policy choices (e.g. maintaining replica tables for all processors) that would probably need to be adapted for manycore. The scalability architecture is best described in this paper. The memory manager was key, e.g. for locality-aware allocation, padding to cache line size to avoid false sharing, deferring deletion until quiescence to avoid existence locks, etc. An insight as the basic Tornado model was applied to real workloads was that creation-time object specialization isn't sufficient, instead it is better to for example start with an unshared implementation and then upgrade to shared implementation when multiple processes share an object. They were able to improve their 24-proc scalability from "terrible" to pretty good in 2 weeks of work because of good OO discipline and tracing infrastructure. Overall, I found it striking how the scalability architecture mirrored that for distributed systems - state partitioning, replication, dynamic upgrade, etc. I had expected this from general principles, but it was valuable to see it confirmed in practice with significant workloads. The scalability graphs are impressively linear. Security isn't mentioned, but I expect that the same OO design that gives the OS good modularity and scalability could be applied to give it good capability discipline as well. 12:04 #
I was looking through the slides from the
NCSA Petascale BOF session at SC07. The slides weren't a
particularly good substitute for the actual presentation,
unfortunately. However, this graph caught my eye - in case you were
wondering whether we'd hit the single-processor scaling wall, it
leaves little room for doubt: 11:51 # |
||||||||||||||||||||||||||
© 2002-2004 Ravi Pandya | All Rights Reserved |