Thursday, December 31, 2009

Experiment: Does Intel’s Turbo Boost Trump Overclocking?

I still remember the PC I owned back in 1998. It was based on a Pentium II 233 with Intel’s Deschutes core, dropped into an Asus P2B motherboard. That system was fast, but I was a bored engineering student and wanted to do more with it. I started with aftermarket air cooling. And although I don’t remember how much overclocking headroom I was able to realize, I do remember that it wasn’t enough. At one point, I pried the plastic cartridge away from the slot-mounted processor and started experimenting with Peltier coolers for better cooling performance. When the proverbial smoke cleared, I was running at a stable 400 MHz—as fast as the most expensive model available at the time, for significantly less money.

Of course, the overclocks today are a lot more significant than 166 MHz. But the principle remains the same: take a processor running at its default settings and squeeze additional value out of it by trying to match the performance of higher-end and more expensive models. With a little effort, it’s actually quite easy to get a sub-$300 Core i7-920 beyond the performance levels of a $1,000 Core i7-975 Extreme without obliterating its reliability.

What About “Auto-Overclocking?”

Overclocking, in general, has always been a bit of a sore subject with AMD and Intel, which officially have to discourage the practice with threats of voided warranties should your CPU show signs of manipulation. Publically, however, both vendors try to appear enthusiast-friendly by giving away overclocking software, facilitating aggressive BIOSes, and selling CPUs with unlocked clock multipliers. Despite those off-the-record endorsements, though, power users simply accept that there’s no such thing as a free lunch, and killing a CPU with too much voltage is sometimes just part of the game.

But with the introduction of Turbo Boost technology in Intel’s LGA 1366-based Core i7 and the subsequent debut of an even more aggressive implementation in the LGA 1156-based Core i5 and Core i7 processors, Intel took it upon itself to implement a form of intelligent overclocking based on a handful of different factors: voltage, amperage, temperature, and operating system P-state requests directly related to CPU utilization.

Zoom

In monitoring each of those parameters, Intel’s onboard power control unit is able to augment performance by increasing clock rate in situations where the processor’s maximum TDP isn’t being reached. By essentially shutting down unused cores, thereby dropping power consumption, more headroom is freed up in single-threaded workloads, a little less when two threads are active, still less with three cores utilized, and so on. Thus, Intel’s “automatic overclocking” exists as an elegant, more granular way to increase performance without taking power consumption over the maximum TDP rating of any given CPU (130W in the case of Intel’s Bloomfields and 95W in the case of the Lynnfields).

Can You Do Better?

The question we asked ourselves—especially after seeing that the Core i7-860 and -870 would accelerate a fantastic 667 MHz in single-threaded apps—was whether it was still worth it for the power user to go all-out with processor overclocking and risk nuking a perfectly good CPU, or simply let Intel’s version of the technology handle business? I hoped that I wasn’t just getting lazy in my old(er) age, and that there’d still be palpable gains to taking the enthusiast’s path to better performance. But I also wasn’t ready to dismiss the efforts Intel’s engineers made in optimizing Nehalem for balanced performance in single- and multi-threaded software.

Turbo Boost Versus Overclocking


We decided to run a little experiment: take a Core i5-750 and Core i7-860, overclock each, and compare the results to what the two processors can do with Turbo Boost enabled, and then disabled. Of course, we have samples from Intel here in the lab, but we can’t really believe that those are representative of retail models. So we bought both chips off of Newegg, just to be sure. While we debated sticking with Intel’s retail cooling solution, at the end of the day it was decided that we’d never seen 4 GHz if we didn’t augment the reference heatsink. Thus, all of our testing is done with Thermalright’s MUX-120, too.

ATI Radeon HD 5870: DirectX 11, Eyefinity, And Serious Speed

Originally, I titled this piece ATI Radeon HD 5870: Learning From Nvidia's Mistakes. That was an unfair way to kick things off, I decided. But I still want to explain my justification for that idea. When Nvidia launched the GeForce GTX 260 and GTX 280 boards more than a year ago, the company knew it had the fastest board on the market and wasn’t afraid to charge a premium for it; $650, to be exact.

How utterly devastating, then, when the Radeon HD 4870 launched a couple of weeks later, besting the $400 GeForce GTX 260 with a $300 price point. It’s not that ATI had snatched away the performance crown—Nvidia still had the fastest card around. But enthusiasts (especially those who actually bought one of the GeForce GTX 200-series boards) were certainly left feeling gouged when the cards immediately fell to more competitive prices. Good way to earn extra margin on a big GPU. Bad way to encourage brand loyalty.

Without spoiling too much of today’s story, ATI seems to have learned a thing or two from the green faux pas. It’s launching a flagship just under $400 (Ed.: as of November 30th, Radon HD 5870s, when in stock, sell for $410) and a second-in-command board based on the same design at $259 (Ed.: as of November 30th, the least-expensive Radeon HD 5850s sell for $310). That’s still a lot of money, but the two cards are being positioned as GeForce GTX 295 and GeForce GTX 285 killers. Could these boards really knock down Nvidia’s fastest pair at even lower prices?

One card, three monitors, truly useful.One card, three monitors, truly useful.

They Began By Scaring Me

ATI’s Radeon HD 5870 briefing, held in the belly of the decommissioned U.S.S. Hornet aircraft carrier, mixed mainstream press and the more enthusiast-oriented tech folks. So, when the presentation began and the company started talking about buying graphics based on a fuzzy-wuzzy user experience, I started to worry that we’d next hear how 3D gaming was fast enough already. The message was that end-users don't care about megahertz, shader units, or cache repositories; they want smooth gaming, easy transcoding (but call it something cozier, please), and flawless Blu-ray playback. Hopefully that's not entirely true for the enthusiasts here to learn about Cypress, ATI's 2+ billion transistor, 40nm GPU. I'd like the think the engine powering Radeon HD 5870 is actually full of stuff you'll want to know more about.

Zoom

Fortunately, after a group hug and a round of Kumbaya, ATI switched gears and dove into a much more technical round of info-sharing on its Evergreen-series GPUs: everything from the chip's design to the dual-GPU Hemlock, mainstream Juniper, and entry-level Redwood and Cedar, slated for a launch in 2010.

We also took away plenty of information about DirectX 11, Windows 7, stream computing, ATI’s Eyefinity technology, power consumption, video playback, and of course, performance. There’s a lot of ground to cover, so let’s start with a look at the Cypress GPU sitting at the heart of today’s two newcomers.

Intel® Core™2


Intel® Core™2
Intel® Core™2 Duo processor

Investing in new PCs with Intel® Core™2 processor family can mean big savings for your business. Delivering faster performance, greater energy efficiency, and more responsive multitasking, desktop PCs with Intel® Core™2 processor family can help your whole company be more productive.

By combining breakthrough processing speeds with advanced power saving features, desktop PCs with Intel® Core™2 processor family let you get more done in less time than ever before reducing energy costs by an average of 50 percent.¹ Processors built with Intel's unique 45nm technology offer excellent performance as well as unique energy-saving features that help PCs meet ENERGY STAR² requirements. That means reduced power consumption for desktop PCs and lower energy costs for your company.

Product information

Features and benefits

Get the best overall performance with Intel® Core™2 Duo processor you'll get an arsenal of performance-rich technologies, including up to 6MB of shared L2 cache and up to 1333 MHz Front Side Bus.

Featured white paper

Discover next-generation Intel® Core™ microarchitecture built on 45nm high-k metal gate silicon technology.

Compare products

Compare these desktop products:

Enjoy 3X faster multitasking performance with multi-core processing combines two independent processor cores in one physical package.¹ Processors run at the same frequency and share up to 6MB of L2 cache and up to 1333 MHZ Front Side Bus for truly parallel computing with over.

Improve execution time and energy efficiency with more instructions per clock cycle enabled by Intel® Wide Dynamic Execution.

Get smarter, more energy-efficient performance enabled by Intel® Intelligent Power Capability.

Improve system performance enabled by Intel® Smart Memory Access, optimizing the use of the available data bandwidth.

Get higher-performance, more efficient cache subsystem enabled by Intel® Advanced Smart Cache, optimized for multi-core and dual-core processors.

Accelerate a broad range of applications, , including video, speech and image, photo processing, encryption, financial, engineering and scientific applications, enabled by Intel® Advanced Digital Media Boost.

Intel Core i7


Intel Core i7


Intel® Core™ i7 processor

Brilliantly fast

With faster, intelligent, multi-core technology that applies processing power where it's needed most, new Intel® Core™ i7 processors deliver an incredible breakthrough in PC performance. They are the best desktop processor family on the planet.¹

You'll multitask applications faster and unleash incredible digital media creation. And you'll experience maximum performance for everything you do, thanks to the combination of Intel® Turbo Boost technology² and Intel® Hyper-Threading technology (Intel® HT technology)³, which maximizes performance to match your workload.

Product information

  • 3.06 GHz, 2.93 GHz, and 2.66 GHz core speed
  • 8 processing threads with Intel® HT technology
  • 8 MB of Intel® Smart Cache
  • 3 Channels of DDR3 1066 MHz memory

Features and benefits

Go to the next level of multi-core performance.

Intel Core i7 processors deliver an incredible breakthrough in quad-core performance and feature the latest innovations in processor technologies:

Intel® Turbo Boost technology maximizes speed for demanding applications, dynamically accelerating performance to match your workload—more performance when you need it the most.²

Intel® Hyper-Threading technology enables highly threaded applications to get more work done in parallel. With 8 threads available to the operating system, multi-tasking becomes even easier.³

Intel® Smart Cache provides a higher-performance, more efficient cache subsystem. Optimized for industry leading multi-threaded games.

Intel® QuickPath Interconnect is designed for increased bandwidth and low latency. It can achieve data transfer speeds as high as 25.6 GB/sec with the Extreme Edition processor.

Integrated memory controller enables three channels of DDR3 1066 MHz memory, resulting in up to 25.6 GB/sec memory bandwidth. This memory controller's lower latency and higher memory bandwidth delivers amazing performance for data-intensive applications.

Intel® HD Boost significantly improves a broad range of multimedia and compute-intensive applications. The 128-bit SSE instructions are issued at a throughput rate of one per clock cycle, allowing a new level of processing efficiency with SSE4 optimized applications.

WiMAX is the next-generation of wireless technology designed to enable pervasive, high-speed mobile Internet access to the widest array of devices including notebook PCs, handsets, smartphones, and consumer electronics such as gaming devices, cameras, camcorders, music players, and more. As the fourth generation (4G) of wireless technology, WiMAX delivers low-cost, open networks and is the first all IP mobile Internet solution enabling efficient and scalable networks for data, video, and voice. As a major driver in the support and development of WiMAX, Intel has designed embedded WiMAX solutions for a variety of mobile devices supporting the future of high-speed broadband on-the-go.

Driving standards and global networks for WiMAX
WiMAX: Connect in more places, more often
Connecting laptops of the future with WiMAX

Driving standards and global networks for WiMAX

Laying the foundation for broad, cost-effective deployments, Intel is working to easily integrate Intel® WiMAX technology into complex designs and global networks, providing a standards-based foundation for ongoing product innovation.

WiMAX: Connect in more places, more often

Built for the future, Intel® WiMAX technology will allow you to connect in more places, more often, without being restricted to hotspots. When built into notebooks and mobile devices, you'll be able to extend your connected experience beyond Wi-Fi.

Connecting notebooks of the future with WiMAX

With the Intel® WiMAX/WiFi Link 5050 Series module solution, available in notebooks with Intel® Centrino® 2 processor technology, Intel is providing advancements in wireless mobile technology for the future of notebooks and a wide range of consumer devices.