Today Futuremark is pulling the covers off their new Time Spy benchmark, which is being released for all Windows editions of 3DMark. A showcase of sorts of the last decade or so of 3DMark benchmarks, Time Spy is a modern DirectX 12 benchmark implementing a number of the API's important features. All of this comes together in a demanding test for those who think their GPU hasn’t earned its keep yet.

DirectX 12 support for game engines has been coming along for a few months now. To join the fray, Futuremark has written the Time Spy benchmark on top of a pure DirectX 12 engine. This brings features such as asynchronous compute, explicit multi-adapter, and of course multi-threading/multi-core work submission improvements. All of this comes together into a benchmark that I think is not only visually interesting, but that also borrows a large number of assets from 3DMark benchmarks past.

For those who haven’t been following the 3DMark franchise for more than a decade, there are portions of the prior benchmarks showcased as shrunken museum exhibits. These exhibits come to life as the titular Time Spy wanders the hall, giving a throwback to past demos. I must admit a bit of fun was had watching to see what I recognized. I personally couldn’t spot anything older than 3DMark 2005, but I would be interested in hearing about anything I missed.

Unlike many of the benchmarks exhibited in this museum, the entirety of this benchmark takes place in the same environment. Fortunately, the large variety of eye candy present gives a varied backdrop for the tests presented. To add a bit of story, we see a crystalline ivy entangled with the entire museum. In parts of the exhibit there are deceased figures in orange hazmat suits, showing signs of a previous struggle. Meanwhile, the Time Spy examines the museum with a handheld time portal. Through said portal she can view a bright and clean museum, and bustling air traffic outside. I’ll not spoil the entire brief story here, but the benchmark does a good job of providing both eye candy for the newcomers and tributes for the enthusiasts who will spend ample time watching the events unfold.

From a technical perspective, this benchmark is, as you might imagine, designed to be the successor to Fire Strike. The system requirements are higher than ever, and while Fire Strike Ultra could run at 4K, 1440p is enough to bring even the latest cards to their knees with Time Spy.

Under the hood, the engine only makes use of FL 11_0 features, which means it can run on video cards as far back as GeForce GTX 680 and Radeon HD 7970. At the same time it doesn't use any of the features from the newer feature levels, so while it ensures a consistent test between all cards, it doesn't push the very newest graphics features such as conservative rasterization.

That said, Futuremark has definitely set out to make full use of FL 11_0. Futuremark has published an excellent technical guide for the benchmark, which should go live at the same time as this article, so I won't recap it verbatim. But in brief, everything from asynchronous compute to resource heaps gets used. In the case of async compute, Futuremark is using it to overlap rendering passes, though they do note that "the asynchronous compute workload per frame varies between 10-20%." On the work submission front, they're making full use of multi-threaded command queue submission, noting that every logical core in a system is used to submit work.
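To make the work submission idea a bit more concrete, here is a purely conceptual Python sketch of the pattern: split a frame's draw calls into one chunk per logical core, record each chunk on its own worker, then hand the recorded lists to a single queue in a fixed order. This is not Futuremark's code and it does not touch the Direct3D 12 API; the names `record_command_list` and `build_frame` are hypothetical stand-ins, and the "commands" are just strings.

```python
import math
import os
from concurrent.futures import ThreadPoolExecutor


def record_command_list(worker_id, draw_calls):
    """Hypothetical stand-in for recording one command list on one worker."""
    return [f"worker{worker_id}:draw({call})" for call in draw_calls]


def build_frame(draw_calls, workers=None):
    """Record a frame's draw calls in parallel, then submit them in order."""
    # Assumption: one worker per logical core, mirroring the description above.
    workers = workers or os.cpu_count() or 1
    # Contiguous chunks, so concatenating the lists keeps the original order.
    size = max(1, math.ceil(len(draw_calls) / workers))
    chunks = [draw_calls[i:i + size] for i in range(0, len(draw_calls), size)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # map() returns results in chunk order regardless of which finishes first.
        command_lists = list(
            pool.map(record_command_list, range(len(chunks)), chunks)
        )
    # Stand-in for a single submission of every recorded list to one queue.
    return [cmd for cmd_list in command_lists for cmd in cmd_list]


if __name__ == "__main__":
    for command in build_frame(list(range(8)), workers=4):
        print(command)
```

The point of the pattern is that recording happens in parallel while the final submission order stays deterministic, which is why the sketch collects results in chunk order rather than completion order.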

Meanwhile on the multi-GPU front, Time Spy is also mGPU capable. Futuremark is essentially meeting the GPUs half-way here, using DX12 explicit multi-adapter's linked-node mode. Linked-node mode is designed for matching GPUs - so there aren't any Ashes-style wacky heterogeneous configurations supported here - trading off some of the fine-grained power of explicit multi-adapter for the simplicity of matching GPUs and useful features that can only be done with matching GPUs such as cross-node resource sharing. For their mGPU implementation Futuremark is using the otherwise common alternate frame rendering (AFR), which for a non-interactive demo should offer the best performance.
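For anyone unfamiliar with alternate frame rendering, the scheduling idea is simple enough to sketch in a few lines of Python: each GPU renders whole frames, and consecutive frames are handed out round-robin. This is only an illustration of the general technique, not Futuremark's implementation; `afr_schedule` and `ideal_afr_fps` are hypothetical helpers, and the throughput figure assumes perfect scaling with no synchronization or transfer cost.

```python
def afr_schedule(frame_count, gpu_count):
    """Alternate frame rendering: frame i is rendered whole by GPU i % gpu_count."""
    if gpu_count < 1:
        raise ValueError("need at least one GPU")
    return [(frame, frame % gpu_count) for frame in range(frame_count)]


def ideal_afr_fps(single_gpu_fps, gpu_count):
    """Upper bound on throughput, assuming perfect scaling (an idealization)."""
    return single_gpu_fps * gpu_count


if __name__ == "__main__":
    for frame, gpu in afr_schedule(6, 2):
        print(f"frame {frame} -> GPU {gpu}")
    print("ideal fps with 2 GPUs at 30 fps each:", ideal_afr_fps(30, 2))
```

Because each GPU works on a different frame, AFR raises throughput rather than shortening the time any single frame takes, which is generally a better fit for a canned demo than for latency-sensitive gameplay.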

3DMark Time Spy Benchmark: 1440p

To take a quick look at the benchmark, we ran the full test on a small number of cards on the default 1440p setting. In our previous testing AMD’s RX 480 and R9 390 traded blows with each other and NVIDIA’s GTX 970. Here though, the RX 480 pulls a small lead over the R9 390 while they both leave a slightly larger gap ahead of the GTX 970, only for the GeForce GTX 1070 to appropriately zip past the lot of them.

The graphics tests scale similarly to the overall score in this case, and if these tests were a real game anything less than the GTX 1070 would provide a poor gameplay experience with framerates under 30 fps. While we didn’t get any 4K numbers off our test bench, I ran a GTX 1080 in my personal rig (i7-2600k @4.2GHz) and saw 4K scores that were about half of my 1440p scores. While this is a synthetic test, the graphical demands this benchmark can place on a system provide a plenty hefty workload for anyone seeking one out.
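As a rough sanity check on that "about half" figure, the pixel arithmetic can be sketched as below. The assumption baked in here, and it is only an assumption, is that the score would scale inversely with pixel count if the workload were purely pixel-bound; `pixel_count` and `pixel_bound_score_fraction` are hypothetical helper names.

```python
def pixel_count(width, height):
    """Number of pixels rendered per frame at a given resolution."""
    return width * height


def pixel_bound_score_fraction(low_res_pixels, high_res_pixels):
    """Fraction of the low-res score expected at high res if purely pixel-bound."""
    return low_res_pixels / high_res_pixels


QHD = pixel_count(2560, 1440)  # 1440p
UHD = pixel_count(3840, 2160)  # 4K

if __name__ == "__main__":
    print("4K has", UHD / QHD, "times the pixels of 1440p")
    print("pixel-bound score fraction:", pixel_bound_score_fraction(QHD, UHD))
```

4K pushes 2.25 times as many pixels as 1440p, so a purely pixel-bound workload would land at roughly 44% of the 1440p score. Seeing about half suggests that not all of the per-frame cost scales with resolution, which is what one would expect.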

Meanwhile, for the Advanced and Professional versions of the benchmark there's an interesting ability to run it with async compute disabled. Since this is one of the only pieces of software out right now that can use async on Pascal GPUs, I went ahead and quickly ran the graphics test on the GTX 1070 and RX 480. It's not an apples-to-apples comparison in that they have much different performance levels, but for now it's the best look we can take at async on Pascal.

3DMark Time Spy Benchmark: Async Compute

Both cards pick up 300-400 points in score. On a relative basis this is a 10.8% gain for the RX 480, and a 5.4% gain for the GTX 1070. As always when working with async, though, I should note that the primary performance benefit as implemented in Time Spy comes via concurrency, so everything here is dependent on a game having additional work to submit and a GPU having execution bubbles to fill.
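For readers who want to run the same comparison on their own cards, the relative gain is just the score difference divided by the async-off score. The Python sketch below shows that calculation, plus the inverse: what baseline a given point gain and percentage imply. The function names are hypothetical, and the inputs in the usage section are only the rounded figures quoted above, not exact chart values.

```python
def relative_gain_percent(score_off, score_on):
    """Percentage gain from enabling async compute, relative to async off."""
    return (score_on - score_off) / score_off * 100.0


def implied_baseline(points_gained, gain_percent):
    """Async-off score implied by an absolute gain and a relative gain."""
    return points_gained / (gain_percent / 100.0)


if __name__ == "__main__":
    # Rounded figures from the text: 300-400 points, 10.8% and 5.4%.
    for label, gain in (("RX 480", 10.8), ("GTX 1070", 5.4)):
        low = implied_baseline(300, gain)
        high = implied_baseline(400, gain)
        print(f"{label}: implied async-off score between {low:.0f} and {high:.0f}")
```

Since both cards gain a similar number of points, the card with the lower baseline naturally shows the larger percentage, which is worth keeping in mind when reading the relative figures.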

The new Time Spy test will be coming today to Windows users of 3DMark. This walk down memory lane not only puts demands on the latest gaming hardware but also provides another showcase of the benefits DX12 can bring to our games. To anyone who’s found Fire Strike too easy a benchmark, keep an eye out for Time Spy in the near future.

  • donkay - Thursday, July 14, 2016

    I could copy paste parts of the article here, or you could just read the full article. There's more here than just charts you know.
  • godrilla - Friday, July 15, 2016

    With true low level APIs optimization Teraflops finally matter, and yes nvidias hardware is already almost fully used amd's on the other hand has the most to gain.
  • Yojimbo - Friday, July 15, 2016

    I've been thinking about it. I think GCN was AMD's first architecture whose foundations were laid down post-merger. I wonder if AMD built GCN more with Fusion in mind than with DirectX 11 in mind. If that's true then maybe there is hope for AMD when Navi comes out. Perhaps Navi will be AMD's first post GCN architecture, with foundations laid down after Fusion had already failed.

    DirectX 12 seems to expose GCN better than DirectX 11, but GCN is still a lot less efficient in DirectX 12 than Pascal.
  • JRW - Friday, July 15, 2016

    Dang my old timey i7 920 (overclocked 3.5Ghz) and R9 290X 4GB scored a 3878, didn't expect that higher considering CPU.
  • scottjames_12 - Friday, July 15, 2016

    Yeah, my old faithful i7-930 (@ 3.99 Ghz) puts out virtually identical FPS numbers in graphics test 1 and 2, when compared with a 6700k that has a similarly clocked GTX 970 also. Pretty safe to say those tests are GPU limited, but it is good to know the CPU isn't holding the GPU back.
  • Oxford Guy - Friday, July 15, 2016

    "It's not an apples-to-apples comparison in that they have much different performance levels, but for now it's the best look we can take at async on Pascal."

    Why is the 480 the best choice for the comparison? Why not add in a 390X, Nano, or something?
  • Oxford Guy - Friday, July 15, 2016

    The best choice for Nvidia maybe:

    Extreme Tech:

    "The RX 480 is just one GPU, and we’ve already discussed how different cards can see very different levels of performance improvement depending on the game in question — the R9 Nano picks up 12% additional performance from enabling versus disabling async compute in Ashes of the Singularity, whereas the RX 480 only sees a 3% performance uplift from the same feature."
  • pencea - Friday, July 15, 2016

    I just ran a benchmark test on the new 3dmark Time Spy DX12 with the GTX 1080.

    I uploaded the video here for those who's interested to see how the card performs.
  • Deders - Friday, July 15, 2016

    Wow, and it still dips close to 30fps in the 2nd test. I wonder what AMD's 480 gets at that point.
  • Deders - Friday, July 15, 2016

    Not sure if the stone horse is a reference to one of the sub tests. It's the same pose and colour, with added aliens.
