Professional Documents
Culture Documents
Synopsys 2012
SASE 2012
Agenda
Introduction
The Evolution of Synthesis
SoC
IC Design Methodology
New Techniques and Challenges
IP Market, an opportunity for Latin America
Synopsys 2012
Introduction
Synopsys 2012
Source: Cisco Visual Networking Index: Global Mobile Data Traffic Forecast Update, 20112016, Feb 14, 2012
Synopsys 2012
A Decade of Digital
Universe Growth
7.910
Zettabytes
8000
7000
6000
5000
4000
3000
2000
1000
130
Exabytes
1.2
Zettabytes
0
2005
Synopsys 2012
2010
2015
Synopsys 2012
Tomorrows World
Reality Augmented Reality Blended Reality
Search Agents Info That Finds You
(and networks that know you)
Synopsys 2012
Synopsys 2012
Computing
Creating Info
Compute Power
Business
At your desk
Work
Synopsys 2012
10
Today Its
Connectivity
Consuming Info
Battery Power
Consumer
Anywhere, anytime
Entertainment
35%
Current
Next
31%
30%
25%
20%
20%
15%
13%
13%
10%
5%
5%
6%
180nm
130nm
5%
4%
3%
0%
250nm
11
90nm
65/55nm
45/40nm
32/28nm
22/20nm
<20nm
40%
35%
30%
50-100M, 6%
25%
20-50M, 7%
>100M, 3%
20%
50-100M, 3%
10-20M, 5%
20-50M, 3%
15%
10-20M, 5%
10%
5-10M, 9%
5-10M, 4%
5%
2-5M, 6%
2-5M, 7%
2010
2011
0%
Synopsys Global User Survey, Feb 2012
Synopsys 2012
12
>2GHz
1-2GHz
751MHz-1GHz
80%
42%
501-750MHz
401-500MHz
60%
301-400MHz
40%
201-300MHz
20%
101-200MHz
51-100MHz
50MHz
0%
2004
2005
2006
13
2007
2008
2009
2010
2011
Other
Back-biasing/Well-biasing
350%
300%
State retention
250%
MTCMOS/Power gating
200%
150%
100%
50%
0%
2010
Synopsys Global User Survey, Feb 2012
N = 282
Synopsys 2012
14
2011
Device segmentation
28 nm is 2X harder than 40 nm
28 nm IP area increases
Complexity
Approximately 1700 design rule checks at 28nm vs. 700 at 65nm
without circuit innovation
28 nm analog layout
9% larger than 40 nm
due to limitations
on poly area
Synopsys 2012
15
40 nm layout
System on a chip
SoC = Software
HW & SW Development Costs
App-Specific SW
$2.50
Low-Level SW
OS Support
$2.00
Design Management
Post-silicon Validation
$M
$1.50
Masks
Physical Design
$1.00
RTL Verification
$0.50
RTL Development
Spec Development
$1 2 3 4 5 6 7 8 9 101112131415161718192021222324252627
Source: IBS, Synopsys
Months
IP Qualification
16
Software
$150
Cost ($M)
$125
Hardware
$100
$75
$50
$25
$0
90nm (60M)
Synopsys 2012
17
65nm (90M)
45/40nm (130M)
32/28nm (180M)
Feature Dimension (Transistor Count)
Source: IBS and Synopsys, 2011
22/20nm (240M)
Unlike Moore
Software Guys are Pessimists
Pages Law: 2009
18
Synopsys 2012
19
Synopsys 2012
20
21
Logic Synthesis
David Gregory, Karen Bartlett, Aart J. de Geus, Gary D.
Hachtel, GE & University of Colorado at Boulder, 1986
Synopsys 2012
22
Front-End
Schematic Capture
Timing Simulation
Back-End
Synopsys 2012
23
Early 90s
The Relationship Needs Improvements Badly:
Walls Now Lead to Iterations, Often Out of Control
Front-End
RTL Simulation
Logic Synthesis
Back-End
Sign-Off
Synopsys 2012
24
Delay Calculation
Timing Simulation
Front-End
Floorplan
Physical Synthesis
Back-End
Sign-Off
Synopsys 2012
25
Floorplan
P&R
2003
90 Nanometers
Interoperability
Synopsys 2012
26
2005
65 Nanometers
Correlation
2007
45/40 Nanometers
Look Ahead
2009
32/28 Nanometers
In-Design
Synopsys 2012
27
28
SoC
Synopsys 2012
29
User inputs:
High-level algorithm
Constraints
c a * b c;
Automation using
High-Level Synthesis
HLS outputs:
HLS
Results
Synopsys 2012
Synthesizable RTL
C-model
RTL testbench
Scripts for synthesis,
verification and
downstream tools
30
Example benefits
2-5X productivity for initial designs
5-10X productivity for design re-use
Increased exploration leading to better results
Multi-million gate designs in weeks vs. months
RTL Coding
Cycle by cycle
functional debug
Algorithm
Design
High-Level
Design
Faster design at
higher abstraction
31
RTL
Verification
Quickly evaluate
multiple architectures
Implementation
Spreadsheets
Synopsys 2012
RTL Verification
Implementation
Architecture
Exploration
Better Designs,
Faster
Emerging
Mix and Match
Bottom Up and Top
Down Flow
Distributed development
Better design preservation and isolation
Design style adjustments needed to achieve
optimal timing Quality of Results (e.g.
registering module boundaries
Synopsys 2012
32
DesignWare
IP
Synplify
Premier/Certify
DW Implementation
Your IP
ASIC Implementation
DesignWare
Building Blocks
Galaxy
DW Implementation
Synopsys 2012
33
34
Building Blocks
Instructions
Instructions
1. Preheat the oven to 450.
2. Melt butter and chocolate together in the top of a double broiler
or in the microwave. Add sea salt.
3. Meanwhile, beat together the egg, egg yolks, and sugar with a
whisk or an electric beater until light
and slightly
foamy.
4. Add the egg mixture to the warm chocolate; whisk quickly to
combine. Add flour and stir just to combine. The batter will be quite
thick.
5. Butter small ramekins, or use Reynolds foil cupcake liners.
6. Divide the batter evenly among the ramekins. (You can make
the cakes in advance to this point and chill them until you're ready
to bake. Be sure to bring the batter back to room temperature
before baking.)
7. Baking time will depend on your oven; start with 7 minutes for a
thin outer shell with a completely molten interior.
8. Melt a little more chocolate to drizzle on top. Sprinkle a little
more salt, and serve with berries or ice cream.
Synopsys 2012
35
Typical
Threshold
Hierarchical
Flat
Instances
Synopsys 2012
36
3M 5M
15M
100M+
#2 Top-Level Style
Requires different discipline
#3 Block Size
Tradeoff size versus TAT
#4 Modeling
Modeling for top-level closure
#5 Top-Level Closure
Meeting the inter-block signals
Synopsys 2012
37
#7 Block-Level Drivers/Loads
Affects block boundary closure
#9 Constraints Management
Affects design closure & TAT
#1 Floorplan
Affects Design Closure
Example 1
vs.
Example 2
Synopsys 2012
38
Logical connectivity
Clock
Voltage areas
Physical size
Multiple Instantiated
Modules (MIM)
Macro Placement
Power Planning
IO Planning
vs.
Challenge
Partitioning Guidelines
Better Approach
#2 Top-Level Style
Requires Different Design Discipline
Channel
Narrow Channel
Abutted
Implementation Complexity
clock
Synopsys 2012
39
Data
#3 Block Size
Tradeoff Size versus TAT (turn around time)
1.5M
1.5M
1.5M
3M
1.5M
5M
1.5M
2M
2M
5M
1.5M
40
#4 Modeling
ETM vs. Abstract Model
Extracted Timing Model (ETM)
Abstract Model
Synopsys 2012
41
#5 Top-Level Closure
Meeting Timing on Inter-Block Signals
Closing top-level inter block
signals can be challenging
Can be minimized with
Chg graphic
Simultaneous optimization of
top-level and inter-block
signals needed
Synopsys 2012
42
Logic
Registers
Logic
Logic
Registers
Adjacent Block
Logic
Registers
Logic
Registers
Adjacent Block
I/O paths are not finalized during early stage block design
Overconstraining these paths direct the tool to focus on I/O paths
instead of the intra-block paths
Accuracy of proportional time budgets is affected if interfaces are
still changing
Synopsys 2012
43
Logic
Logic
Registers
Adjacent Block
Logic
Registers
Logic
Registers
Registers
Adjacent Block
44
Block A
Block B
A
45
If no load
is specified
Synopsys 2012
46
Top to Block
path
Synopsys 2012
47
With Shielding
Use shielding to reduce crosstalk effects between the block- and toplevel t significantly improve timing closure in inter-block critical paths
Use new Transparent Interface Optimization (TIO) in IC Compiler
Synopsys 2012
48
Synopsys 2012
49
size_only attributes
dont_touch attributes
Multi-cycle paths
False paths
Etc.
Performs both consistency and correlation check with user controllable accuracy
level
Supports both pre-route and post-route checks
Synopsys 2012
50
51
Synopsys 2012
52
Source: ITRS 2009; C.A. Malachowsky, NVIDIA, EDPS 2009; P. Saxena, Intel, ISPD 2003
Synopsys 2012
53
IC Design Methodology
Synopsys 2012
54
Synopsys 2012
55
Synopsys 2012
56
Synopsys 2012
57
58
Place
& Route
DRC / LVS
Synopsys 2012
59
Place
& Route
DRC / LVS
Signoff
Signoff
Signoff
Design
Planning
Synthesis
Synthesis
Place
& Route
DRC / LVS
Synthesis
Exploration
Synthesis
2011Exploration
2009-2010
In-Design
Place
& Route
DRC / LVS
Implementation
2005-2008
Look-ahead
Signoff
2000-2005
Correlation
Figure 1
Figure 2
60
#2 Setup
Correlation and Runtime
#3 Scripts
Impacts Your Design
#4 Constraints
Watch Your Constraints
#5 Analyze
Analyze-Fix-Proceed
Synopsys 2012
61
#6 Methodology
One or Two Flows
#7 Optimization
Adjust Accordingly
#8 Signoff
Review Your Environment
#9 Performance
Leverage Your EDA Partner
After
Optimization
Original Area
New Area
62
Technology and IP
Make Sure to Have a Good Quality Library
A properly designed set of library
cells give optimization engines more
choice
Synopsys 2012
63
Example:
Cell Sensitivity To Load Uncertainty
Cell A
Delay
Cell B
D
*
C*
Cload
Netlist v1.1
SDC v1.1
Compile
3.2M
instances
Compile
6.8M
instances??
Synopsys 2012
64
What
happened???
DC Utility
Checker
ICC Utility
Checker
PT Utility
Checker
Synopsys 2012
65
Incomplete
Complete
Synopsys 2012
66
Recommendations
Synopsys 2012
67
Input Delay
Output Delay
Time Available
for logic
Synopsys 2012
68
Circuit A
Circuit B
Cost = pi * wi
Default Weights
CLKA weight = 1
CLKB weight = 1
0.30
0.10
0.28
0.15
0.40
<
0.43
Adjusted Weights
CLKA weight = 10
CLKB weight = 1
3.00
0.10
2.80
0.15
3.10
Synopsys 2012
69
>
2.95
#5 Analyze: Analyze-Fix-Proceed
Push Button Flow
does not exists
Synopsys 2012
70
insert_dft
IC Compiler
StarRC
PrimeTimeSI
Synopsys 2012
71
place_opt -spg
clock_opt
route_opt
signoff_opt
Signoff extraction
Signoff STA
Analyze
results
between
design
stages
72
Exploration flow
target for
early specs
& constraints
Implementation
flow
for final
design
realization
45 nanometers (2010)
96mm2, ~ 300M transistors
7-9W
Exploration
Implementation
RTL
Design Compiler
Look-ahead & Physical Guidance
Creates a better starting point
RTL
Exploration
RTL
Synthesis
IC Compiler
Design Exploration
Creates initial floorplan
Design
Exploration
Design
Planning
Block Feasibility
Determines physical feasibility
Block
Feasibility
Block
Implementation
Synopsys 2012
73
Physical
Design
Compiler
Synopsys 2012
74
1,000
950
900
1029
Timing Closure
Profile
971
913
850
Do Not
over
Place
Clock
Route
Complicate your flow
Addnl. Customization For High-Performance
Tuned For Hi-Performance/Low Power
Synopsys 2012
1,050
Synthesis
Route
1,100
800
CTS
MHz
75
RM (Baseline)
60
128
50
112
96
40
80
30
64
20
48
32
10
16
0
1.1
1.2
5.5
37.0
Instances (Million)
50+
0
1.1
1.2
5.5
76
37.0
Instances (Million)
50+
Reading parasitics
Use binary parasitics when
possible
Multiple timing updates
Eliminate redundant/legacy
update_timing steps
Inefficient TCL scripting and
reporting
Synopsys 2012
77
Reduce time-to-results
Built on Synopsys RM
Understand the new
technologies and features
Easy to use
Synthesis
P&R
Automated methodology to
achieve 90% of target quickly
Additional advanced
techniques to reach final goal
Minimize number of iterations
or trial and errors
Reduce ECO efforts
Iterations
Signoff + ECO
Typical Flow
HSLP Flow
Design Schedule
Synopsys 2012
78
Targets
Typical Flow on
High Performance designs
100%
90%
Typical Flow
75%
With HSLP
Implementation
Best Practices
HSLP Flow
Reduces time-to-results
Time
Synopsys 2012
79
Design-specific
customization
VDDB
VDD
VDDI
IN
VDDB
VDDO
L
S
VDD
OUT
VDD
VDDB
on/off
IN
ISO
RR
OUT
IN
OUT
AO
EN
Gate
Gate
Gate
VSS
VSS
DESIGN TECHNIQUES
0.9V
0.7V
Isolation
Cells
VSS
Retention
Registers
Alwayson Logic
0.9V
OFF
0.9V
0.9V
Multiple Voltage
(MV) Domains
Level
Shifters
VSS
Power
Switches
(MTCMOS)
0.9V
Multi-Supply with
shutdown
No State Retention
OFF
0.9V
0.7V
Multi-voltage with
shutdown & State
Retention
0.9V
OFF
SR
0.9V
0.7V
Multi-Voltage with
shutdown
0.9V
Synopsys 2012
80
Synopsys 2012
81
Synopsys 2012
82
Source M. van den Brink, ASML, ITF 2009; P. Magarshack, STMicroelectronics, 2010
Synopsys 2012
83
Synopsys 2012
84
Synopsys Solution
DPT Ready IC Compiler P&R, and IC Validator DRC
Wide Spacing Enforced
85
Synopsys Solution
DPT Ready IC Compiler P&R, and IC Validator DRC
86
32 Nanometer Planar
Performance Power
87
22 Nanometer Tri-Gate
Performance Power
88
89
Synopsys 2012
90
FinFET Advantages
FinFET vs Planar Transistor
Planar
Reduced leakage
Enhanced electron
mobility
FinFET
Fin
Source: Intel
Synopsys 2012
91
Depleted substrate
92
C4
Memory
Cube
Synopsys 2012
TSV
93
Silicon Interposer
3D Stack
Bump
3D-IC
Vertically stacked dies with TSVs
Drivers: Wireless handset, Processors
Benefits: Performance, form factor
Synopsys 2012
94
560 microns
50 microns
1
Source: C.-G. Hwang, Samsung, IEDM 2006
Synopsys 2012
95
Synopsys 2012
96
IP
Intellectual property core, IP core, or IP block is
a reusable unit of logic, cell, or chip layout design
that is the intellectual property of one party
IP cores may be licensed to another party or can be
owned and used by a single party alone
Synopsys 2012
97
IP
Synopsys 2012
98
Physical libaries
3%
Other IP
4%
GP Analog/MS
4%
Processors
(CPUs, GPUs, DSPs)
Memory Cells/Blocks
10%
Microprocessors
39%
Wired Interfaces
19%
Fixed Function
(GPUs, Security)
15%
99
DSP
5%
14.0%
1,800.0
12.0%
1,600.0
$M
1,200.0
8.0%
1,000.0
6.0%
800.0
600.0
4.0%
400.0
2.0%
200.0
0.0
Semiconductor IP Market Size
Synopsys Share
100
CY04
964.0
7.9%
CY05
1,068.3
7.6%
CY06
1,267.3
7.3%
CY07
1,378.2
7.2%
CY08
1,464.1
7.2%
CY09
1,351.0
9.1%
CY10
1,695.0
11.3%
CY11
1,910.9
12.4%
0.0%
Synopsys Share
10.0%
1,400.0
Rank
1
2
3
4
5
6
7
8
9
10
Company
2010
ARM Hol di ngs
575.8
Synops ys
191.8
Ima gi na ti on Technol ogi91.5
es
MIPS Technol ogi es
85.3
Ceva
44.9
Si l i con Ima ge
38.5
Ra mbus
41.4
Tens i l i ca
31.5
Mentor Gra phi cs
27.3
AuthenTec
19.6
Synopsys 2012
101
2011
732.5
236.2
126.4
72.1
60.2
42.8
38.9
36.3
23.6
22.8
Growth
27.2%
23.2%
38.1%
-15.5%
34.1%
11.2%
-6.0%
15.2%
-13.8%
16.3%
2011 Share
38.3%
12.4%
6.6%
3.8%
3.2%
2.2%
2.0%
1.9%
1.2%
1.2%
70
100
60
% Design Reuse
80
50
60
40
40
30
IP Subsystems
20
20
IP Blocks
0
10
2005
2006
Synopsys 2012
102
2007
2008
2009
2010
2011
2012
2013
2014
% Design Reuse
Subsystems:
The Next Evolution in The IP Market
What is a Subsystem?
Complete
Solution: HW, SW,
Prototype
Synopsys 2012
103
Pre-integrated
and Verified
SoC Ready:
Seamlessly Dropin and Go
Thank You
Synopsys 2012
104