Channel: Intel® Software Development Emulator

Buying or Renewing Licenses for Intel® Software Development Products


Intel offers a variety of licensing options for its software development products. The available options for purchasing or renewing licenses for Intel® software are listed below.

You can also download free 30-day evaluation versions of Intel® software development products. Free evaluation versions of our products are available from the software evaluation center.

All prices listed below are for a single-developer commercial license. All prices are manufacturer's suggested retail prices and may change without prior notice. Prices DO NOT include VAT or other applicable taxes and fees.

  • For information about transferable licenses, node-locked licenses, and other licensing options, contact a reseller or an Intel representative at intel.software.sales@intel.com.
  • To purchase a license for scientific, research, or educational institutions, select the product you need; the discounted price will be shown at checkout. For more information about all of our academic products, visit the academic offerings center or contact an Intel representative at academicdevelopersinfo@intel.com.
  • Support renewal: one year of support starting from the expiration date of your current support agreement.
  • Existing customers qualify for special upgrade pricing on Intel® Parallel Studio XE 2013, Intel® C++ Studio XE 2013, and Intel® Fortran Studio XE 2013. Learn more about upgrade options.

Category | Product MSRP (single user) | Support renewal MSRP (single user) | Options

Product Suites

Intel® Parallel Studio XE 2013 for Windows*
Includes Intel® Composer XE 2013, Intel® VTune™ Amplifier XE 2013, Intel® Inspector XE 2013, and Intel® Advisor XE 2013
$2,299 | $799** | Find a reseller › | All options ›

Intel® Parallel Studio XE 2013 for Linux*
Includes Intel® Composer XE 2013, Intel® VTune™ Amplifier XE 2013, Intel® Inspector XE 2013, and Intel® Advisor XE 2013
$2,299 | $799** | Find a reseller › | All options ›

Intel® C++ Studio XE 2013 for Windows or Linux
Includes Intel® C++ Composer XE 2013, Intel® VTune™ Amplifier XE 2013, Intel® Inspector XE 2013, and Intel® Advisor XE 2013
$1,599 | $599** | Find a reseller › | All options ›

Intel® Visual Fortran Studio XE 2013 for Windows
Includes Intel® Visual Fortran Composer XE 2013, Intel® VTune™ Amplifier XE 2013, Intel® Inspector XE 2013, and Intel® Advisor XE 2013
$1,899 | $699** | Find a reseller › | All options ›

Intel® Fortran Studio XE 2013 for Linux
Includes Intel® Fortran Composer XE 2013, Intel® VTune™ Amplifier XE 2013, Intel® Inspector XE 2013, and Intel® Advisor XE 2013
$1,899 | $699** | Find a reseller › | All options ›

Intel® Parallel Studio
Includes Intel® Parallel Advisor, Intel® Parallel Amplifier, Intel® Parallel Composer, and Intel® Parallel Inspector
$320 | Find a reseller › | All options ›

Intel® Cluster Studio XE 2013 for Windows
Includes Intel® C++ Composer XE 2013, Intel® Visual Fortran Composer XE 2013, Intel® Trace Analyzer and Collector 8.1, Intel® MPI Library 4.1, Intel® MPI Benchmarks, Intel® Inspector XE 2013, Intel® VTune™ Amplifier XE 2013, and Intel® Advisor XE 2013
$2,949 | $1,049** | Find a reseller › | All options ›

Intel® Cluster Studio XE 2013 for Linux
Includes Intel® C++ Composer XE 2013, Intel® Fortran Composer XE 2013, Intel® Trace Analyzer and Collector, Intel® MPI Library 4.1, Intel® MPI Benchmarks, Intel® Inspector XE 2013, Intel® VTune™ Amplifier XE 2013, and Intel® Advisor XE 2013
$2,949 | $1,049** | Find a reseller › | All options ›

Intel® Cluster Studio 2013 for Windows
Includes Intel® Composer XE 2013, Intel® Trace Analyzer and Collector 8.1, Intel® MPI Library 4.1, and Intel® MPI Benchmarks
$2,049 | $749** | Find a reseller › | All options ›

Intel® Cluster Studio 2013 for Linux
Includes Intel® Composer XE 2013, Intel® Trace Analyzer and Collector 8.1, Intel® MPI Library 4.1, and Intel® MPI Benchmarks
$2,049 | $749** | Find a reseller › | All options ›

Compilers and Libraries

Intel® Composer XE 2013 for Windows
Includes Intel® C++ Composer XE 2013 and Intel® Visual Fortran Composer XE 2013
$1,199 | $449** | Find a reseller › | All options ›

Intel® Composer XE 2013 for Linux
Includes Intel® C++ Composer XE 2013 and Intel® Fortran Composer XE 2013
$1,449 | $499** | Find a reseller › | All options ›

Intel® C++ Composer XE 2013 for Windows, Linux, or OS X*
Includes Intel® C++ Compiler, Intel® Integrated Performance Primitives 7.1, Intel® Math Kernel Library 11.0, and Intel® Parallel Building Blocks
$699 | $249** | Find a reseller › | All options ›

Intel® Visual Fortran Composer XE 2013 for Windows
Includes Intel® Visual Fortran Compiler and Intel® Math Kernel Library 11.0
$849 | $299** | Find a reseller › | All options ›

Intel® Fortran Composer XE 2013 for Linux
Includes Intel® Fortran Compiler and Intel® Math Kernel Library 11.0
$999 | $349** | Find a reseller › | All options ›

Intel® Fortran Composer XE 2013 for OS X
Includes Intel® Fortran Compiler and Intel® Math Kernel Library 11.0
$849 | $299** | Find a reseller › | All options ›

Intel® C++ Compiler Professional Edition with support for the QNX Neutrino* RTOS
Includes Intel® C++ Compiler and Intel® Integrated Performance Primitives 7.1
$599 | $240 | All options ›

Intel® C Compiler for EFI Byte Code
$995 | $398 | Find a reseller › | All options ›

Intel® Visual Fortran Composer XE 2013 for Windows with IMSL 6.0*
Includes Intel® Visual Fortran Compiler, the IMSL* Fortran Numerical Library, and Intel® Math Kernel Library 11.0. Includes one developer license and one deployment license intended for the developer. Providing applications that contain IMSL code to users other than developers requires a deployment license.
$2,049 | $749** | Find a reseller › | All options ›

IMSL* Run-Time Licenses (also called IMSL* Deployment Licenses)

IMSL* Licensing FAQ

Commercial single-user run-time license for applications with IMSL code on systems with up to 16 processor cores
$2,049 | $685 | Find a reseller › | All options ›

Pack of 10 commercial single-user run-time licenses for applications with IMSL code on systems with up to 16 processor cores
$9,709 | $1,826 | Find a reseller › | All options ›

Commercial multi-user run-time license for applications with IMSL code on systems with up to 64 processor cores
$13,592 | $2,557 | Find a reseller › | All options ›

Tools for the Intel® Atom™ Processor

Intel® Embedded Software Development Tool Suite for the Intel® Atom™ Processor
Includes Intel® C++ Compiler, Intel® Application Debugger, Intel® JTAG Debugger, Intel® Integrated Performance Primitives 7.1, and Intel® VTune™ Performance Analyzer
$1,999 | $799 | Find a reseller › | All options ›

Performance Libraries

Intel® Integrated Performance Primitives 7.1 for Windows or Linux
$199 | $69** | Find a reseller › | All options ›

Intel® Math Kernel Library 11.0 for Windows or Linux
$499 | $179** | Find a reseller › | All options ›

Intel® Threading Building Blocks 4.1 for Windows, Linux, or OS X
$499 | $179** | Find a reseller › | All options ›

Application Performance Analyzers

Intel® VTune™ Amplifier XE 2013 for Windows or Linux
$899 | $349** | Find a reseller › | All options ›

Memory and Thread Checking Tools

Intel® Inspector XE 2013 for Windows or Linux
$899 | $349** | Find a reseller › | All options ›

Cluster Tools

Intel® Cluster Studio XE 2013 for Windows
Includes Intel® C++ Composer XE 2013, Intel® Fortran Composer XE 2013, Intel® Trace Analyzer and Collector, Intel® MPI Library, Intel® MPI Benchmarks, Intel® Inspector XE 2013, Intel® VTune™ Amplifier XE 2013, and Intel® Advisor XE 2013
$2,949 | $1,049** | Find a reseller › | All options ›

Intel® Cluster Studio XE 2013 for Linux
Includes Intel® C++ Composer XE 2013, Intel® Fortran Composer XE 2013, Intel® Trace Analyzer and Collector, Intel® MPI Library 4.1, Intel® MPI Benchmarks, Intel® Inspector XE 2013, Intel® VTune™ Amplifier XE 2013, and Intel® Advisor XE 2013
$2,949 | $1,049** | Find a reseller › | All options ›

Intel® Cluster Studio 2013 for Windows
Includes Intel® Composer XE 2013, Intel® Trace Analyzer and Collector 8.1, Intel® MPI Library 4.1, and Intel® MPI Benchmarks
$2,049 | $749** | Find a reseller › | All options ›

Intel® Cluster Studio 2013 for Linux
Includes Intel® Composer XE 2013, Intel® Trace Analyzer and Collector 8.1, Intel® MPI Library 4.1, and Intel® MPI Benchmarks
$2,049 | $749** | Find a reseller › | All options ›

Intel® MPI Library 4.1 for Windows or Linux
$499 | $179** | Find a reseller › | All options ›

System Modeling and Simulation Tools

CoFluent Studio*
N/A | N/A | All options ›

CoFluent Reader
N/A | N/A | All options ›

Graphics Application Profiling and Debugging Tools

Intel® Graphics Application Profiling and Debugging Tools
N/A | N/A | All options ›

**You can renew a license at the lowest price only if you do so before your previous subscription expires. For more information about license renewals, click here.

Intel is committed to protecting your privacy. To learn about our current practices for collecting and processing customer personal information, Intel product serial number data, and other data, see our privacy notice and the serial number verification notice.


Fun with Intel® Transactional Synchronization Extensions


By now, many of you have heard of Intel® Transactional Synchronization Extensions (Intel® TSX). If you have not, I encourage you to check out this page (http://www.intel.com/software/tsx) before you read further. In a nutshell, Intel TSX provides transactional memory support in hardware, making life easier for developers who need to write synchronization code for concurrent and parallel applications. It comes in two flavors: Hardware Lock Elision (HLE) and Restricted Transactional Memory (RTM). If you haven't read the background, go and do so now, since from here on I assume that you have that basic knowledge.

For the past few years I have been developing a Pin-based emulator for Intel TSX. The emulator is now integrated into the Intel Software Development Emulator (Intel SDE). During its development I had a lot of grins and grimaces with respect to HLE/RTM. I would like to share three particularly memorable incidents.

The Incidents

Example 1.

The following codelet is part of a test program written by a colleague of mine who wanted to learn how to use RTM. With the array ‘data’ containing integer values and the array ‘group’ mapping the data’s elements to slots in the array ‘sums’, the test program tries to store the sum of the data belonging to a group in the corresponding slot of the array ‘sums’. Since multiple threads may access the same slot simultaneously, each addition is performed in an RTM transaction. When a transaction aborts, the thread re-executes the addition in the critical section along the fallback path (i.e., the ‘else’ branch). Do you think it is correct? If you don’t, can you spot what is wrong?

#pragma omp parallel for
    for(int i = 0; i < N; i++){
        int mygroup = group[i];
        if(_xbegin()==-1) {
            sums[mygroup] += data[i];
            _xend();
        } else {
            #pragma omp critical
            {
                sums[mygroup] += data[i];
            }
        }
    }

Example 2.

I was taught that code reuse is important when I was in school (sorry, not in kindergarten ;^)). So, I decided to put to work what I learned when a need arose to write an RTM test. The test was similar to the one in Example 1, except that this test alternates RTM and HLE transactions. (Notice that the test does not have the non-speculative fallback path required for the RTM transaction. Having no fallback path makes the test UNSAFE because Intel TSX does not guarantee forward progress; i.e., it can abort RTM transactions forever.) The test has two addition statements: one is protected with RTM and the other is protected with HLE. Quite a feat, eh? I felt proud of myself ;-) ... until I started to run the test. The test occasionally printed out incorrect sums. I panicked at first because the test was simple and looked almost identical to other tests, leading me to believe, however briefly, that the emulator had a nasty bug that had gone unnoticed for a long time. But after a closer look, I realized the test had a flaw. Can you see what I did wrong?

    #define PREFIX_XACQUIRE ".byte 0xF2; "
    #define PREFIX_XRELEASE ".byte 0xF3; "
 
    class mutex_elided {
      uint8_t flag;
      inline bool try_lock_elided() {
        uint8_t value = 1;
        __asm__ volatile (PREFIX_XACQUIRE "lock; xchgb %0, %1" // byte-wide exchange to match the uint8_t flag
                : "=q"(value),"=m"(flag):"0"(value),"m"(flag):"memory" );
        return uint8_t(value^1);
      }
    public:
      inline void acquire() {
        for(;;) {
            exponential_backoff backoff;
            while((volatile unsigned char&)flag==1)
                backoff.pause();
            if(try_lock_elided())
                return;
            __asm__ volatile ("pause\n" : : : "memory" );
        }
      }
 
      inline void release() {
        __asm__ volatile (PREFIX_XRELEASE "movb $0, %0" // byte-wide store to match the uint8_t flag
               : "=m"(flag) : "m"(flag) : "memory" );
      }
    };
    ...
 
 
      mutex_elided m;
#pragma omp parallel for
    for(int i = 0; i < N; i++) {
        int mygroup = group[i];
        if( (i&1) ) {
            while(_xbegin()!=-1) ;
            // must have a fallback path
            sums[mygroup] += 1;
            _xend();
        } else {
            m.acquire();
            sums[mygroup] += 1;
            m.release();
        }
    }

Example 3.

A colleague of mine tried to use RTM to improve performance of a benchmark. (I changed function names for clarity.) The following fragment of the benchmark permutes an array of IDs by, for each ID, swapping its value with that of a randomly picked partner. In the fallback path, elements i and j are exclusively acquired in the increasing order of their indices, and then written back in the reverse order. He was running it on the emulator and came back to me with an occasional hang problem. Can you come up with a sequence of events that leads to an indefinite wait?

bool pause( volatile int64_t* l ) {
    __asm__ __volatile__( "pause\n" : : : "memory" );
    return true;
}
 
int64_t read_and_lock( volatile int64_t* loc ) {
    int64_t val;
    while(1) {
        while( pause( loc ) )
            if(  empty_val != (val = *loc) )
                    break;
        assert( val!=empty_val );
        if ( __sync_bool_compare_and_swap( loc, val, empty_val ) )
            break;
    }
    assert( val!=0 );
    return val;
}
 
void write_and_release( volatile int64_t* loc, int64_t val ) {
    while( pause( loc ) )
        if( __sync_bool_compare_and_swap( loc, empty_val, val ) )
            break;
    return;
}
 
...
#pragma omp parallel for num_threads(16)
    for (int i=0; i<n; i++) {
        int j = (int64_t) ( n * gen_rand() );
 
        if( _xbegin()==-1 ) {
            if(i != j) {
                const vid_t tmp_val = vid_values[i];
                vid_values[i] = vid_values[j];
                vid_values[j] = tmp_val;
            }
            _xend();
        } else {
            if (i < j) {
                const vid_t tmp_val_i = read_and_lock( &vid_values[i] );
                const vid_t tmp_val_j = read_and_lock( &vid_values[j] );
                write_and_release( &vid_values[j], tmp_val_i );
                write_and_release( &vid_values[i], tmp_val_j );
            } else if (j < i) {
                const vid_t tmp_val_j = read_and_lock( &vid_values[j] );
                const vid_t tmp_val_i = read_and_lock( &vid_values[i] );
                write_and_release( &vid_values[i], tmp_val_j );
                write_and_release( &vid_values[j], tmp_val_i );
            }
        }
    }

Analysis

Example 1.

The fallback path has a race with the code in the RTM path. For example, the following interleaving may happen. (Always keep in mind that one should not make any assumptions about the relative speeds of threads!)

Thread 1                           Thread 2
start critical section
read sums[mygroup]
                                   do transaction that updates sums[mygroup]
write sums[mygroup]

As a result, the example occasionally loses the increment done in the RTM transaction.

Example 2.

Don’t let the HLE transaction fool you. When an HLE transaction gets aborted, it re-executes and acquires the same mutex non-speculatively. The RTM transaction on the other path never looks at that mutex, so when this happens the case effectively becomes identical to Example 1.

Example 3.

Again, one should not make any assumptions about the relative speeds of concurrently executing threads. Even though the fallback path is race free on its own, it has a race with the code in the RTM path. For example, the following sequence of events may occur.

Thread 1 (fallback path)                Thread 2 (RTM path)
read_and_lock( vid_values[i] )
                                        do transaction that swaps vid_values[i] and
                                        vid_values[k] and makes vid_values[i] non-zero
read_and_lock( vid_values[j] )
write_and_release( vid_values[j] )
write_and_release( vid_values[i] ): waits forever for vid_values[i] to become 0
 

Possible Fixes

Now that we have a concrete diagnosis for each of the examples, the fixes are straightforward.

Example 1.

Replacing ‘omp critical’ with an atomic increment such as __sync_add_and_fetch would fix the problem. I.e.,

    __sync_add_and_fetch( &sums[mygroup], data[i] );

A more general solution is to use a mutex in the fallback path and add it to the read set of the RTM transaction, so that the transaction is forced to abort if the mutex is acquired by another thread.

mutex fallback_mutex;
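// 'mutex' is assumed to be a simple lock type that also exposes is_acquired(),
// a non-destructive read of its lock word (the type itself is not shown here).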
 
...
#pragma omp parallel for num_threads(8)
    for(int i = 0; i < N; i++){
        int mygroup = group[i];
        if(_xbegin()==-1) {
            if( !fallback_mutex.is_acquired() ) {
                sums[mygroup] += data[i];
            } else {
                _xabort(1);
            }
            _xend();
        } else {
            fallback_mutex.acquire();
            sums[mygroup] += data[i];
            fallback_mutex.release();
        }
    }

Example 2.

Similarly, we may extend mutex_elided to have the is_acquired() method. Since the lock variable is read inside the RTM transaction, any non-speculative execution of the HLE path which makes the change to the lock variable visible will abort the transaction.

    mutex_elided m;
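    // is_acquired() is assumed to be added to mutex_elided as a plain read of 'flag',
    // e.g. return (volatile uint8_t&)flag == 1; (not shown in the class above).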
#pragma omp parallel for num_threads(8)
    for(int i = 0; i < N; i++) {
        int mygroup = group[i];
        if( (i&1) ) {
            while(_xbegin()!=-1) // having no fallback path is
                ;                // UNSAFE
            if( !m.is_acquired() )
                sums[mygroup] += data[i];
            else
                _xabort(0);
            _xend();
        } else {
            m.acquire();
            sums[mygroup] += data[i];
            m.release();
        }
    }

Example 3.

We can also apply the mutex-based approach to this example. Another approach is to read the two ID values in the RTM transaction and check whether either of them contains ‘empty_val’. If so, we abort the transaction and force the thread to follow the fallback path.

#pragma omp parallel for num_threads(16)
    for (int i=0; i<n; i++) {
        int j = (int64_t) ( n * gen_rand() );
        if( _xbegin()==-1 ) {
            if(i != j) {
                const vid_t tmp_val_i = vid_values[i];
                const vid_t tmp_val_j = vid_values[j];
                if( tmp_val_i==0 || tmp_val_j==0 )
                    _xabort(0);
                vid_values[i] = tmp_val_j;
                vid_values[j] = tmp_val_i;
            }
            _xend();
        } else {
            if (i < j) {
                const vid_t tmp_val_i = read_and_lock( &vid_values[i] );
                const vid_t tmp_val_j = read_and_lock( &vid_values[j] );
                write_and_release( &vid_values[j], tmp_val_i );
                write_and_release( &vid_values[i], tmp_val_j );
            } else if (j < i) {
                const vid_t tmp_val_j = read_and_lock( &vid_values[j] );
                const vid_t tmp_val_i = read_and_lock( &vid_values[i] );
                write_and_release( &vid_values[i], tmp_val_j );
                write_and_release( &vid_values[j], tmp_val_i );
            }
        }
    }

Conclusions

So, what have I learned from these examples? As you may have already noticed, all of them are related to the ‘restricted’ part of RTM. Intel TSX has great potential for improving the performance of concurrent/parallel applications. But the synchronization between the speculative code inside the RTM transaction and the non-speculative fallback path needs to be carefully managed, since the interactions are subtle. I gather most programmers won’t need to worry too much about it, because higher-level abstractions in supporting libraries should hide most of the agonizing synchronization details. But for those who are willing to get their hands dirty to squeeze out the last drop of performance, it always pays to keep a watchful eye on the interactions between an RTM code path and its non-speculative fallback. (And we have many tools, such as Intel SDE, to assist you.)

Disclaimer: The opinion expressed in the blog is the author's own and reflects none of his employer's or his colleagues'.


    Analyzing Intel® SDE's TSX-related log data for capacity aborts


    Starting with version 7.12.0, Intel® SDE has Intel® TSX-related instruction and memory access logging features which can be useful for debugging Intel® TSX capacity aborts. With the log data from Intel SDE you can diagnose cache set population to determine whether non-uniform cache set usage is causing capacity overflows. The refined log data can then be used to track down the source of the aborts. Since the raw log file may be too large to navigate and diagnose directly, a simple Python script is presented here to help analyze Intel TSX-related log files and root-cause capacity aborts.

    TxSDELogAnalyzer.py is a simple program which focuses on capacity-aborted transactions. It uses SDE log data, which can be collected with the following set of parameters.

    >$ sde -tsx -hle_enabled 1 -rtm_mode full -tsx_debug_log 3 -tsx_log_inst 1 -tsx_log_file 1

    We will describe a few features of this simple script and why they can be useful for debugging transaction aborts due to capacity overflows. In a typical scenario, you may do the following to find the source of those aborts.

    (i)   Look at the TSX read/write set sizes and cache set usage distribution

    (ii)  Take a random sample of aborted transactions and closely examine them

    (iii) In conjunction with (ii), look at the log data of capacity-aborted transactions as well as of committed transactions and compare how they differ, hoping to spot a difference that leads to the source of the aborts.

    Cache set usage distribution

    You might need to see how the cache sets are populated in an aborted transaction to check for outliers or non-uniformity that could be causing premature overflows. You can then locate the data structure used in this transaction and change it so that it uses the available cache sets uniformly, thus avoiding capacity overflows (see the CIA allocator as a possible mitigation). Under normal circumstances you may need to compare cache access distributions between committed and aborted transactions to see how they differ, for example by plotting a histogram or any suitable graph of both cases.

    TxSDELogAnalyzer.py calculates the average cache set population among all transactions of the same type from the input log file. It outputs the final result in CSV format, making it easy to plot a graph in an Excel sheet, for example. The following command structure is used to obtain the average distribution of cache line accesses.

    >$ TxSDELogAnalyzer.py -D <type_num> [-o <output_file>] <input_log_file>

    Where type_num specifies the type of transaction data to generate the distribution from. type_num can be either 2 for capacity-aborted transactions or 8 for committed transactions. This command calculates the unique cache set accesses for each transaction of type type_num and then averages these cache access distributions.

    A sample output of this command for capacity-aborted transaction log data is given below.

    Average cache set population for capacity aborts
    
    Cache set #, 0, 1, 2, 3, 4, 5, 6, 7, 8, 9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63
    Avg. population, 5.00, 1.00, 0.00, 0.00, 1.00, 0.00, 1.00, 2.00, 3.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 1.00, 2.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 2.00, 2.00, 0.00, 2.00, 1.00, 0.00, 0.00, 2.00, 1.00, 0.00, 1.00, 2.00, 0.00, 1.00, 3.00, 0.00, 2.00, 2.00, 2.00, 2.00, 3.00, 2.00, 3.00, 2.00, 4.00, 1.00, 1.00, 0.00, 1.00, 1.00, 3.00

    Fig. 1: Average cache set population. A graphical representation of the sample output, showing how many cache lines are used in each of the 64 cache sets of a data cache.

    To interpret the graph in Fig. 1 we need to understand Intel® SDE's "-tsx_cache_set_size" parameter, whose default value is 8; a value of 4 was used to generate the sample output above.

    Intel's L1 data cache is 8-way associative, which means every set of the data cache has 8 ways. The two logical cores share the L1 cache dynamically when Intel® Hyper-Threading Technology is enabled. To approximate this behavior we can statically limit the number of ways per thread to 4 using Intel SDE (this assumes that the usage of ways is equal).

    If "-tsx_cache_set_size" is set to its default value of 8, hyper-threading is not "emulated": all the ways in a cache set are used by the (only) hardware thread in a core. To emulate Hyper-Threading, at least at the data cache level, we set "-tsx_cache_set_size" to 4, so that only half of the ways in each data cache set can be used. With this Hyper-Threading emulation, a maximum of 4 ways per cache set may be uniquely modified during a TSX transaction; otherwise a capacity overflow occurs and aborts the transaction. From Fig. 1 we see that cache set #0 overflows (5 unique way accesses instead of the maximum of 4), and this could lead to a capacity abort.
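
    As a quick illustration of how this check can be automated, a minimal sketch follows. The script name, the input layout (only the two CSV rows shown above, saved to a file of their own), and the way limit are assumptions; it is not part of TxSDELogAnalyzer.py.

    # flag_overflow_sets.py -- hypothetical helper, not part of TxSDELogAnalyzer.py
    import csv
    import sys

    WAYS_PER_THREAD = 4  # matches the -tsx_cache_set_size 4 setting used for the sample output

    def overflowing_sets(csv_path):
        # Expects row 0 = "Cache set #, 0, 1, ..." and row 1 = "Avg. population, 5.00, ..."
        with open(csv_path) as f:
            rows = list(csv.reader(f))
        sets = [int(x) for x in rows[0][1:]]
        pops = [float(x) for x in rows[1][1:]]
        # Any set whose average population exceeds the per-thread way limit is a capacity-overflow suspect
        return [(s, p) for s, p in zip(sets, pops) if p > WAYS_PER_THREAD]

    if __name__ == "__main__":
        for set_idx, pop in overflowing_sets(sys.argv[1]):
            print("cache set %d: avg. population %.2f exceeds %d ways" % (set_idx, pop, WAYS_PER_THREAD))

    Run against the sample data above, it reports cache set #0 (average population 5.00 with a 4-way limit), matching the overflow discussed for Fig. 1.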

    Transactions data filtering

    Intel® SDE already has knobs to start/stop logging, thus limiting the log output size. This is normally done by adding the following command options to the SDE command you use to run your test:

    -control start:interactive:bcast,stop:interactive:bcast -interactive_file tmp

    However, you may sometimes need log data with longer execution coverage, which produces a huge log file. Limiting the log output size may therefore not be enough, and you may need filtering options.

    Moreover, your log data may contain data for various abort reasons as well as for commits. Even log information about thread operations outside transaction execution may be included in the SDE log, so it may be hard to analyze a huge log file with such a mixture of information, especially when you are interested in logs of only a certain type -- e.g., capacity-aborted transactions. TxSDELogAnalyzer.py filters the log data and presents only the data for a specific abort reason, or for commits. The command format to do exactly this is shown below.

    >$ TxSDELogAnalyzer.py -t <type_num> [-o <output_file>] <input_log_file>

    type_num can be 1 (all log data of both committed and aborted transactions), 2 (capacity-aborted transactions only), or 8 (committed transactions only).

    # The command below outputs only capacity-abort related transaction logs
    >$ TxSDELogAnalyzer.py -t 2 sde-tsx-out.txt
    
    # The following command line writes only commit-related transaction logs to a file named log_data_for_commits_data.txt
    >$ TxSDELogAnalyzer.py -t 8 -o log_data_for_commits_data.txt sde-tsx-out.txt

    Sampling transactions

    Since you cannot analyze millions of transactions to identify code path patterns in a limited time, you may want to extract the log data of a few typical transactions. TxSDELogAnalyzer.py does exactly that: it randomly selects num transactions. It supplements these data with cache set accesses, giving a full overview of the cache set population at each operation within a given transaction. This feature works in conjunction with the "Transactions data filtering" described above.

    An example of commands for this feature is as follows.

    >$ TxSDELogAnalyzer.py -t 8 -r <num> <input_log_file> # for commits, num random commits
    >$ TxSDELogAnalyzer.py -t 2 -r <num> <input_log_file> # for capacity aborts, num random capacity aborts

    A sample output from these commands is shown below:

    Transaction #0
                      OPERATION                      ; CACHE LINE ADDRESS; CACHE SET #; NEW CACHE LINE; CACHE SET POPULATION;    FUNCTION              ;  MANGLED FUNCTION         ; LIBRARY                 ; SOURCE
    [ 1 1] @0x400b66 write access to 0x7f7bc2a39e7c:4;     0x7f7bc2a39e40;          57;           TRUE;                    1; tm_begin()               ; _Z8tm_beginv              ; /ABank/aBank:0x000000bee; /ABank/hle_lock.h 45
    [ 1 1] @0x400b69 read access to 0x7f7bc2a39e7c:4 ;     0x7f7bc2a39e40;          57;          FALSE;                    1; tm_begin()               ; _Z8tm_beginv              ; /ABank/aBank:0x000000bee; /ABank/hle_lock.h 45
    [ 1 1] @0x400c29 read access to 0x7f7bc2a39eac:4 ;     0x7f7bc2a39e80;          58;          FALSE;                    1; paySalaries(void*)       ; _Z11paySalariesPv         ; /ABank/aBank:0x000000c29; /ABank/SalaryBatch.cpp 78
    [ 1 1] @0x400a36 write access to 0x7f7bc2a39e80:8;     0x7f7bc2a39e80;          58;          FALSE;                    1; myBank::paySalary(int)   ; _ZN6myBank9paySalaryEi    ; /ABank/aBank:0x000000a36; /ABank/SalaryBatch.cpp 62
    [ 1 1] @0x4008ca read access to 0x7f7bc2a39e1c:4 ;     0x7f7bc2a39e00;          56;          FALSE;                    1; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    . . .
    [ 1 1] @0x4009b0 write access to 0x605dac:4      ;           0x605d80;          54;          FALSE;                    4; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x4009b7 read access to 0x605db0:4       ;           0x605d80;          54;          FALSE;                    4; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x400a4b read access to 0x7f7bc2a39e7c:4 ;     0x7f7bc2a39e40;          57;          FALSE;                    4; myBank::paySalary(int)   ; _ZN6myBank9paySalaryEi    ; /ABank/aBank:0x000000a53; /ABank/SalaryBatch.cpp 65
    [ 1 1] @0x400837 write access to 0x7f7bc2a39e04:4;     0x7f7bc2a39e00;          56;          FALSE;                    4; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x400a57 read access to 0x7f7bc2a39e7c:4 ;     0x7f7bc2a39e40;          57;          FALSE;                    4; myBank::paySalary(int)   ; _ZN6myBank9paySalaryEi    ; /ABank/aBank:0x000000a53; /ABank/SalaryBatch.cpp 65
    [ 1 1] @0x400a4e write access to 0x7f7bc2a39e60:8;     0x7f7bc2a39e40;          57;          FALSE;                    4; myBank::paySalary(int)   ; _ZN6myBank9paySalaryEi    ; /ABank/aBank:0x000000a53; /ABank/SalaryBatch.cpp 65
    [ 1 1] @0x40098f write access to 0x7f7bc2a39e48:8;     0x7f7bc2a39e40;          57;          FALSE;                    4; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x400993 read access to 0x7f7bc2a39e28:8 ;     0x7f7bc2a39e00;          56;          FALSE;                    4; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x400997 read access to 0x605dd0:4       ;           0x605dc0;          55;          FALSE;                    4; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x400a35 read access to 0x7f7bc2a39e60:8 ;     0x7f7bc2a39e40;          57;          FALSE;                    4; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x400a53 write access to 0x7f7bc2a39e7c:4;     0x7f7bc2a39e40;          57;          FALSE;                    4; myBank::paySalary(int)   ; _ZN6myBank9paySalaryEi    ; /ABank/aBank:0x000000a53; /ABank/SalaryBatch.cpp 65
    [ 1 1] @0x400a57 read access to 0x7f7bc2a39e7c:4 ;     0x7f7bc2a39e40;          57;          FALSE;                    4; myBank::paySalary(int)   ; _ZN6myBank9paySalaryEi    ; /ABank/aBank:0x000000a53; /ABank/SalaryBatch.cpp 65
    [ 1 1] @0x400a4b read access to 0x7f7bc2a39e7c:4 ;     0x7f7bc2a39e40;          57;          FALSE;                    4; myBank::paySalary(int)   ; _ZN6myBank9paySalaryEi    ; /ABank/aBank:0x000000a53; /ABank/SalaryBatch.cpp 65
    [ 1 1] @0x400980 read access to 0x7f7bc2a39e24:4 ;     0x7f7bc2a39e00;          56;          FALSE;                    4; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x40098f write access to 0x7f7bc2a39e48:8;     0x7f7bc2a39e40;          57;          FALSE;                    4; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x400993 read access to 0x7f7bc2a39e28:8 ;     0x7f7bc2a39e00;          56;          FALSE;                    4; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x400997 read access to 0x605dd8:4       ;           0x605dc0;          55;          FALSE;                    4; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x40099c read access to 0x7f7bc2a39e28:8 ;     0x7f7bc2a39e00;          56;          FALSE;                    4; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x400a22 read access to 0x7f7bc2a39e48:8 ;     0x7f7bc2a39e40;          57;          FALSE;                    4; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x400a26 read access to 0x605dfc:4       ;           0x605dc0;          55;          FALSE;                    4; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x400a2c read access to 0x7f7bc2a39e48:8 ;     0x7f7bc2a39e40;          57;          FALSE;                    4; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x400a30 write access to 0x605dfc:4      ;           0x605dc0;          55;          FALSE;                    4; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x400a33 read access to 0x7f7bc2a39e50:8 ;     0x7f7bc2a39e40;          57;          FALSE;                    4; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x400a34 read access to 0x7f7bc2a39e58:8 ;     0x7f7bc2a39e40;          57;          FALSE;                    4; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x400a35 read access to 0x7f7bc2a39e60:8 ;     0x7f7bc2a39e40;          57;          FALSE;                    4; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x400a53 write access to 0x7f7bc2a39e7c:4;     0x7f7bc2a39e40;          57;          FALSE;                    4; myBank::paySalary(int)   ; _ZN6myBank9paySalaryEi    ; /ABank/aBank:0x000000a53; /ABank/SalaryBatch.cpp 65
    [ 1 1] @0x400a4e write access to 0x7f7bc2a39e60:8;     0x7f7bc2a39e40;          57;          FALSE;                    4; myBank::paySalary(int)   ; _ZN6myBank9paySalaryEi    ; /ABank/aBank:0x000000a53; /ABank/SalaryBatch.cpp 65
    [ 1 1] @0x40082c write access to 0x7f7bc2a39e58:8;     0x7f7bc2a39e40;          57;          FALSE;                    4; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x400a06 read access to 0x605dfc:4       ;           0x605dc0;          55;          FALSE;                    4; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x400a0c read access to 0x7f7bc2a39e40:8 ;     0x7f7bc2a39e40;          57;          FALSE;                    4; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x400a10 write access to 0x605dfc:4      ;           0x605dc0;          55;          FALSE;                    4; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x400a13 read access to 0x7f7bc2a39e48:8 ;     0x7f7bc2a39e40;          57;          FALSE;                    4; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x400a17 read access to 0x605e00:4       ;           0x605e00;          56;           TRUE;                    5; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] @0x400a17 read access to 0x605e00:4       ;           0x605e00;          56;           TRUE;                    5; myBank::addToBalance(int); _ZN6myBank12addToBalanceEi; /ABank/aBank:0x00000082c; /ABank/SalaryBatch.cpp 30
    [ 1 1] self abort transaction(1) abort reason 9
                                                     ;  TOTAL FOOTPRINT: 250 cache lines;
                                                     ;  TOTAL WRITE SET: 249 cache lines;
    
    
     

     

     


    Attachment: TxSDELogAnalyzer.zip (5.05 KB)
    Co-author: Roman Dementiev (Intel)

    Calculating “FLOP” using Intel® Software Development Emulator (Intel® SDE)


    Purpose

    The floating point operations (FLOP) rate is widely used by the High Performance Computing (HPC) community as a metric for analysis and/or benchmarking purposes. Many HPC nominations (e.g., Gordon Bell) require that the FLOP rate be specified for their application submissions.

    The methodology described here DOES NOT rely on the Performance Monitoring Unit (PMU) events/counters. This is an alternative software methodology to evaluate FLOP using the Intel® SDE.

    Methodology

    • We split the FLOP (floating point operations) count into two categories:
      • Unmasked FLOP: for Intel® architectures that do not support the masking feature
      • Unmasked + Masked FLOP: for Intel® architectures that do support the masking feature
        Examples of Intel® architectures that do not support the masking feature:
        Processor Name                        | Code Name
        2nd gen Intel® Core™ processor family | Sandy Bridge
        3rd gen Intel® Core™ processor family | Ivy Bridge
        4th gen Intel® Core™ processor family | Haswell
        Examples of Intel® architectures that do support the masking feature:
        Processor Name                        | Code Name
        Intel® Xeon Phi™ coprocessor          | Knights Landing
    • There is some debate on what is considered to be a floating point instruction/operation.

    • Provided below is the list of general floating point instructions used in this method: ADD, SUB, MUL, DIV, SQRT, RCP, FMA, FMS, DPP, MAX, MIN (each has many flavors)

    • The high level idea is:
      • Decode every floating point instruction to identify the following:
        • Vector (packed) vs. Scalar
        • Data Type (Single Precision vs. Double Precision)
        • Register Type Used (xmm – 128 bits, ymm – 256 bits, zmm – 512 bits)
        • Masking – masked vs. unmasked instruction
      • Use the above information with its “dynamic execution” count to evaluate the FLOP count for that instruction.
        Example: vfmadd231pd zmm0, zmm30, zmm1 executed 500 times
        • p – packed instruction (vector), without any mask
        • d – double precision data type (64 bit)
        • zmm – operating on 512 bit registers
        • fma – fused multiply and add (2 floating point operations)

      • The FLOP count for the above instruction = 8 * 2 (fma) * 500 (execution count) = 8000 FLOP (a minimal sketch of this arithmetic appears right after this list).
    • You do not need to parse/decode all of the above for every floating point instruction to evaluate the FLOP count for your application.

    • Intel SDE’s instruction mix histogram and dynamic mask profile provide a set of pre-evaluated counters (using the methodology described above + more) that can be used to evaluate the FLOP count on your application.

    The next section describes this in detail.
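
    As a quick illustration of the arithmetic described above, here is a minimal sketch. The function and dictionary are made up for illustration; this is not an Intel SDE API.

    # Hypothetical illustration of the per-instruction FLOP arithmetic; not an Intel SDE API.
    ELEMENTS = {("xmm", "single"): 4, ("xmm", "double"): 2,
                ("ymm", "single"): 8, ("ymm", "double"): 4,
                ("zmm", "single"): 16, ("zmm", "double"): 8}

    def flop_count(register, precision, is_fma, exec_count):
        # FLOP = elements per register * (2 for FMA/FMS, else 1) * dynamic execution count
        ops_per_element = 2 if is_fma else 1
        return ELEMENTS[(register, precision)] * ops_per_element * exec_count

    # vfmadd231pd zmm0, zmm30, zmm1 executed 500 times:
    # 8 double precision elements * 2 (fma) * 500 = 8000 FLOP
    print(flop_count("zmm", "double", is_fma=True, exec_count=500))  # prints 8000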
     

    Instructions to Count Unmasked FLOP

    • This is applicable for all Intel architectures (Sandy Bridge, Ivy Bridge, Haswell, Knights Landing, etc.)
    • Obtain the latest version of Intel SDE here.
    • Generate the instruction mix histogram for your application using Intel SDE as follows:
      • sde -<arch> -iform 1 -omix myapp_mix.out -top_blocks 5000 -- ./myapp.exe
        • <arch> is the architecture that you want to run on (e.g., ivb, hsw, knl)
        • Compile the binary correctly for the architecture you are running on.
        • Supports multi-threaded runs
          Example:
          sde -knl -iform 1 -omix myapp_knl_mix.out -top_blocks 5000 -- ./myapp.knl.exe
    • In the instruction mix output (e.g., myapp_mix.out), under the “EMIT_GLOBAL_DYNAMIC_STATS” section, check for the following pre-evaluated counters:
      1. *elements_fp_(single/double)_(1/2/4/8/16)
      2. *elements_fp_(single/double)_(8/16)_masked
    • The different counters mean the following:
      • elements_fp_single_1 – floating point instructions with single precision, one element (probably scalar), and no mask
      • elements_fp_double_4 – floating point instructions with double precision, four elements, and no mask (ymm)
      • elements_fp_double_8 – floating point instructions with double precision, eight elements, and no mask (zmm)
      • ...
      • elements_fp_single_16_masked – similar to the above, but with masks
        (Note: you will see the mask counts only on architectures + ISA that support masking)
    • The above by itself is not sufficient since the Fused Multiply and Add instruction (FMA) is counted as 1 FLOP by the above counters.

    • The “EMIT_GLOBAL_DYNAMIC_STATS” section also prints dynamic counts of every type/flavor of FMA executed in your application. Look for the following:
      • VFMADD213SD_XMMdq_XMMdq_XMMdq – scalar, double precision, on xmm (128 bit) = 1 element
      • VFMADD231PD_YMMdq_YMMdq_YMMdq – packed, double precision, on ymm (256 bit) = 4 elements
      • VFMADD132PS_ZMMdq_ZMMdq_ZMMdq – packed, single precision, on zmm (512 bit) = 16 elements
      • ...
      Other flavors of FMA like VFNMSUB132PD_YMMqq_YMMqq_MEMqq, VFNMADD231SD_XMMdq_XMMq_XMM, etc. may also be present.

    • Counting FLOPs

      Step 1
      • For each data type (single/double), use the “dynamic” instruction count corresponding to each of the above counters and multiply it by the number of elements (1/2/4/8/16) to get the FLOP count.

        Example:

        Intel SDE (Haswell) Instruction Mix output (snapshot) of a Molecular Dynamics code from Sandia Mantevo Suite (look for the below section under EMIT_GLOBAL_DYNAMIC_STATS).



        Unmasked FLOP (Double Precision) =
        (23513724690 * 1 + 274320019 * 2 + 37317021308 * 4) = ~173.3304 GFLOP

      • Note/Caveats:
        • The above by itself is not sufficient since the Fused Multiply and Add instruction (FMA) is counted as 1 FLOP (see “Step 2” on how to take that into account).
        • For Intel® AVX-512 (KNL) instruction mix output you may see “*elements_fp*_masked” counters as well. Counting masked FLOP is covered in the next section.
        • Also, the “masked” counters above do not specify the actual mask values, so they cannot be taken into account here anyway.
    •   Step 2

      • Taking into account FMA and its flavors
      • For each FMA flavor, determine the data type (single vs. double), packed vs. scalar, and register type as described above; then, using the “dynamic” instruction count corresponding to each FMA, compute the corresponding FLOP and add it “just one more time” to the FLOP count computed in Step 1.

        Example:

        Intel SDE (Haswell) Instruction Mix output (snapshot) of a Molecular Dynamics code from Sandia Mantevo Suite (look for the VFM* section under EMIT_GLOBAL_DYNAMIC_STATS).
         
        VFMADD213PD_XMMdq_XMMdq_XMMdq                               1728000
        VFMADD213PD_YMMqq_YMMqq_YMMqq                              47496488
        VFMADD213SD_XMMdq_XMMq_MEMq                               825422220
        VFMADD213SD_XMMdq_XMMq_XMMq                              5733116808
        VFMADD231PD_XMMdq_XMMdq_XMMdq                                432000
        VFMADD231PD_YMMqq_YMMqq_YMMqq                            3189961568
        VFMADD231SD_XMMdq_XMMq_MEMq                                       4
        VFMADD231SD_XMMdq_XMMq_XMMq                               475482133
        VFMSUB213PD_YMMqq_YMMqq_YMMqq                            1594141168
        VFMSUB231PD_YMMqq_YMMqq_YMMqq                              47064488
        VFMSUB231SD_XMMdq_XMMq_XMMq                                 1656723

         

        Unmasked FMA FLOP (Double Precision) =
        (1728000 * 2 + 47496488 * 4 + 825422220 * 1 + 5733116808 * 1 + 432000 * 2 + 3189961568 * 4 + 4 * 1 + 475482133 * 1 + 1594141168 * 4 + 47064488 * 4 + 1656723 * 1) = ~26.5546 GFLOP

        Note/Caveats:

        • The multiplier used above (1,2,4..) is based on the type of FMA instructions (PD - packed double on XMM/YMM ; SD – scalar double on XMM …)
        • For Intel AVX-512 (KNL/SKL) instruction mix output all FMA (and its flavors) instructions (masked or full vectors) will be marked as masked (e.g., VFMADD132PD_ZMMf64_MASKmskw_ZMMf64_MEMf64_AVX512).
        • The next section “Instructions to Count Masked FLOP” will cover that.
        Step 3

      • Add the FLOP counts from Step 1 and Step 2 (a minimal parsing sketch for Steps 1-3 appears at the end of this section).

        Example (for the Advection routine):
        Total Unmasked FLOP (Double Precision) = 173.3304 + 26.5546 = 199.885 GFLOP

      • If you are running on an architecture that does not support masking, then you have your total FLOP count (you can skip the next section).

      • For floating point operations per second (FLOPS), divide the FLOP count computed using the above method by the application run time measured on the appropriate hardware.



    • On another note, the FLOP count of an application will most likely be the same irrespective of the architecture it is run on (unless the compiler generates completely different code that affects the FLOP count of the two binaries, which is very rare). Thus, to find the FLOP count for an application, compute it as described above on Ivy Bridge (or Haswell) with no hardware masking feature and use the same count for other architectures (like Knights Landing, etc.). This way you do not have to deal with masking at all while evaluating the FLOP count.

    • But if you still need to evaluate the FLOP count on architecture with masking support, refer to the next section, which describes how to count masked FLOP using the dynamic mask profile feature from Intel SDE.
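
    If you want to automate Steps 1-3 before the official scripts mentioned in the Validation section become available, here is a minimal sketch. It assumes that every counter in the mix output appears as a name followed by its dynamic count on a single line (as in the VFM* listing above), and it handles only the unmasked, double-precision, XMM/YMM case from the Haswell example; treat it as an illustration, not as a supported Intel SDE tool.

    # count_unmasked_flop.py -- hypothetical sketch, not part of Intel SDE
    import re
    import sys

    def unmasked_double_flop(mix_path):
        elements_flop = 0   # Step 1: *elements_fp_double_<N> counters (FMA counted once)
        fma_flop = 0        # Step 2: add each FMA's FLOP one more time
        elem_re = re.compile(r"\*?elements_fp_double_(\d+)\s+(\d+)")
        # PD/SD distinguishes packed vs. scalar double; XMM/YMM fixes the element count
        fma_re = re.compile(r"VF(?:N?M(?:ADD|SUB))\d+(PD|SD)_(XMM|YMM)\w*\s+(\d+)")
        fma_elements = {("PD", "XMM"): 2, ("PD", "YMM"): 4, ("SD", "XMM"): 1, ("SD", "YMM"): 1}
        with open(mix_path) as f:
            for line in f:
                m = elem_re.search(line)
                if m and "masked" not in line:
                    elements_flop += int(m.group(1)) * int(m.group(2))
                    continue
                m = fma_re.search(line)
                if m:
                    fma_flop += fma_elements[(m.group(1), m.group(2))] * int(m.group(3))
        return elements_flop + fma_flop   # Step 3

    if __name__ == "__main__":
        print("Unmasked double precision FLOP: %d" % unmasked_double_flop(sys.argv[1]))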

    Instructions to Count Masked FLOP

    • Intel SDE has a dynamic mask profile feature that evaluates and prints the number of operations for each executed instruction with a mask.
    • Generate the dynamic mask profile for your application using Intel SDE as follows:
       
      • sde -<arch> -iform 1 -odyn_mask_profile myapp_msk.out -top_blocks 5000 -- ./myapp.exe
         
        • <arch> is the architecture that you want to run on (e.g., ivb, hsw, knl).
        • Compile the binary correctly for the architecture you are running on.
        • Supports multi-threaded runs.
          Example:
          sde -knl -iform 1 -odyn_mask_profile myapp_knl_msk.out -top_blocks 5000 -- ./myapp.knl.exe
    • The dynamic mask profile is an XML output, with a summary table per thread of the different categories of instructions with and without masking and their total instruction and operation count.

    • In addition, the mask profile also prints the dynamic instruction count and operation count per instruction.

    • Summary Table (Dynamic Mask Profile)

      Example: Intel® SDE (Knights Landing) dynamic mask profile output (snapshot below):

      Column  | Header     | Description
      First   | mask       | Classifies masked vs. unmasked instructions
      Second  | cat        | Classifies the instruction category (e.g., memory instructions (data transfer), sparse (gather/scatter), and computational (mask))
      Third   | vec-length | Specifies the vector register width
      Fourth  | #elements  | Specifies the maximum number of elements possible in the vector register (with the vec-length in the third column) and with the element size (specified in the fifth column)
      Fifth   | element_s  | Specifies the size of an element (or data type) in bits (e.g., 64b = 64 bits = 8 bytes)
      Sixth   | element_t  | Classifies based on element type (e.g., fp – floating point vs. int – integer)
      Seventh | icount     | Total instruction count of each category/type
      Eighth  | comp_count | Corresponding computation count for the executed instructions of each category/type
      Ninth   | %max-comp  | Shows the % vector lane utilization for each category/type
      • For example, in the above snapshot only the highlighted rows have to be used for the “masked” FLOP count.
        • Please note that (in your run) you mainly need to look for “masked” instructions with the “mask” category and “element_t = fp” for the masked FLOP count.
      • The “comp_count” number is basically the masked FLOP count.
        • But again FMA is counted as only one FLOP in the comp_count counter.
        • See the next section on how to take into account masked FMA (to count them as 2 FLOP).
    • Per Instruction Details (Dynamic Mask Profile)
      • In addition to the “summary table” per thread, the dynamic mask profile also prints the computational count on a per instruction basis.
      • Below is a snapshot of it.
      • In this case, the masked “vfmadd213pd” instruction has an execution count = 862280 and the computation count = 5052521. Thus all the executions of this instruction are NOT using all the vector lanes in this case.
      • In the snapshot above, the “vfmadd213pd” instruction has an execution count = 4000 and the computation count = 32000 (4000 * 8). Thus all the executions of this instruction are using all the vector lanes in this case (no mask).
      • Since the summary table accounts for the FMA instructions (and its flavors) as only 1 FLOP, you have to add the computation count for all the masked FMA instructions from the instruction-details (as above) “one more time” to account for 2 FLOP per FMA.
    • Counting Masked FLOP

      Step 1

      • From the summary table add the “comp_count” value from all “masked” instructions with “mask” category and “element_t = fp”.
    •   Step 2

      • Parse all the FMA instructions with a mask from the per-instruction details and add their “computation-counts” one more time to the sum evaluated in Step 1 (a short illustration appears at the end of this section).


      • Thus you have the total Masked FLOP count.

        Note/Caveats:
        • As mentioned in the previous section, in Intel AVX-512 (KNL/SKL) instruction mix output all FMA (and its flavors) instructions (masked or full vectors) are marked as masked (e.g. VFMADD132PD_ZMMf64_MASKmskw_ZMMf64_MEMf64_AVX512).
        • Thus you can use the dynamic mask profile “instruction-details” to evaluate the “computation-count” for all FMA instructions (masked or unmasked – full vectors).
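
      As with the unmasked case, the bookkeeping is simple to script once the numbers have been pulled out of the report. The sketch below is purely illustrative: the lists are assumed to have been extracted from the summary table and the per-instruction details by hand (the XML layout itself is not parsed here), and the numbers reuse the masked vfmadd213pd example above.

      # Hypothetical illustration of the masked-FLOP bookkeeping; inputs extracted by hand.
      # Summary-table rows as (mask, category, element type, comp_count) tuples.
      summary_rows = [
          ("masked", "mask", "fp", 5052521),   # e.g. the masked vfmadd213pd row shown above
      ]
      # comp_count of every masked FMA instruction, taken from the per-instruction details.
      masked_fma_comp_counts = [5052521]

      # Step 1: sum comp_count over all "masked" rows of category "mask" with element_t = fp
      step1 = sum(c for mask, cat, elem, c in summary_rows
                  if mask == "masked" and cat == "mask" and elem == "fp")
      # Step 2: add the FMA computation counts one more time so that each FMA counts as 2 FLOP
      masked_flop = step1 + sum(masked_fma_comp_counts)
      print(masked_flop)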

    Validation

    The above methodology may look a bit overwhelming at first, but the reason for such detailed instructions is so that you can write your own simple scripts to parse the above information. We hope to provide the scripts (currently used internally) to evaluate FLOP count as part of the Intel SDE releases in the future.

    Below is a summary of the FLOP count validation on some applications.

    • The error margin is the ratio of the FLOP count evaluated using Intel® SDE to the reference count.
    • The reason for the difference can be due to reasons like theoretical evaluation vs. actual code generation, instructions counted as FLOP, etc. We have not looked into the details for this difference.
    • But you can see the error margin is very minimal.
    Workload  | Reference FLOP (from NERSC) | MPI ranks | FLOP Count (using Intel® SDE) | Error Margin
    MiniFE    | 5.05435E+12                 | 144       | 5.18039E+12                   | 1.03
    miniGhost | 6.55500E+12                 | 96        | 6.85624E+12                   | 1.05
    AMG       | 1.30418E+12                 | 96        | 1.43311E+12                   | 1.10
    UMT       | 1.30211E+13                 | 96        | 1.38806E+13                   | 1.07

     

    Footnotes:

    Masking: Even on Intel® AVX/AVX2 (Ivy Bridge/Haswell) the compiler supports "masking" internally with blends and so forth. Thus, in vectorized loops with conditionals there will be unused computations (e.g., the compiler computes both the true and false branches and then blends them, throwing away the unused parts). This means that the FLOP count will be an overestimate of the useful computation. Arguably the masked version (KNL/SKL) will be more accurate since the pop count of the mask is exact (assuming the compiler uses masks everywhere).

  • Calculating “FLOP” Using the Intel® Software Development Emulator (Intel® SDE)

    $
    0
    0

    Purpose

    Floating point operation (FLOP) rates are widely used in the high performance computing (HPC) community for analysis and/or benchmarking purposes. Many HPC submissions (for example, for the Gordon Bell Prize) require the FLOP rate of the application to be reported.

    The methodology described in this article does not rely on performance monitoring unit (PMU) events/counters. It is an alternative, software-based approach that uses Intel® SDE to evaluate FLOP.

    Methodology

    • We divide FLOP (floating point operations) into two categories:
      • Unmasked FLOP: for Intel® architectures that do not support masking
      • Unmasked + masked FLOP: for Intel® architectures that support masking

        Examples of Intel® architectures without masking support:
        Processor name                                      Codename
        2nd generation Intel® Core™ processor family        Sandy Bridge
        3rd generation Intel® Core™ processor family        Ivy Bridge
        4th generation Intel® Core™ processor family        Haswell

        Examples of Intel® architectures with masking support:
        Processor name                                      Codename
        Intel® Xeon Phi™ coprocessor                        Knights Landing
    • What exactly counts as a floating point instruction/operation is debatable.

    • The common floating point instructions considered by this methodology are: ADD, SUB, MUL, DIV, SQRT, RCP, FMA, FMS, DPP, MAX, MIN (each with multiple flavors).

    • We recommend:
      • Decoding each floating point instruction to determine the following attributes:
        • Vector (packed) or scalar
        • Data type (single or double precision)
        • Register type used (xmm – 128-bit, ymm – 256-bit, zmm – 512-bit)
        • Masked or unmasked instruction
      • Using the above information together with the “dynamic execution” count to evaluate the FLOP count for that instruction.
        For example: vfmadd231pd zmm0, zmm30, zmm1 executed 500 times
        • p – packed (vector) instruction, no mask
        • d – double precision data type (64-bit)
        • zmm – operates on 512-bit registers
        • fma – fused multiply-add (2 floating point operations)

        FLOP count for the above instruction = 8 (elements) * 2 (fma) * 500 (executions) = 8000 FLOP (see the sketch below).
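
        To make the rule above concrete, here is a tiny sketch in Python (a hypothetical helper, not part of Intel SDE) that applies it to a decoded instruction:

        # Hypothetical helper illustrating the rule above; not part of Intel SDE.
        def flop_count(reg_bits, element_bits, is_fma, executions, active_lanes=None):
            """FLOP = elements * (2 for FMA/FMS, else 1) * executions."""
            elements = active_lanes if active_lanes is not None else reg_bits // element_bits
            return elements * (2 if is_fma else 1) * executions

        # vfmadd231pd zmm0, zmm30, zmm1 executed 500 times:
        print(flop_count(reg_bits=512, element_bits=64, is_fma=True, executions=500))  # 8000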

    • You do not have to do this parsing/decoding of every floating point instruction yourself to evaluate an application's FLOP count.

    • Intel SDE's instruction mix histogram and dynamic mask profile provide a set of pre-evaluated counters (using the above methodology and more) that help you evaluate the application's FLOP count.

    This is described in detail below.
     

    Instructions for Counting Unmasked FLOP

    • This applies to all Intel architectures (Sandy Bridge, Ivy Bridge, Haswell, Knights Landing, etc.).
    • Get the latest version of Intel® SDE here.
    • Generate the instruction mix histogram for your application with Intel SDE, as shown below:
      • sde -<arch> -iform 1 -omix myapp_mix.out -top_blocks 5000 -- ./myapp.exe
        • <arch> is the architecture you want to run for (e.g., ivb, hsw, knl)
        • Compile the binary appropriately for the architecture you are running for.
        • Multi-threaded runs are supported.
          For example:
          sde -knl -iform 1 -omix myapp_knl_mix.out -top_blocks 5000 -- ./myapp.knl.exe
    • In the instruction mix output (e.g., myapp_mix.out), under “EMIT_GLOBAL_DYNAMIC_STATS”, look at the following pre-evaluated counters:
      1. *elements_fp_(single/double)_(1/2/4/8/16)
      2. *elements_fp_(single/double)_(8/16)_masked
    • The counters have the following meaning:
      elements_fp_single_1 – floating point instructions with single precision and one element (probably scalar) and no mask
      elements_fp_double_4 – floating point instructions with double precision and four elements and no mask (ymm)
      elements_fp_double_8 – floating point instructions with double precision and eight elements and no mask (zmm)
      ...
      elements_fp_single_16_masked – similar to the above but with masks
      (Note: you will see the mask counts only on architectures + ISA that support masking)
    • Because these counters count a fused multiply-add (FMA) instruction as only 1 FLOP, this alone is not sufficient.

    • The “EMIT_GLOBAL_DYNAMIC_STATS” section also prints the dynamic count of every FMA flavor executed in the application. For example:
                     VFMADD213SD_XMMdq_XMMdq_XMMdq
                                   scalar, double precision, on xmm (128 bit) = 1 element
                     VFMADD231PD_YMMdq_YMMdq_YMMdq
                                   packed, double precision, on ymm (256 bit) = 4 elements
                     VFMADD132PS_ZMMdq_ZMMdq_ZMMdq
                                   packed, single precision, on zmm (512 bit) = 16 elements

      ......
      Other similar FMA flavors, such as VFNMSUB132PD_YMMqq_YMMqq_MEMqq, VFNMADD231SD_XMMdq_XMMq_XMM, etc., may also appear.

    • Counting FLOP

      Step 1
      • For each data type (single/double precision), multiply the “dynamic” instruction count of each of the above counters by its number of elements (1/2/4/8/16) to get the FLOP count.

        Example:

        Intel SDE (Haswell) instruction mix output (snapshot) for a molecular dynamics code from the Sandia Mantevo suite (see under EMIT_GLOBAL_DYNAMIC_STATS).

        Unmasked FLOP (double precision) =
        (23513724690 * 1 + 274320019 * 2 + 37317021308 * 4) = ~173.3304 GFLOP

      • Notes/Caveats:
        • Because the above counters count an FMA instruction as only 1 FLOP, this calculation by itself is not sufficient (see Step 2 for how to account for FMA).
        • You will also see the “*elements_fp*_masked” counters in the Intel® AVX-512 (KNL) instruction mix output. Counting masked FLOP is covered in the next section.
        • Likewise, the “masked” counters cannot be included here, because they do not specify the actual mask values.
    • Step 2

      • Account for FMA and its flavors.
      • For each FMA flavor, based on its data type (single or double precision), packed or scalar form, and register type, and using its “dynamic” instruction count, add the corresponding FLOP “one more time” to the count from Step 1.

        Example:

        Intel SDE (Haswell) instruction mix output (snapshot) for the molecular dynamics code from the Sandia Mantevo suite (see the VFM* entries under EMIT_GLOBAL_DYNAMIC_STATS).
         
        VFMADD213PD_XMMdq_XMMdq_XMMdq                               1728000
        VFMADD213PD_YMMqq_YMMqq_YMMqq                              47496488
        VFMADD213SD_XMMdq_XMMq_MEMq                               825422220
        VFMADD213SD_XMMdq_XMMq_XMMq                              5733116808
        VFMADD231PD_XMMdq_XMMdq_XMMdq                                432000
        VFMADD231PD_YMMqq_YMMqq_YMMqq                            3189961568
        VFMADD231SD_XMMdq_XMMq_MEMq                                       4
        VFMADD231SD_XMMdq_XMMq_XMMq                               475482133
        VFMSUB213PD_YMMqq_YMMqq_YMMqq                            1594141168
        VFMSUB231PD_YMMqq_YMMqq_YMMqq                              47064488
        VFMSUB231SD_XMMdq_XMMq_XMMq                                 1656723

         

        Unmasked FMA FLOP (double precision) =
        (1728000 * 2 + 47496488 * 4 + 825422220 * 1 + 5733116808 * 1 + 432000 * 2 + 3189961568 * 4 + 4 * 1 + 475482133 * 1 + 1594141168 * 4 + 47064488 * 4 + 1656723 * 1) = ~26.5546 GFLOP

        Notes/Caveats:

        • The multipliers used above (1, 2, 4, ...) depend on the flavor of the FMA instruction (PD – packed double precision on XMM/YMM; SD – scalar double precision on XMM; ...).
        • In the Intel AVX-512 (KNL/SKL) instruction mix output, all FMA instructions (and flavors), masked or full-vector, are marked as masked (e.g., VFMADD132PD_ZMMf64_MASKmskw_ZMMf64_MEMf64_AVX512).
        • These are covered in the next section, “Instructions for Counting Masked FLOP”.
        Step 3

      • Add the FLOP counts from Step 1 and Step 2.

        Example (advection routine):
        Total unmasked FLOP (double precision) = 173.3304 + 26.5546 = 199.885 GFLOP

      • If you are running on an architecture that does not support masking, this is your total FLOP count (you can skip the next section).

      • To compute floating point operations per second (FLOPS), divide the FLOP count obtained above by the application runtime measured on the appropriate hardware. (A parsing sketch for the unmasked counters follows below.)
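
      As a starting point for such a script, below is a minimal sketch in Python for the unmasked count (Steps 1–3). The line formats it expects under EMIT_GLOBAL_DYNAMIC_STATS ("*elements_fp_double_4  <count>", "VFMADD231PD_YMMqq_...  <count>") are assumptions for illustration; verify them against the instruction mix file your SDE version produces and adjust the patterns accordingly.

      # Hypothetical sketch: unmasked FLOP from an Intel SDE instruction mix file (-omix output).
      # The line formats assumed here are illustrative only -- check the
      # EMIT_GLOBAL_DYNAMIC_STATS section of your SDE output and adjust the patterns.
      import re
      import sys

      ELEM_RE = re.compile(r"\*elements_fp_(?:single|double)_(\d+)\s+(\d+)\s*$")
      FMA_RE  = re.compile(r"^\s*(VF(?:N)?M(?:ADD|SUB)\d+[PS][SD]_\S+)\s+(\d+)\s*$")

      def fma_elements(iform):
          """Elements per FMA execution, derived from the iform name (e.g. ...PD_YMM... -> 4)."""
          m = re.match(r"VF\w*?([PS])([SD])_([XYZ])MM", iform)
          if not m or m.group(1) == "S":          # scalar FMA -> 1 element
              return 1
          width = {"X": 128, "Y": 256, "Z": 512}[m.group(3)]
          return width // (64 if m.group(2) == "D" else 32)

      def unmasked_flop(mix_path):
          step1 = 0   # Step 1: elements * dynamic count (FMA still counted as 1 FLOP)
          step2 = 0   # Step 2: FMA elements counted one more time (2 FLOP per FMA)
          with open(mix_path) as f:
              for line in f:
                  m = ELEM_RE.search(line)        # the masked counters do not match this pattern
                  if m:
                      step1 += int(m.group(1)) * int(m.group(2))
                      continue
                  m = FMA_RE.match(line)
                  if m and "MASK" not in m.group(1):   # AVX-512 masked FMA is handled via the mask profile
                      step2 += fma_elements(m.group(1)) * int(m.group(2))
          return step1 + step2

      if __name__ == "__main__":
          print(unmasked_flop(sys.argv[1] if len(sys.argv) > 1 else "myapp_mix.out"))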
    • In other words, an application's FLOP count will most likely stay the same, because it does not really depend on the architecture it runs on (unless the compiler generates substantially different code that changes the FLOP count of the two binaries, which is rare). So if you only need the application's FLOP count, compute it as above on Ivy Bridge (or Haswell), which have no hardware masking, and use the same number for other architectures (e.g., Knights Landing). In that case you do not have to care about masking at all when evaluating FLOP.

    • However, if you still want to evaluate the FLOP count on an architecture that supports masking, see the next section on how to count masked FLOP using Intel SDE's dynamic mask profile feature.

    Instructions for Counting Masked FLOP

    • Intel SDE has a dynamic mask profile feature that evaluates and prints the number of computations performed by instructions that operate under a mask.
    • Generate the dynamic mask profile for your application with Intel SDE as shown below:

      • sde -<arch> -iform 1 -odyn_mask_profile myapp_msk.out -top_blocks 5000 -- ./myapp.exe

        • <arch> is the architecture you want to run for (e.g., ivb, hsw, knl).
        • Compile the binary appropriately for the architecture you are running for.
        • Multi-threaded runs are supported.
          For example:
          sde -knl -iform 1 -odyn_mask_profile myapp_knl_msk.out -top_blocks 5000 -- ./myapp.knl.exe
    • The dynamic mask profile is XML output containing a per-thread summary table of the different instruction categories (masked or unmasked) with their total instruction and computation counts.

    • In addition, the mask profile prints the dynamic instruction and computation counts on a per-instruction basis.

    • Summary Table (Dynamic Mask Profile)

      Example: Intel® SDE (Knights Landing) dynamic mask profile output (snapshot below):

      Header        Description
      mask          Classifies masked vs. unmasked instructions
      cat           Distinguishes the instruction type (e.g., memory (data transfer), sparse (gather/scatter), or computational (mask) instructions)
      vec-length    The vector register width
      #elements     The maximum number of elements in the vector register (per the vec-length in column 3) for the given element size (column 5)
      element_s     The element (data type) size in bits (e.g., 64b = 64 bits = 8 bytes)
      element_t     Categorizes the element type (e.g., fp – floating point, int – integer)
      icount        Total instruction count for the category
      comp_count    The corresponding computation count of the executed instructions in the category
      %max-comp     The vector lane utilization for the category
      • For example, in the snapshot above only the two highlighted rows are needed to count the “masked” FLOP.
        • Note that to count masked FLOP you mainly look at the “masked” instructions with category “mask” and “element_t = fp” (from your run).
      • The number shown in “comp_count” is essentially the masked FLOP count.
        • But again, an FMA is counted as only 1 FLOP in the comp_count counter.
        • See the next section for how to take masked FMA into account (counting them as 2 FLOP each).
    • Per-Instruction Details (Dynamic Mask Profile)
      • In addition to the per-thread “summary table”, the dynamic mask profile also prints the computation count on a per-instruction basis.
      • A snapshot is shown below.
      • In this case, the masked “vfmadd213pd” instruction has an execution count of 862280 and a computation count of 5052521, so not all executions of this instruction use all the vector lanes.
      • In the snapshot above, the “vfmadd213pd” instruction has an execution count of 4000 and a computation count of 32000 (4000 * 8), so all executions of this instruction use all the vector lanes (no mask).
      • Since the summary table counts FMA instructions (and flavors) as only 1 FLOP, you have to add the computation count of all masked FMA instructions from the per-instruction details (as above) one more time to account for 2 FLOP per FMA.
    • Counting Masked FLOP

      Step 1

      • From the summary table, add up the “comp_count” values of all “masked” instructions with category “mask” and “element_t = fp”.

      Step 2

      • Parse all masked FMA instructions from the per-instruction details and add their computation counts to the sum from Step 1 one more time.

      • This gives you the total masked FLOP count.

        Notes/Caveats:
        • As mentioned in the previous section, in the Intel AVX-512 (KNL/SKL) instruction mix output all FMA instructions (and flavors), masked or full-vector, are marked as masked (e.g., VFMADD132PD_ZMMf64_MASKmskw_ZMMf64_MEMf64_AVX512).
        • Thus you can use the dynamic mask profile per-instruction details to evaluate the computation count of all FMA instructions (masked or unmasked – full vector).

    Validation

    The above methodology may look a bit overwhelming at first, but the instructions are this detailed so that you can write your own simple scripts to parse the information above. We hope to provide the scripts we currently use internally to evaluate FLOP counts as part of future Intel SDE releases.

    Below is a summary of the FLOP count validation on some applications.

    • The error margin is basically the discrepancy between the reference count and the FLOP count evaluated using Intel SDE.
    • The difference can stem from factors such as theoretical estimates vs. actual code generation, which instructions are counted as FLOP, etc. We have not investigated the difference in detail.
    • Still, you can see that the error margin is minimal.
    Workload     Reference FLOP    MPI ranks    FLOP Count            Error Margin
                 (from NERSC)                   (using Intel® SDE)
    MiniFE       5.05435E+12       144          5.18039E+12           1.03
    miniGhost    6.55500E+12       96           6.85624E+12           1.05
    AMG          1.30418E+12       96           1.43311E+12           1.10
    UMT          1.30211E+13       96           1.38806E+13           1.07

     

    Footnotes:

    Masking: Even on Intel® AVX/AVX2 (Ivy Bridge/Haswell) the compiler supports "masking" internally with blends and so forth. Thus vectorized loops with conditionals will contain unused computations (e.g., the compiler computes both the true and false branches, blends them, and throws away the unused parts). This means the FLOP count will overestimate the useful computation. Arguably the masked version (KNL/SKL) is more accurate, since the pop count of the mask is exact (assuming the compiler uses masks everywhere).


