20002021

Research activity per year

If you made any changes in Pure these will be visible here soon.
Filter
Conference contribution

Search results

  • 2020

    Efficient and scalable cross-ISA virtualization of hardware transactional memory

    Wang, W., Yew, P. C., Zhai, A. & McCamant, S., Feb 22 2020, CGO 2020 - Proceedings of the 18th ACM/IEEE International Symposium on Code Generation and Optimization. Mars, J., Tang, L., Xue, J. & Wu, P. (eds.). Association for Computing Machinery, Inc, p. 107-120 14 p. (CGO 2020 - Proceedings of the 18th ACM/IEEE International Symposium on Code Generation and Optimization).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Open Access
    7 Scopus citations
  • First Time Miss: Low Overhead Mitigation for Shared Memory Cache Side Channels

    Ramkrishnan, K., McCamant, S., Yew, P. C. & Zhai, A., Aug 17 2020, Proceedings of the 49th International Conference on Parallel Processing, ICPP 2020. Association for Computing Machinery, 3404434. (ACM International Conference Proceeding Series).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    1 Scopus citations
  • In-Network Memory Access Ordering for Heterogeneous Multicore Systems

    Yin, J. & Zhai, A., Sep 24 2020, 14th IEEE/ACM International Symposium on Networks-on-Chip, NOCS 2020. Institute of Electrical and Electronics Engineers Inc., 9241583. (14th IEEE/ACM International Symposium on Networks-on-Chip, NOCS 2020).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • 2019

    Unleashing the power of learning: An enhanced learning-based approach for dynamic binary translation

    Song, C., Wang, W., Yew, P. C., Zhai, A. & Zhang, W., 2019, Proceedings of the 2019 USENIX Annual Technical Conference, USENIX ATC 2019. USENIX Association, p. 77-89 13 p. (Proceedings of the 2019 USENIX Annual Technical Conference, USENIX ATC 2019).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    8 Scopus citations
  • 2018

    Enhancing cross-ISA DBT through automatically learned translation rules

    Wang, W., McCamant, S. A., Zhai, A. B. & Yew, P-C., Mar 19 2018, ASPLOS 2018 - 23rd International Conference on Architectural Support for Programming Languages and Operating Systems. 2 ed. Association for Computing Machinery, Vol. 53. p. 84-97 14 p. (ACM SIGPLAN Notices).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    10 Scopus citations
  • 2017

    Enabling cross-isa offloading for COTS binaries

    Wang, W., Yew, P. C., Zhai, A., McCamant, S., Wu, Y. & Bobba, J., Jun 16 2017, MobiSys 2017 - Proceedings of the 15th Annual International Conference on Mobile Systems, Applications, and Services. Association for Computing Machinery, Inc, p. 319-331 13 p. (MobiSys 2017 - Proceedings of the 15th Annual International Conference on Mobile Systems, Applications, and Services).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Open Access
    20 Scopus citations
  • 2016

    A general persistent code caching framework for dynamic binary translation (DBT)

    Wang, W., Yew, P. C., Zhai, A. & McCamant, S., 2016, Proceedings of the 2016 USENIX Annual Technical Conference, USENIX ATC 2016. USENIX Association, p. 591-603 13 p. (Proceedings of the 2016 USENIX Annual Technical Conference, USENIX ATC 2016).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    18 Scopus citations
  • 2014

    Energy-efficient time-division multiplexed hybrid-switched noc for heterogeneous multicore systems

    Yin, J., Zhou, P., Sapatnekar, S. S. & Zhai, A., 2014, Proceedings - IEEE 28th International Parallel and Distributed Processing Symposium, IPDPS 2014. IEEE Computer Society, p. 293-303 11 p. 6877264. (Proceedings of the International Parallel and Distributed Processing Symposium, IPDPS).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    29 Scopus citations
  • Multi-stage coordinated prefetching for present-day processors

    Mehta, S., Fang, Z., Zhai, A. & Yew, P. C., 2014, ICS 2014 - Proceedings of the 28th ACM International Conference on Supercomputing. Association for Computing Machinery, p. 73-82 10 p. (Proceedings of the International Conference on Supercomputing).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    19 Scopus citations
  • 2013

    Accelerating data race detection utilizing on-chip data-parallel cores

    Mekkat, V., Holey, A. & Zhai, A., 2013, Runtime Verification - 4th International Conference, RV 2013, Proceedings. p. 201-218 18 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 8174 LNCS).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    5 Scopus citations
  • HAccRG: Hardware-Accelerated data race detection in GPUs

    Holey, A., Mekkat, V. & Zhai, A. B., 2013, Proceedings: International Conference on Parallel Processing - The 42nd Annual Conference, ICPP 2013. Institute of Electrical and Electronics Engineers Inc., p. 60-69 10 p. 6687339. (Proceedings of the International Conference on Parallel Processing).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    12 Scopus citations
  • Managing shared last-level cache in a heterogeneous multicore processor

    Mekkat, V., Holey, A., Yew, P. C. & Zhai, A., Nov 18 2013, PACT 2013 - Proceedings of the 22nd International Conference on Parallel Architectures and Compilation Techniques. p. 225-234 10 p. 6618819. (Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    67 Scopus citations
  • Triggered instructions: A control paradigm for spatially-programmed architectures

    Parashar, A., Pellauer, M., Adler, M., Ahsan, B., Crago, N., Lustig, D., Pavlov, V., Zhai, A., Gambhir, M., Jaleel, A., Allmon, R., Rayess, R., Maresh, S. & Emer, J., 2013, ISCA 2013 - 40th Annual International Symposium on Computer Architecture, Conference Proceedings. p. 142-153 12 p. (Proceedings - International Symposium on Computer Architecture).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    64 Scopus citations
  • 2012

    Energy-efficient non-minimal path on-chip interconnection network for heterogeneous systems

    Yin, J., Zhou, P., Holey, A., Sapatnekar, S. S. & Zhai, A., 2012, ISLPED'12 - Proceedings of the International Symposium on Low Power Electronics and Design. p. 57-62 6 p. (Proceedings of the International Symposium on Low Power Electronics and Design).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    17 Scopus citations
  • 2011

    Enabling improved power management in multicore processors through clustered DVFS

    Kolpe, T., Zhai, A. & Sapatnekar, S. S., May 31 2011, Proceedings - Design, Automation and Test in Europe Conference and Exhibition, DATE 2011. p. 293-298 6 p. 5763052. (Proceedings -Design, Automation and Test in Europe, DATE).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    62 Scopus citations
  • NoC frequency scaling with flexible-pipeline routers

    Zhou, P., Yin, J., Zhai, A. B. & Sapatnekar, S. S., Sep 19 2011, IEEE/ACM International Symposium on Low Power Electronics and Design, ISLPED 2011. p. 403-408 6 p. 5993674

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    13 Scopus citations
  • 2010

    Energy efficient speculative threads: Dynamic thread allocation in same-ISA heterogeneous multicore systems

    Luo, Y., Packirisamy, V., Hsu, W. C. & Zhai, A., 2010, PACT'10 - Proceedings of the 19th International Conference on Parallel Architectures and Compilation Techniques. Institute of Electrical and Electronics Engineers Inc., p. 453-464 12 p. (Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    17 Scopus citations
  • Improving the performance of program monitors with compiler support in multi-core environment

    He, G. & Zhai, A., Jul 1 2010, Proceedings of the 2010 IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2010. 5470405. (Proceedings of the 2010 IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2010).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    3 Scopus citations
  • Performance characterization of data mining benchmarks

    Mekkat, V., Natarajan, R., Hsu, W. C. & Zhai, A., May 18 2010, INTERACT-14 - Proceedings of the 2010 Workshop on Interaction between Compilers and Computer Architecture. 1739040. (Proceedings - Annual Workshop on Interaction between Compilers and Computer Architectures, INTERACT).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    4 Scopus citations
  • 2009

    Dynamic performance tuning for speculative threads

    Luo, Y., Packirisamy, V., Hsu, W. C., Zhai, A., Mungre, N. & Tarkas, A., 2009, ISCA 2009 - 36th Annual International Symposium on Computer Architecture, Conference Proceedings. p. 462-473 12 p. (Proceedings - International Symposium on Computer Architecture).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    26 Scopus citations
  • Exploiting TLS parallelism at multiple loop-nest levels

    Packirisamy, V. & Zhai, A. B., Dec 1 2009, ICPADS '09 - 15th International Conference on Parallel and Distributed Systems. p. 205-212 8 p. 5395253. (Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    1 Scopus citations
  • Exploring speculative parallelism in SPEC2006

    Packirisamy, V., Zhai, A., Hsu, W. C., Yew, P. C. & Ngai, T. F., 2009, ISPASS 2009 - International Symposium on Performance Analysis of Systems and Software. p. 77-88 12 p. 4919640. (ISPASS 2009 - International Symposium on Performance Analysis of Systems and Software).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    34 Scopus citations
  • Hardware supported flexible monitoring: Early results

    Zhai, A. B., He, G. & Heimdahl, M., 2009, Runtime Verification - 9th International Workshop, RV 2009, Selected Papers. p. 168-183 16 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5779 LNCS).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • 2008

    Compiler optimizations for parallelizing general-purpose applications under Thread-Level Speculation

    Zhai, A. B., Wang, S., Yew, P-C. & He, G., Dec 1 2008, PPoPP'08 - Proceedings of the 2008 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. p. 271-272 2 p. (Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    5 Scopus citations
  • Efficiency of thread-level speculation in smt and cmp architectures - performance, power and thermal perspective

    Packirisamy, V., Luo, Y., Hung, W. L., Zhai, A., Yew, P. C. & Ngai, T. F., 2008, 26th IEEE International Conference on Computer Design 2008, ICCD. p. 286-293 8 p. 4751875. (26th IEEE International Conference on Computer Design 2008, ICCD).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    9 Scopus citations
  • 2007

    Exploiting speculative thread-level parallelism in data compression applications

    Wang, S., Zhai, A. B. & Yew, P-C., 2007, Languages and Compilers for Parallel Computing - 19th International Workshop, LCPC 2006, Revised Papers. Springer Verlag, p. 126-140 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4382 LNCS).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    2 Scopus citations
  • 2006

    A study of the performance potential for dynamic instruction hints selection

    Fu, R., Lu, J., Zhai, A. & Hsu, W. C., 2006, Advances in Computer Systems Architecture - 11th Asia-Pacific Conference, ACSAC 2006, Proceedings. Springer Verlag, p. 67-80 14 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4186 LNCS).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    1 Scopus citations
  • Issues and support for dynamic register allocation

    Das, A., Fu, R., Zhai, A. & Hsu, W. C., 2006, Advances in Computer Systems Architecture - 11th Asia-Pacific Conference, ACSAC 2006, Proceedings. Springer Verlag, p. 351-358 8 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4186 LNCS).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    1 Scopus citations
  • Loop selection for thread-level speculation

    Wang, S., Dai, X., Yellajyosula, K. S., Zhai, A. & Yew, P. C., Dec 1 2006, Languages and Compilers for Parallel Computing - 18th International Workshop, LCPC 2005, Revised Selected Papers. p. 289-303 15 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4339 LNCS).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    24 Scopus citations
  • Supporting speculative multithreading on simultaneous multithreaded processors

    Packirisamy, V., Wang, S., Zhai, A., Hsu, W. C. & Yew, P. C., 2006, High Performance Computing - HiPC 2006 - 13th International Conference Proceedings. p. 148-158 11 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4297 LNCS).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    10 Scopus citations
  • 2005

    A general compiler framework for speculative optimizations using data speculative code motion

    Dai, X., Zhai, A., Hsu, W. C. & Yew, P. C., Dec 1 2005, Proceedings of the 2005 International Symposium on Code Generation and Optimization, CGO 2005. p. 280-290 11 p. 1402095. (Proceedings of the 2005 International Symposium on Code Generation and Optimization, CGO 2005; vol. 2005).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    16 Scopus citations
  • 2004

    Compiler optimization of memory-resident value communication between speculative threads

    Zhai, A., Colohan, C. B., Steffan, J. G. & Mowry, T. C., 2004, International Symposium on Code Generation and Optimization, CGO 2004. p. 39-50 12 p. (International Symposium on Code Generation and Optimization, CGO).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    34 Scopus citations
  • 2002

    Compiler optimization of scalar value communication between speculative threads

    Zhai, A., Colohan, C. B., Steffan, J. G. & Mowry, T. C., 2002, Operating Systems Review (ACM). p. 171-183 13 p. (International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    64 Scopus citations
  • Improving value communication for thread-level speculation

    Steffan, J. G., Colohan, C. B., Zhai, A. & Mowry, T. C., 2002, Proceedings - 8th International Symposium on High-Performance Computer Architecture, HPCA 2002. IEEE Computer Society, p. 65-75 11 p. 995699. (Proceedings - International Symposium on High-Performance Computer Architecture; vol. 2002-January).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    76 Scopus citations