Commit Graph

  • f25919dc06 thesis: finished re-read and made some improvements main Daniel 2025-06-10 11:34:58 +02:00
  • 3efd8a6c26 thesis: aded abstract and kurzfassung; re-read conclusion and evaluation to iron out mistakes etc. Daniel 2025-06-09 14:11:58 +02:00
  • b494803611 evaluation: finished chapter; re-read for errors yet to be done Daniel 2025-06-07 13:58:49 +02:00
  • 4132a4946f evaluation: started and almost finished transpiler section Daniel 2025-06-06 13:16:40 +02:00
  • 275162d38d evaluation: added diagrams for comparison of all three implementations Daniel 2025-05-31 15:49:02 +02:00
  • bccf830d6f evaluation: continued performance tuning interpreter. conducted some additional tests Daniel 2025-05-29 18:20:33 +02:00
  • 381a4819c9 evaluation: continued with interpreter benchmarking and performance tuning Daniel 2025-05-29 15:13:17 +02:00
  • 99a222341d benchmarking: finished taking evaluation results; evaluation: continued writing Daniel 2025-05-25 13:27:18 +02:00
  • 14b2e23d9a evaluation: found thath benchmark 2 can't be executed by any implementation due to RAM constraints Daniel 2025-05-24 16:58:35 +02:00
  • 5f44e4d122 benchmarking: added benchmark4 interpreter results; extended evaluation section Daniel 2025-05-24 13:17:15 +02:00
  • 2bbdef6837 benchmarking: added results for benchmark 4; extended thesis to include the fourth benchmark Daniel 2025-05-24 10:55:44 +02:00
  • 7c97213d13 benchmarking: added CPU results for benchmark 3 Daniel 2025-05-24 09:10:08 +02:00
  • 1bee747a8e benchmarking: added CPU results for benchmark 2 Daniel 2025-05-23 15:55:00 +02:00
  • 4a51ebcde9 benchmarking: added final results for benchmark 1 for transpiler Daniel 2025-05-23 13:57:52 +02:00
  • 2cab6e0698 evaluation: started documenting results of evaluations Daniel 2025-05-22 13:02:11 +02:00
  • 5bada5ffcb Merge branch 'main' of https://git.elosia.mooo.com/Daniel/master-thesis Daniel 2025-05-22 09:35:54 +02:00
  • ad04f4715b benchmarking: added final results for interpreter in benchmark 1 Daniel 2025-05-22 09:35:30 +02:00
  • 4ed0ce6523 evaluation: almost finished benchmark environment section Daniel 2025-05-21 12:47:24 +02:00
  • c2334d7f05 benchmarking: added results for transpiler Daniel 2025-05-20 18:55:45 +02:00
  • 39302ffe9e benchmarking: added results for transpiler 2-interpreter-blocksize-121_transpiler-transpilation-only-once-new Daniel 2025-05-20 18:55:15 +02:00
  • 908cc55b46 evaluation: added evaluation information for second transpiler run Daniel 2025-05-20 18:52:26 +02:00
  • 2701f1aa38 benchmarking: minor improvements to interpreter Daniel 2025-05-20 14:42:52 +02:00
  • 250deb334c benchmarking: tuned interpreter blocksize Daniel 2025-05-20 09:05:35 +02:00
  • a9ffd5da63 benchmarking: fixed bugs introduced by modification of transpiler Daniel 2025-05-19 12:29:05 +02:00
  • e29199d865 benchmarking: moved compilation of kernel to evaluate function, as it required too much memory Daniel 2025-05-19 11:58:24 +02:00
  • f33551e25f benchmarking: updated transpiler to drastically reduce the number of transpilations at the expense of memory usage Daniel 2025-05-19 11:39:49 +02:00
  • 33e7edd4c8 benchmarking: added results for first optimisation for transpiler Daniel 2025-05-19 09:14:16 +02:00
  • 6bcc9000b1 benchmarking: added initial results for transpiler 0-initial-implementation-new Daniel 2025-05-19 09:12:59 +02:00
  • 09ca7ac6c6 benchmarking: added initial results for transpiler Daniel 2025-05-19 09:12:06 +02:00
  • 7c3616a264 benchmarking: added results for transpiler 1-improved-data-transfer-new Daniel 2025-05-19 09:06:36 +02:00
  • 93516c54d4 evaluation: started describing benchmark environment Daniel 2025-05-18 14:17:47 +02:00
  • e03f20006f benchmarking: added results for interpreter after first performance improvement Daniel 2025-05-18 10:11:33 +02:00
  • a5518dd63e benchmarking: moved frontend calls and sending postfixExprs+vars outside to drastically reduce amount of calculations Daniel 2025-05-17 18:32:04 +02:00
  • 88ee8d20bd benchmarking: added interpreter results Daniel 2025-05-17 16:07:03 +02:00
  • 47dcc29b33 benchmarking: added CPU benchmark results Daniel 2025-05-16 14:29:47 +02:00
  • d7e18f183d benchmarking: updated benchmarking suite and prepared for taking the benchmarks Daniel 2025-05-15 16:25:32 +02:00
  • 3d80ae95e4 benchmarking: further tests done. Seems like transpiler takes ages, need to investigate further Daniel 2025-05-11 16:54:19 +02:00
  • 5b31fbb270 benchmarking: changes to not fill up all of the RAM Daniel 2025-05-10 15:10:26 +02:00
  • 2ba1fef5ba benchmarking: removed caches to get initial performance measurement. still some problems Daniel 2025-05-10 13:11:27 +02:00
  • 6d3c3164cf expression processing: added support for inverse/reciprocal Daniel 2025-05-10 09:36:02 +02:00
  • 7121329a17 benchmarking: started tuning benchmarking results. found some errors that need fixing Daniel 2025-05-09 19:19:53 +02:00
  • 327e4ebf1b benchmarking: prepared tests for using actual data Daniel 2025-05-09 13:58:10 +02:00
  • 2c8a9cd2d8 added support for variables and parameters as array. also improved conversion of variables and parameters into Expressionelement Daniel 2025-05-09 11:04:10 +02:00
  • aaa3f2c7c0 interpreter: reverted previous change to make interpreter work correctly again Daniel 2025-05-09 09:45:43 +02:00
  • ae03ebdf1d implementation: finished re-reading chapter and fixed suboptimal wording Daniel 2025-05-08 16:32:05 +02:00
  • 6b7205e026 implementation: Finished chapter. checking for errors and textual improvements not finished Daniel 2025-05-08 12:34:08 +02:00
  • b69a3efe96 implementation: continued writing transpiler section Daniel 2025-05-04 13:54:05 +02:00
  • 18d89e27ca implementation: started transpiler section Daniel 2025-05-03 12:25:34 +02:00
  • e8e457eae9 implementation: finished interpreter section and started transpiler section Daniel 2025-05-02 12:35:03 +02:00
  • c4187a131e implementation: finished interpreter section Daniel 2025-05-01 14:36:14 +02:00
  • 101b13e7e7 implementation: started writing interpreter section; finished CPU-side part; also found error with interpreter Daniel 2025-04-27 13:17:35 +02:00
  • e571fa5bd6 implementation: finished pre-processing section; updated code Daniel 2025-04-26 13:46:23 +02:00
  • ad2eab2e0a implementation: continued pre-processing section; added cache to ExpressionProcessing.jl to improve performance Daniel 2025-04-25 16:22:57 +02:00
  • b40a06af3f implementation: started describing frontend Daniel 2025-04-21 11:58:48 +02:00
  • 210831146a implementation: started writing impl; finished technology section Daniel 2025-04-20 12:18:40 +02:00
  • 90a4194283 expression processing: fixed error if expression contained nested unary operators such as log(sqrt(4)) Daniel 2025-04-19 10:54:41 +02:00
  • 0d888edc52 Merge pull request #2 from daniwipes/expression-file daniwipes 2025-04-19 08:43:35 +02:00
  • 742a544e1a concept and design: finished reworking sections Daniel 2025-04-18 12:41:54 +02:00
  • 4fe9040a6f Make @btime work Gabriel Kronberger 2025-04-18 11:41:25 +02:00
  • bc49b33149 Bugfix Gabriel Kronberger 2025-04-18 11:41:11 +02:00
  • 293c5f13a4 Add files with expressions, parser, Nikuradse dataset and a new test case. Gabriel Kronberger 2025-04-18 08:19:22 +02:00
  • 6ab826cc42 concept and design: rewrote requirements section Daniel 2025-04-17 15:12:58 +02:00
  • c3f19d807a relwork: implemented Kronberger feedback Daniel 2025-04-17 13:31:11 +02:00
  • 6880c1ceb5 benchmarking: added uni performance test Daniel 2025-04-16 20:54:12 +02:00
  • c62aff806a small updates and notes for further writing Daniel 2025-04-15 19:32:39 +02:00
  • ef721b13e0 evaluation: updated notes for chapter Daniel 2025-04-13 14:20:16 +02:00
  • a5c34a53b7 benchmarking: reverted previous; made interpreter use fast math Daniel 2025-04-13 13:26:35 +02:00
  • 6d6874c7ba benchmarking: added results for test 4 on uni-pc Daniel 2025-04-13 11:45:52 +02:00
  • 278a493595 benchmarking: tested int32 also on uni pc 4-interpreter-using-int32 Daniel 2025-04-13 11:43:17 +02:00
  • af3b72f196 benchmarking: used int32 wherever possible; resulted in noticeable performance drop Daniel 2025-04-13 11:32:54 +02:00
  • 4c60331288 evaluation: added introduction text and made plan for additional text Daniel 2025-04-12 16:22:14 +02:00
  • 101ccef67b benchmarking: tuned blocksizes; slightly improved performance; mostly improved standard deviation 3-tuned-blocksize-i128-t96 Wiplinger Daniel - s2310454043 2025-04-12 13:20:50 +02:00
  • c6e2ce47aa benchmarking: redid performance tests on uni pc Wiplinger Daniel - s2310454043 2025-04-12 12:13:28 +02:00
  • 9746db56c0 benchmarking: added tests for performance tuning with Nsight compute and systems 2-performance-inbounds Daniel 2025-04-12 10:24:45 +02:00
  • 5a9760d221 concept and design: minor improvements Daniel 2025-04-10 11:17:50 +02:00
  • c68e0d04a0 concept and design: improved wording and added overview diagram of kernel usage Daniel 2025-04-10 10:21:01 +02:00
  • 258d33c338 benchmarking: redid inbounds tests because first test run did not use inbounds everywhere Daniel 2025-04-06 14:29:07 +02:00
  • 20fcbab4ca concept and design: added transpiler section Daniel 2025-04-06 13:59:14 +02:00
  • 9e1094ac43 benchmarking: added reminder to redo one of the performance tests Daniel 2025-04-05 12:56:06 +02:00
  • 2a8de064a6 concept & design: improved existing sections Daniel 2025-04-05 12:54:26 +02:00
  • 8afc3a5e3b concept and design: continued with architecture Daniel 2025-04-04 14:18:56 +02:00
  • d8f5454e9c concept and design: started writing this chapter Daniel 2025-04-03 13:43:23 +02:00
  • 2b9c394f1b benchmarking: improved performance with @inbounds. still slower in most cases Daniel 2025-04-01 21:48:59 +02:00
  • d9c83caad9 benchmarking: updated blocksize to 256 with moderate improvements 1-performance-256-blocksize Daniel 2025-03-30 13:56:25 +02:00
  • 1dc0c1898d benchmarking: fixed bugs; took initial_benchmark initial-implementation Daniel 2025-03-30 12:54:50 +02:00
  • ad175abac0 benchmarking: added gpu evaluators to performance testing. getting execution errors still Daniel 2025-03-29 13:35:59 +01:00
  • 690ee33db1 benchmarks: started preparing benchmarks Daniel 2025-03-29 12:01:06 +01:00
  • effd477558 transpiler: generates valid PTX and evaluates expressions correctly Daniel 2025-03-28 19:32:48 +01:00
  • 9df78ca72e transpiler: invalid memory access error finally fixed Daniel 2025-03-27 22:32:24 +01:00
  • 561b37160b transpiler: trying to fix problem with writing to global memory; not yet fixed Daniel 2025-03-27 18:00:31 +01:00
  • eaee21ca75 transpiler: results are now written in results array; preperation for performance testing Daniel 2025-03-27 09:55:29 +01:00
  • baa37ea183 code: started finalising transpilation process and preparing for performance testing and tuning Daniel 2025-03-23 13:38:22 +01:00
  • db02e9f90f Merge pull request #1 from daniwipes/cpu-interpreter daniwipes 2025-03-23 10:23:54 +01:00
  • e33be8f59e relwork: finished second read to improve wording and correct mistakes Daniel 2025-03-23 09:57:23 +01:00
  • 3c289f13d1 relwork: slight rewording and fixing errors Daniel 2025-03-21 17:49:55 +01:00
  • a718a3572e relwork: finished relwork Daniel 2025-03-21 14:35:55 +01:00
  • db3ea32b66 relwork: continuation of compilers section Daniel 2025-03-20 13:31:45 +01:00
  • d514b07434 relwork: started compilers section Daniel 2025-03-18 10:32:14 +01:00
  • de5493ca3e relwork: finished occupancy and ptx sections Daniel 2025-03-16 17:43:32 +01:00
  • 84fdf5c9ca relwork: continuation of programing gpus Daniel 2025-03-15 14:33:33 +01:00