summaryrefslogtreecommitdiffstats
path: root/src/video_core/CMakeLists.txt (follow)
Commit message (Collapse)AuthorAgeFilesLines
* gpu: dependency-inject scaling/antialiasing filter state for capture layersLiam2024-02-091-0/+1
|
* nvnflinger/gpu: implement applet captureLiam2024-02-091-0/+1
|
* VideoCore: Move Slot Vector to CommonFernando Sahmkow2024-02-041-1/+0
|
* renderer_opengl: implement layer stack compositionLiam2024-01-311-0/+3
|
* renderer_vulkan: implement layer stack compositionLiam2024-01-311-0/+3
|
* renderer_opengl: split up blit screen resources into antialias and window adapt passesLiam2024-01-311-0/+4
|
* renderer_opengl: split out FXAALiam2024-01-311-0/+2
|
* renderer_opengl: split out SMAALiam2024-01-311-2/+5
|
* renderer_vulkan: split up blit screen resources into separate antialias and window adapt passesLiam2024-01-311-0/+5
|
* renderer_vulkan: isolate FXAA from blit screenLiam2024-01-311-4/+8
|
* renderer_opengl: isolate core presentation codeLiam2024-01-311-0/+2
|
* video_core: simplify accelerated surface fetch and crop handling between APIsLiam2024-01-311-0/+1
|
* Merge pull request #12814 from Kelebek1/time_new_ipcliamwhite2024-01-291-2/+2
|\ | | | | Move time services to new IPC and add debug printing
| * Move time services to new IPC.Kelebek12024-01-271-2/+2
| | | | | | | | Add some fixes/improvements to usage with the new IPC
* | Merge pull request #12439 from FireBurn/vkresultliamwhite2024-01-291-1/+1
|\ \ | |/ |/| Simplify VkResult lookup
| * Add Vulkan-Utility-Libraries dependencyMike Lothian2024-01-221-1/+1
| |
* | SMMU: Initial adaptation to video_core.Fernando Sahmkow2024-01-191-2/+1
| |
* | NVDRV: Implement sessions and initial implementation of SMMUFernando Sahmkow2024-01-191-0/+2
|/
* Merge pull request #11535 from GPUCode/upload_cmdbufFernando S2023-11-261-0/+1
|\ | | | | renderer_vulkan: Introduce separate cmd buffer for uploads
| * renderer_vulkan: Introduce separate cmd buffer for uploadsGPUCode2023-11-121-0/+1
| |
* | video_core: refactor video frame and packet parsingLiam2023-11-161-1/+3
|/
* Query Cache: Setup Base reworkFernando Sahmkow2023-09-231-0/+6
|
* vma: enable options everywhereAlexandre Bouvier2023-07-311-0/+2
|
* cmake: allow using system VMA libraryAlexandre Bouvier2023-07-121-1/+5
|
* video_core: Add BCn decoding supportGPUCode2023-06-281-3/+3
|
* externals: Add vma and initialize itlat9nq2023-06-181-1/+1
| | | | video_core: Move vma implementation to library
* Merge pull request #10476 from ameerj/gl-memory-mapsliamwhite2023-06-071-2/+2
|\ | | | | OpenGL: Make use of persistent buffer maps in buffer cache
| * OpenGL: Make use of persistent buffer maps in buffer cache downloadsameerj2023-05-281-2/+2
| | | | | | | | | | | | Persistent buffer maps were already used by the texture cache, this extends their usage for the buffer cache. In my testing, using the memory maps for uploads was slower than the existing "ImmediateUpload" path, so the memory map usage is limited to downloads for the time being.
* | build: only enable adrenotools on arm64Liam2023-06-031-1/+1
| |
* | externals: add adrenotools for bcenablerLiam2023-06-031-0/+4
| |
* | cmake: Integrate bundled FFmpeg for Android.bunnei2023-06-031-1/+1
|/
* textures: add BC1 and BC3 compressors and recompression settingLiam2023-05-231-1/+5
|
* renderer_vulkan: Async presentationGPUCode2023-05-011-0/+2
|
* Buffer Cache: Fully rework the buffer cache.Fernando Sahmkow2023-04-291-0/+5
|
* Partially apply LTO to only core and video_core projects.Matías Locatti2023-02-271-0/+4
|
* video_core/opengl: Add FSR upscaling filter to the OpenGL rendererWollnashorn2023-01-261-0/+4
|
* Merge pull request #9556 from vonchenplus/draw_textureliamwhite2023-01-191-0/+2
|\ | | | | video_core: Implement maxwell3d draw texture method
| * video_core: Implement opengl/vulkan draw_textureFeng Chen2023-01-051-0/+2
| |
* | Merge pull request #9552 from liamwhite/turboliamwhite2023-01-061-0/+2
|\ \ | | | | | | vulkan: implement 'turbo mode' clock booster
| * | vulkan: implement 'turbo mode' clock boosterLiam2023-01-051-0/+2
| |/
* / video_core: Cache GPU internal writes.Fernando Sahmkow2023-01-051-0/+1
|/
* RasterizerMemory: Add filtering for flushing/invalidation operations.Fernando Sahmkow2023-01-011-0/+1
|
* video_core: Integrate SMAALiam2022-12-081-0/+4
| | | | | Co-authored-by: goldenx86 <goldenx86@users.noreply.github.com> Co-authored-by: BreadFish64 <breadfish64@users.noreply.github.com>
* Merge pull request #9401 from vonchenplus/draw_managerFernando S2022-12-081-0/+2
|\ | | | | video_core: Implement maxwell3d draw manager and split draw logic
| * video_core: Implement maxwell3d draw manager and split draw logicFeng Chen2022-12-081-0/+2
| |
* | cmake: prefer system librariesAlexandre Bouvier2022-12-041-4/+3
|/
* Merge pull request #9374 from liamwhite/externalsliamwhite2022-12-041-2/+8
|\ | | | | externals: update dynarmic, SDL2
| * externals: update dynarmic, SDL2Liam2022-12-041-2/+8
| |
* | Merge pull request #9344 from liamwhite/nullbunnei2022-12-031-0/+4
|\ \ | |/ |/| video_core: add null backend
| * video_core: add null backendLiam2022-11-291-0/+4
| |
* | CMake: Use precompiled headersameerj2022-11-301-0/+5
|/
* Fermi2D: Rework blit engine and add a software blitter.Fernando Sahmkow2022-11-241-0/+4
|
* Initial ARM64 supportLiam2022-11-091-3/+12
|
* CMakeLists: Remove all redundant warningsMorph2022-10-221-7/+1
| | | | These are already explicitly or implicitly set in src/CMakeLists.txt
* video_core: Implement memory manager page kindFengChen2022-10-171-0/+1
|
* NVDRV: Refactor Host1xFernando Sahmkow2022-10-061-0/+2
|
* VideoCore: Refactor syncing.Fernando Sahmkow2022-10-061-19/+21
|
* Refactor VideoCore to use AS sepparate from Channel.Fernando Sahmkow2022-10-061-0/+1
|
* VideoCore: implement channels on gpu caches.Fernando Sahmkow2022-10-061-0/+8
|
* chore: make yuzu REUSE compliantAndrea Pappacoda2022-07-271-0/+3
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | [REUSE] is a specification that aims at making file copyright information consistent, so that it can be both human and machine readable. It basically requires that all files have a header containing copyright and licensing information. When this isn't possible, like when dealing with binary assets, generated files or embedded third-party dependencies, it is permitted to insert copyright information in the `.reuse/dep5` file. Oh, and it also requires that all the licenses used in the project are present in the `LICENSES` folder, that's why the diff is so huge. This can be done automatically with `reuse download --all`. The `reuse` tool also contains a handy subcommand that analyzes the project and tells whether or not the project is (still) compliant, `reuse lint`. Following REUSE has a few advantages over the current approach: - Copyright information is easy to access for users / downstream - Files like `dist/license.md` do not need to exist anymore, as `.reuse/dep5` is used instead - `reuse lint` makes it easy to ensure that copyright information of files like binary assets / images is always accurate and up to date To add copyright information of files that didn't have it I looked up who committed what and when, for each file. As yuzu contributors do not have to sign a CLA or similar I couldn't assume that copyright ownership was of the "yuzu Emulator Project", so I used the name and/or email of the commit author instead. [REUSE]: https://reuse.software Follow-up to 01cf05bc75b1e47beb08937439f3ed9339e7b254
* CMakeLists: Make variable shadowing a compile-time errorMorph2022-06-141-5/+0
| | | | Now that the entire project is free of variable shadowing, we can enforce this as a compile time error to prevent any further introduction of this logic bug.
* core/debugger: Improved stepping mechanism and misc fixesLiam2022-06-011-0/+4
|
* video_core/cmake: link against libva explicitly ...liushuyu2021-12-031-0/+1
| | | | ... to fix build on Flatpak (and self-builds)
* Vulkan: Reimplement FSR constant generation functions to avoid GCC warningsMarshall Mohror2021-11-161-1/+0
|
* Presentation: Only use FP16 in scaling shaders on supported devices in VulkanMarshall Mohror2021-11-161-0/+1
|
* vulkan: Implement FidelityFX Super ResolutionMarshall Mohror2021-11-161-0/+2
|
* codecs: Add VP8 codec classameerj2021-11-131-0/+2
|
* cmake: Add VDPAU and NVDEC support to FFmpeglat9nq2021-08-161-0/+1
| | | | Adds {h264_,vp9_}{nvdec,vdpau} hwaccels.
* texture_cache: Address ameerj's reviewyzct123452021-08-051-3/+3
|
* texture_cache: Split templates outyzct123452021-08-051-0/+3
|
* nvdec: Implement VA-API hardware video acceleration (#6713)yzct123452021-08-041-0/+5
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * nvdec: VA-API * Verify formatting * Forgot a semicolon for Windows * Clarify comment about AV_PIX_FMT_NV12 * Fix assert log spam from missing negation * vic: Remove forgotten debug code * Address lioncash's review * Mention VA-API is Intel/AMD * Address v1993's review * Hopefully fix CMakeLists style this time * vic: Improve cache locality * vic: Fix off-by-one error * codec: Async * codec: Forgot the GetValue() * nvdec: Address ameerj's review * codec: Fallback to CPU without VA-API support * cmake: Address lat9nq's review * cmake: Make VA-API optional * vaapi: Multiple GPU * Apply suggestions from code review Co-authored-by: Ameer J <52414509+ameerj@users.noreply.github.com> * nvdec: Address ameerj's review * codec: Use anonymous instead of static * nvdec: Remove enum and fix memory leak * nvdec: Address ameerj's review * codec: Remove preparation for threading Co-authored-by: Ameer J <52414509+ameerj@users.noreply.github.com>
* renderer_vulkan: Add setting to log pipeline statisticsReinUsesLisp2021-07-281-0/+2
| | | | | | | | | | | | | | | | Use VK_KHR_pipeline_executable_properties when enabled and available to log statistics about the pipeline cache in a game. For example, this is on Turing GPUs when generating a pipeline cache from Super Smash Bros. Ultimate: Average pipeline statistics ========================================== Code size: 6433.167 Register count: 32.939 More advanced results could be presented, at the moment it's just an average of all 3D and compute pipelines.
* gl_shader_cache: Implement async shadersameerj2021-07-231-0/+1
|
* gl_shader_cache: Rename Program abstractions into PipelineReinUsesLisp2021-07-231-4/+4
|
* video_core: Abstract transform feedback translation utilityReinUsesLisp2021-07-231-0/+2
|
* shader: Initial OpenGL implementationReinUsesLisp2021-07-231-0/+4
|
* shader: Move pipeline cache logic to separate filesReinUsesLisp2021-07-231-0/+3
| | | | | | | | | Move code to separate files to be able to reuse it from OpenGL. This greatly simplifies the pipeline cache logic on Vulkan. Transform feedback state is not yet abstracted and it's still intrusively stored inside vk_pipeline_cache. It will be moved when needed on OpenGL.
* shader_recompiler,video_core: Cleanup some GCC and Clang errorslat9nq2021-07-231-1/+1
| | | | | | | | | | | | | | | | | Mostly fixing unused *, implicit conversion, braced scalar init, fpermissive, and some others. Some Clang errors likely remain in video_core, and std::ranges is still a pertinent issue in shader_recompiler shader_recompiler: cmake: Force bracket depth to 1024 on Clang Increases the maximum fold expression depth thread_worker: Include condition_variable Don't use list initializers in control flow Co-authored-by: ReinUsesLisp <reinuseslisp@airmail.cc>
* shader: Add partial rasterizer integrationReinUsesLisp2021-07-231-1/+5
|
* shader: Primitive Vulkan integrationReinUsesLisp2021-07-231-4/+2
|
* shader: Remove old shader managementReinUsesLisp2021-07-231-64/+0
|
* video_core: Enforce C4242Morph2021-06-281-3/+2
|
* video_core: Enforce C4244ReinUsesLisp2021-06-261-0/+1
| | | | Enforce implicit integer casts to a smaller type as errors.
* textures: Reintroduce CPU ASTC decoderameerj2021-06-161-0/+1
| | | | | | | Users may want to fall back to the CPU ASTC texture decoder due to hangs and crashes that may be caused by keeping the GPU under compute heavy loads for extended periods of time. This is especially the case in games such as Astral Chain which make extensive use of ASTC textures.
* astc_decoder: Refactor for style and more efficient memory useameerj2021-03-251-1/+0
|
* video_core: Reimplement the buffer cacheReinUsesLisp2021-02-131-5/+1
| | | | | | | | | | | | | | | | | | | | | | | | | | | | Reimplement the buffer cache using cached bindings and page level granularity for modification tracking. This also drops the usage of shared pointers and virtual functions from the cache. - Bindings are cached, allowing to skip work when the game changes few bits between draws. - OpenGL Assembly shaders no longer copy when a region has been modified from the GPU to emulate constant buffers, instead GL_EXT_memory_object is used to alias sub-buffers within the same allocation. - OpenGL Assembly shaders stream constant buffer data using glProgramBufferParametersIuivNV, from NV_parameter_buffer_object. In theory this should save one hash table resolve inside the driver compared to glBufferSubData. - A new OpenGL stream buffer is implemented based on fences for drivers that are not Nvidia's proprietary, due to their low performance on partial glBufferSubData calls synchronized with 3D rendering (that some games use a lot). - Most optimizations are shared between APIs now, allowing Vulkan to cache more bindings than before, skipping unnecesarry work. This commit adds the necessary infrastructure to use Vulkan object from OpenGL. Overall, it improves performance and fixes some bugs present on the old cache. There are still some edge cases hit by some games that harm performance on some vendors, this are planned to be fixed in later commits.
* Merge pull request #5880 from lat9nq/ffmpeg-externalAmeer J2021-02-091-6/+5
|\ | | | | cmake: FFmpeg linking rework
| * CMake: Port citra-emu/citra FindFFmpeg.cmakelat9nq2021-02-051-2/+2
| | | | | | | | | | | | | | | | | | | | | | Also renames related CMake variables to match both the Find*FFmpeg* and variables defined within the file. Fixes odd errors produced by the old FindFFmpeg. Citra's FindFFmpeg is slightly modified here: adds Citra's copyright at the beginning, renames FFmpeg_INCLUDES to FFmpeg_INCLUDE_DIR, disables a few components in _FFmpeg_ALL_COMPONENTS, and adds the missing avutil component to the comment above.
| * CMake: Implement YUZU_USE_BUNDLED_FFMPEGlat9nq2021-02-051-6/+5
| | | | | | | | | | | | | | | | For Linux, instructs CMake to use the FFmpeg submodule in externals. This is HEAVILY based on our usage of the late Unicorn. Minimal change to MSVC as it uses the yuzu-emu/ext-windows-bin. MinGW now targets the same ext-windows-bin libraries as MSVC for FFmpeg. Adds FFMPEG_LIBRARIES to WIN32 and simplifies video_core/CMakeLists.txt a bit.
* | video_core: Delete mortonChloe Marcec2021-02-081-2/+0
|/ | | | moron.h & morton.cpp are not used anywhere and are just empty files
* video_core/cmake: Properly generate fatal errors on AftermathReinUsesLisp2021-01-231-2/+2
| | | | | Fix "message(ERROR ..." to "message(FATAL_ERROR ..." to properly stop cmake when Nsight Aftermath can't be configured.
* Merge pull request #5262 from ReinUsesLisp/buffer-baseRodrigo Locatti2021-01-161-0/+1
|\ | | | | buffer_cache/buffer_base: Add a range tracking buffer container and tests
| * buffer_cache/buffer_base: Add a range tracking buffer containerReinUsesLisp2021-01-131-0/+1
| | | | | | | | | | | | | | | | | | It keeps track of the modified CPU and GPU ranges on a CPU page granularity, notifying the given rasterizer about state changes in the tracking behavior of the buffer. Use a small vector optimization to store buffers smaller than 256 KiB locally instead of using free store memory allocations.
* | vulkan_common: Move allocator to the common directoryReinUsesLisp2021-01-151-2/+2
| | | | | | | | Allow using the abstraction from the OpenGL backend.
* | video_core/cmake: Remove Werror flags already defined code-base wideReinUsesLisp2021-01-151-2/+0
|/ | | | These flags are already defined in src/cmake.
* renderer_vulkan/nsight_aftermath_tracker: Move to vulkan_commonReinUsesLisp2021-01-041-2/+2
|
* renderer_vulkan: Move device abstraction to vulkan_commonReinUsesLisp2021-01-041-2/+2
|
* Merge pull request #5230 from ReinUsesLisp/vulkan-commonRodrigo Locatti2021-01-031-2/+10
|\ | | | | vulkan_common: Move reusable Vulkan abstractions to a separate directory
| * renderer_vulkan: Initialize surface in separate fileReinUsesLisp2020-12-311-0/+2
| | | | | | | | | | | | Move surface initialization code to a separate file. It's unlikely to use this code outside of Vulkan, but keeping platform-specific code (Win32, Xlib, Wayland) in its own translation unit keeps things cleaner.
| * renderer_vulkan: Create debug callback on separate file and throwReinUsesLisp2020-12-311-0/+2
| | | | | | | | | | | | | | | | Initialize debug callbacks (messenger) from a separate file. This allows sharing code with different backends. Change our Vulkan error handling to use exceptions instead of error codes, simplifying the initialization process.
| * renderer_vulkan: Move instance initialization to a separate fileReinUsesLisp2020-12-311-0/+2
| | | | | | | | | | | | Simplify Vulkan's backend initialization code by moving it to a separate file, allowing us to initialize a Vulkan instance from different backends.
| * vulkan_common: Rename renderer_vulkan/wrapper.h to vulkan_common/vulkan_wrapper.hReinUsesLisp2020-12-311-2/+2
| | | | | | | | Allows sharing Vulkan wrapper code between different rendering backends.
| * vulkan_common: Move dynamic library load to a separate fileReinUsesLisp2020-12-311-0/+2
| | | | | | | | | | Allows us to initialize a Vulkan dynamic library from different backends without duplicating code.
* | Merge pull request #5208 from bunnei/service-threadsbunnei2020-12-311-4/+1
|\ \ | |/ |/| Service threads
| * video_core: gpu: Refactor out synchronous/asynchronous GPU implementations.bunnei2020-12-291-4/+1
| | | | | | | | - We must always use a GPU thread now, even with synchronous GPU.
* | video_core: Rewrite the texture cacheReinUsesLisp2020-12-301-22/+25
| | | | | | | | | | | | | | | | | | | | | | | | | | | | The current texture cache has several points that hurt maintainability and performance. It's easy to break unrelated parts of the cache when doing minor changes. The cache can easily forget valuable information about the cached textures by CPU writes or simply by its normal usage.The current texture cache has several points that hurt maintainability and performance. It's easy to break unrelated parts of the cache when doing minor changes. The cache can easily forget valuable information about the cached textures by CPU writes or simply by its normal usage. This commit aims to address those issues.
* | video_core: Add a delayed destruction ring abstractionReinUsesLisp2020-12-301-0/+1
|/
* Merge pull request #5226 from ReinUsesLisp/c4715-vcRodrigo Locatti2020-12-251-0/+1
|\ | | | | video_core: Enforce C4715 (not all control paths return a value)
| * video_core: Enforce C4715 (not all control paths return a value)ReinUsesLisp2020-12-251-0/+1
| | | | | | | | | | | | | | | | | | | | Most of the time people write code that always returns a value, terminates execution, throws an exception, or uses an unconventional jump primitive. This is not always true when we build without asserts on mainline builds. To avoid introducing undefined behavior on our most used builds, enforce this warning signalling an error and stopping the build from shipping.
* | cmake: Always enable VulkanReinUsesLisp2020-12-251-75/+66
|/ | | | | Removes the unnecesary burden of maintaining separate #ifdef paths and allows us sharing generic Vulkan code across APIs.
* video_core: Resolve more variable shadowing scenarios pt.3Lioncash2020-12-051-1/+8
| | | | | Cleans out the rest of the occurrences of variable shadowing and makes any further occurrences of shadowing compiler errors.
* Merge pull request #4848 from ReinUsesLisp/type-limitsLC2020-10-281-0/+1
|\ | | | | video_core: Enforce -Werror=type-limits
| * video_core: Enforce -Werror=type-limitsReinUsesLisp2020-10-281-0/+1
| | | | | | | | Silences one warning and avoids introducing more in the future.
* | video_core: Enforce -Wredundant-move and -Wpessimizing-moveReinUsesLisp2020-10-281-0/+2
|/ | | | Silence three warnings and make them errors to avoid introducing more in the future.
* video_core: NVDEC Implementationameerj2020-10-271-0/+26
| | | | | | | | | | | | | | This commit aims to implement the NVDEC (Nvidia Decoder) functionality, with video frame decoding being handled by the FFmpeg library. The process begins with Ioctl commands being sent to the NVDEC and VIC (Video Image Composer) emulated devices. These allocate the necessary GPU buffers for the frame data, along with providing information on the incoming video data. A Submit command then signals the GPU to process and decode the frame data. To decode the frame, the respective codec's header must be manually composed from the information provided by NVDEC, then sent with the raw frame data to the ffmpeg library. Currently, H264 and VP9 are supported, with VP9 having some minor artifacting issues related mainly to the reference frame composition in its uncompressed header. Async GPU is not properly implemented at the moment. Co-Authored-By: David <25727384+ogniK5377@users.noreply.github.com>
* video_core: Conditially activate relevant compiler warningsLioncash2020-10-211-2/+4
| | | | | | | | These compiler flags aren't shared with clang, so specifying these flags unconditionally can lead to a bit of warning spam. While we're in the area, we can also enable -Wunused-but-set-parameter given this is almost always a bug.
* video_core: Enforce -Wclass-memaccessReinUsesLisp2020-10-091-0/+1
|
* video_core: Enforce -Wunused-variable and -Wunused-but-set-variableReinUsesLisp2020-10-031-1/+7
|
* renderer_vulkan: Make unconditional use of VK_KHR_timeline_semaphoreReinUsesLisp2020-09-191-2/+6
| | | | | | | | | | | | | | | | | | | | | | | This reworks how host<->device synchronization works on the Vulkan backend. Instead of "protecting" resources with a fence and signalling these as free when the fence is known to be signalled by the host GPU, use timeline semaphores. Vulkan timeline semaphores allow use to work on a subset of D3D12 fences. As far as we are concerned, timeline semaphores are a value set by the host or the device that can be waited by either of them. Taking advantange of this, we can have a monolithically increasing atomic value for each submission to the graphics queue. Instead of protecting resources with a fence, we simply store the current logical tick (the atomic value stored in CPU memory). When we want to know if a resource is free, it can be compared to the current GPU tick. This greatly simplifies resource management code and the free status of resources should have less false negatives. To workaround bugs in validation layers, when these are attached there's a thread waiting for timeline semaphores.
* video_core: Enforce -Werror=switchReinUsesLisp2020-09-161-1/+1
| | | | This forces us to fix all -Wswitch warnings in video_core.
* video_core/host_shaders: Add CMake integration for string shadersReinUsesLisp2020-08-241-0/+5
| | | | | | | | | | | | | Add the necessary CMake code to copy the contents in a string source shader (GLSL or GLASM) to a header file then consumed by video_core files. This allows editting GLSL in its own files without having to maintain them in source files. For now, only OpenGL presentation shaders are moved, but we can add GLASM presentation shaders and static SPIR-V generation through glslangValidator in the future.
* async shadersDavid Marcec2020-07-171-0/+4
|
* video_core/compatible_formats: Table to test if two formats are legal to view or copyReinUsesLisp2020-06-271-0/+2
| | | | | | | | Add a flat table to test if it's legal to create a texture view between two formats or copy betweem them. This table is based on ARB_copy_image and ARB_texture_view. Copies are more permissive than views.
* Macro HLE supportDavid Marcec2020-06-241-0/+2
|
* Merge pull request #4041 from ReinUsesLisp/arb-decompbunnei2020-06-161-0/+2
|\ | | | | gl_arb_decompiler: Implement an assembly shader decompiler
| * gl_arb_decompiler: Implement an assembly shader decompilerReinUsesLisp2020-06-121-0/+2
| | | | | | | | | | | | Emit code compatible with NV_gpu_program5. This should emit code compatible with Fermi, but it wasn't tested on that architecture. Pascal has some issues not present on Turing GPUs.
* | rasterizer_cache: Remove files and includesReinUsesLisp2020-06-071-2/+0
| | | | | | | | | | The rasterizer cache is no longer used. Each cache has its own generic implementation optimized for the cached data.
* | shader_cache: Implement a generic shader cacheReinUsesLisp2020-06-071-0/+1
|/ | | | | | | | | Implement a generic shader cache for fast lookups and invalidations. Invalidations are cheap but expensive when a shader is invalidated. Use two mutexes instead of one to avoid locking invalidations for lookups and vice versa. When a shader has to be removed, lookups are locked as expected.
* Implement macro JITDavid Marcec2020-05-301-2/+6
|
* Add xbyak externalDavid Marcec2020-05-301-1/+1
|
* map_interval: Add interval allocator and drop hackReinUsesLisp2020-05-211-0/+1
| | | | | | | | | | Drop the std::list hack to allocate memory indefinitely. Instead use a custom allocator that keeps references valid until destruction. This allocates fixed chunks of memory and puts pointers in a free list. When an allocation is no longer used put it back to the free list, this doesn't heap allocate because std::vector doesn't change the capacity. If the free list is empty, allocate a new chunk.
* Merge pull request #3815 from FernandoS27/command-list-2bunnei2020-05-051-0/+1
|\ | | | | GPU: More optimizations to GPU Command List Processing and DMA Copy Optimizations
| * Clang Format and Documentation.Fernando Sahmkow2020-04-281-0/+1
| |
* | shader/memory_util: Deduplicate codeReinUsesLisp2020-04-261-0/+2
|/ | | | | | | | Deduplicate code shared between vk_pipeline_cache and gl_shader_cache as well as shader decoder code. While we are at it, fix a bug in gl_shader_cache where compute shaders had an start offset of a stage shader.
* Merge pull request #3677 from FernandoS27/better-syncbunnei2020-04-231-0/+5
|\ | | | | Introduce Predictive Flushing and Improve ASYNC GPU
| * vk_fence_manager: Initial implementationReinUsesLisp2020-04-221-0/+2
| |
| * GPU: Implement a Fence Manager.Fernando Sahmkow2020-04-221-0/+3
| |
* | renderer_vulkan: Integrate Nvidia Nsight Aftermath on WindowsReinUsesLisp2020-04-141-3/+16
|/ | | | | | | | | | | | | | | | | | | | | Adds optional support for Nsight Aftermath. It is enabled through ENABLE_NSIGHT_AFTERMATH in cmake. A path to the SDK has to be provided by the environment variable NSIGHT_AFTERMATH_SDK. Nsight Aftermath allows an application to generate "minidumps" of the GPU state when a device loss happens. By analysing these on Nsight we can know what a game was doing and why it triggered a device loss. The dump is generated inside %APPDATA%\yuzu\log\gpucrash and this directory is deleted every time a new instance is initialized with Nsight enabled. To enable it on yuzu there has a to be a driver and device capable of running Nsight Aftermath on Vulkan. That means only Turing based GPUs on the latest stable driver, beta drivers won't work for now. It is manually enabled in Configuration>Debug>Enable Graphics Debugging because when using all debugging capabilities there is a runtime cost.
* renderer_vulkan: Drop Vulkan-HppReinUsesLisp2020-04-111-1/+0
|
* video_core/texture: Use a LUT to convert sRGB texture bordersReinUsesLisp2020-04-081-0/+1
| | | | | | | | | | | | | | | | | | This is a reversed look up table extracted from https://gist.github.com/rygorous/2203834#file-gistfile1-cpp-L41-L62 that is used in https://github.com/devkitPro/deko3d/blob/04d4e9e587fa3dc5447b43d273bc45f440226e41/source/maxwell/tsc_generate.cpp#L38 Games usually bind 0xFD expecting a float texture border of 1.0f. The conversion previous to this commit was multiplying the uint8 sRGB texture border color by 255. This is close to 1.0f but when that difference matters, some graphical glitches appear. This look up table is manually changed in the edges, clamping towards 0.0f and 1.0f. While we are at it, move this logic to its own translation unit.
* renderer_vulkan/wrapper: Add ToString function for VkResultReinUsesLisp2020-03-271-0/+1
|
* renderer_vulkan/wrapper: Add Vulakn wrapper and a span helperReinUsesLisp2020-03-271-0/+1
| | | | | | | | | | | | | | | | | | | The intention behind a Vulkan wrapper is to drop Vulkan-Hpp. The issues with Vulkan-Hpp are: - Regular breaks of the API. - Copy constructors that do the same as the aggregates (fixed recently) - External dynamic dispatch that is hard to remove - Alias KHR handles with non-KHR handles making it impossible to use smart handles on Vulkan 1.0 instances with extensions that were included on Vulkan 1.1. - Dynamic dispatchers silently change size depending on preprocessor definitions. Different files will have different dispatch definitions, generating all kinds of hard to debug memory issues. In other words, Vulkan-Hpp is not "production ready" for our needs and this wrapper aims to replace it without losing RAII and exception safety.
* shader/transform_feedback: Add host API friendly TFB builderReinUsesLisp2020-03-131-0/+2
|
* video_core: Rename "const buffer locker" to "registry"ReinUsesLisp2020-03-091-2/+2
|
* gl_shader_cache: Rework shader cache and remove post-specializationsReinUsesLisp2020-03-091-2/+0
| | | | | Instead of pre-specializing shaders and then post-specializing them, drop the later and only "specialize" the shader while decoding it.
* dirty_flags: Deduplicate code between OpenGL and VulkanReinUsesLisp2020-02-281-0/+1
|
* vk_state_tracker: Initial implementationReinUsesLisp2020-02-281-0/+2
| | | | Add support for render targets and viewports.
* video_core: Reintroduce dirty flags infrastructureReinUsesLisp2020-02-281-0/+1
|
* gl_state: Remove completelyReinUsesLisp2020-02-281-2/+0
|
* gl_rasterizer: Remove dirty flagsReinUsesLisp2020-02-281-0/+2
|
* vk_query_cache: Implement generic query cache on VulkanReinUsesLisp2020-02-141-0/+2
|
* query_cache: Abstract OpenGL implementationReinUsesLisp2020-02-141-0/+1
| | | | | | Abstract the current OpenGL implementation into the VideoCommon namespace and reimplement it on top of that. Doing this avoids repeating code and logic in the Vulkan implementation.
* maxwell_3d: Slow implementation of passed samples (query 21)ReinUsesLisp2020-02-141-0/+2
| | | | Implements GL_SAMPLES_PASSED by waiting immediately for queries.
* Merge pull request #3337 from ReinUsesLisp/vulkan-stagedbunnei2020-02-031-0/+1
|\ | | | | yuzu: Implement Vulkan frontend
| * yuzu: Implement Vulkan frontendReinUsesLisp2020-01-291-0/+1
| | | | | | | | | | Adds a Qt and SDL2 frontend for Vulkan. It also finishes the missing bits on Vulkan initialization.
* | GPU: Implement guest driver profile and deduce texture handler sizes.Fernando Sahmkow2020-01-241-0/+2
|/
* vk_blit_screen: Initial implementationReinUsesLisp2020-01-201-0/+2
| | | | | This abstraction takes care of presenting accelerated and non-accelerated or "framebuffer" images to the Vulkan swapchain.
* vk_rasterizer: Implement Vulkan's rasterizerReinUsesLisp2020-01-171-0/+1
| | | | | | This abstraction is Vulkan's equivalent to OpenGL's rasterizer. It takes care of joining all parts of the backend and rendering accordingly on demand.
* renderer_vulkan: Add header as placeholderReinUsesLisp2020-01-171-0/+1
|
* vk_texture_cache: Implement generic texture cache on VulkanReinUsesLisp2020-01-141-1/+4
| | | | | It currently ignores PBO linearizations since these should be dropped as soon as possible on OpenGL.
* vk_compute_pass: Add compute passes to emulate missing Vulkan featuresReinUsesLisp2020-01-081-0/+2
| | | | | | | | | | | This currently only supports quad arrays and u8 indices. In the future we can remove quad arrays with a table written from the CPU, but this was used to bootstrap the other passes helpers and it was left in the code. The blob code is generated from the "shaders/" directory. Read the instructions there to know how to generate the SPIR-V.
* vk_shader_util: Add helper to build SPIR-V shadersReinUsesLisp2020-01-081-0/+2
|
* vk_graphics_pipeline: Initial implementationReinUsesLisp2020-01-071-0/+2
| | | | | | | | | This abstractio represents the state of the 3D engine at a given draw. Instead of changing individual bits of the pipeline how it's done in APIs like D3D11, OpenGL and NVN; on Vulkan we are forced to put everything together into a single, immutable object. It takes advantage of the few dynamic states Vulkan offers.
* vk_compute_pipeline: Initial implementationReinUsesLisp2020-01-071-0/+2
| | | | This abstraction represents a Vulkan compute pipeline.
* vk_pipeline_cache: Add file and define descriptor update template fillerReinUsesLisp2020-01-071-0/+2
| | | | | This function allows us to share code between compute and graphics pipelines compilation.
* vk_rasterizer: Add placeholderReinUsesLisp2020-01-071-0/+1
|
* vk_renderpass_cache: Initial implementationReinUsesLisp2020-01-061-0/+2
| | | | | The renderpass cache is used to avoid creating renderpasses on each draw. The hashed structure is not currently optimized.
* vk_update_descriptor: Initial implementationReinUsesLisp2020-01-061-1/+3
| | | | | | | | | The update descriptor is used to store in flat memory a large chunk of staging data used to update descriptor sets through templates. It provides a push interface to easily insert descriptors following the current pipeline. The order used in the descriptor update template has to be implicitly followed. We can catch bugs here using validation layers.
* Merge pull request #3264 from ReinUsesLisp/vk-descriptor-poolFernando Sahmkow2020-01-051-0/+2
|\ | | | | vk_descriptor_pool: Initial implementation
| * vk_descriptor_pool: Initial implementationReinUsesLisp2020-01-011-0/+2
| | | | | | | | | | | | | | | | | | | | Create a large descriptor pool where we allocate all our descriptors from. It has to be wide enough to support any pipeline, hence its large numbers. If the descritor pool is filled, we allocate more memory at that moment. This way we can take advantage of permissive drivers like Nvidia's that allocate more descriptors than what the spec requires.
* | yuzu: Remove Maxwell debuggerReinUsesLisp2020-01-031-2/+0
|/ | | | | This was carried from Citra and wasn't really used on yuzu. It also adds some runtime overhead. This commit removes it from yuzu's codebase.
* Merge pull request #3248 from ReinUsesLisp/vk-imageFernando Sahmkow2019-12-301-0/+2
|\ | | | | vk_image: Add an image object abstraction
| * vk_image: Add an image object abstractionReinUsesLisp2019-12-251-0/+2
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This object's job is to contain an image and manage its transitions. Since Nvidia hardware doesn't know what a transition is but Vulkan requires them anyway, we have to state track image subresources individually. To avoid the overhead of tracking each subresource in images with many subresources (think of cubemap arrays with several mipmaps), this commit tracks when subresources have diverged. As long as this doesn't happen we can check the state of the first subresource (that will be shared with all subresources) and update accordingly. Image transitions are deferred to the scheduler command buffer.
* | vk_staging_buffer_pool: Add a staging pool for temporary operationsReinUsesLisp2019-12-251-0/+2
|/ | | | | | | The job of this abstraction is to provide staging buffers for temporary operations. Think of image uploads or buffer uploads to device memory. It automatically deletes unused buffers.
* fixed_pipeline_state: Define structure and loadersReinUsesLisp2019-12-231-0/+2
| | | | | | | | | | | | | The intention behind this hasheable structure is to describe the state of fixed function pipeline state that gets compiled to a single graphics pipeline state object. This is all dynamic state in OpenGL but Vulkan wants it in an immutable state, even if hardware can edit it freely. In this commit the structure is defined in an optimized state (it uses booleans, has paddings and many data entries that can be packed to single integers). This is intentional as an initial implementation that is easier to debug, implement and review. It will be optimized in later stages, or it might change if Vulkan gets more dynamic states.
* video_core: Unify ProgramType and ShaderStage into ShaderTypeReinUsesLisp2019-11-231-0/+1
|
* texture_cache: Use a table instead of switch for texture formatsReinUsesLisp2019-11-151-0/+2
| | | | | | Use a large flat array to look up texture formats. This allows us to properly implement formats with different component types. It should also be faster.
* video_core: Enable sign conversion warningsRodrigo Locatti2019-11-111-1/+1
| | | Enable sign conversion warnings but don't treat them as errors.
* video_core: Treat implicit conversions as errorsReinUsesLisp2019-11-081-0/+6
|
* rasterizer_accelerated: Add intermediary for GPU rasterizersReinUsesLisp2019-10-271-0/+2
| | | | | | Add an intermediary class that implements common functions across GPU accelerated rasterizers. This avoids code repetition on different backends.
* VideoCore: Unify const buffer accessing along engines and provide ConstBufferLocker class to shaders.Fernando Sahmkow2019-10-251-2/+5
|
* Shader_Ir: Refactor Decompilation process and allow multiple decompilation modes.Fernando Sahmkow2019-10-051-0/+2
|
* shader_ir: Corrections to outward movements and misc stuffsFernando Sahmkow2019-10-051-0/+1
|
* shader_ir: Initial Decompile SetupFernando Sahmkow2019-10-051-0/+3
|
* Merge pull request #2783 from FernandoS27/new-buffer-cachebunnei2019-08-291-1/+3
|\ | | | | Implement a New LLE Buffer Cache
| * Video_Core: Implement a new Buffer CacheFernando Sahmkow2019-08-211-1/+3
| |
* | shader_ir: Implement VOTEReinUsesLisp2019-08-211-0/+1
|/ | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Implement VOTE using Nvidia's intrinsics. Documentation about these can be found here https://developer.nvidia.com/reading-between-threads-shader-intrinsics Instead of using portable ARB instructions I opted to use Nvidia intrinsics because these are the closest we have to how Tegra X1 hardware renders. To stub VOTE on non-Nvidia drivers (including nouveau) this commit simulates a GPU with a warp size of one, returning what is meaningful for the instruction being emulated: * anyThreadNV(value) -> value * allThreadsNV(value) -> value * allThreadsEqualNV(value) -> true ballotARB, also known as "uint64_t(activeThreadsNV())", emits VOTE.ANY Rd, PT, PT; on nouveau's compiler. This doesn't match exactly to Nvidia's code VOTE.ALL Rd, PT, PT; Which is emulated with activeThreadsNV() by this commit. In theory this shouldn't really matter since .ANY, .ALL and .EQ affect the predicates (set to PT on those cases) and not the registers.
* Merge pull request #2675 from ReinUsesLisp/opengl-buffer-cachebunnei2019-07-151-2/+1
|\ | | | | buffer_cache: Implement a generic buffer cache and its OpenGL backend
| * buffer_cache: Implement a generic buffer cacheReinUsesLisp2019-07-061-0/+1
| | | | | | | | | | | | Implements a templated class with a similar approach to our current generic texture cache. It is designed to be compatible with Vulkan and OpenGL,
| * gl_rasterizer: Drop gl_global_cache in favor of gl_buffer_cacheReinUsesLisp2019-07-061-2/+0
| |
* | shader_ir: Implement a new shader scannerFernando Sahmkow2019-07-091-0/+2
|/
* shader: Decode SUST and implement backing image functionalityReinUsesLisp2019-06-211-0/+1
|
* gl_framebuffer_cache: Use a hashed struct to cache framebuffersReinUsesLisp2019-06-211-0/+2
|
* texture_cache: Split texture cache into different filesReinUsesLisp2019-06-211-2/+7
|
* gl_texture_cache: Initial implementationReinUsesLisp2019-06-211-2/+2
|
* video_core/engines: Move ConstBufferInfo out of Maxwell3DReinUsesLisp2019-06-081-0/+1
|
* Merge pull request #2514 from ReinUsesLisp/opengl-compatZach Hilman2019-06-071-2/+0
|\ | | | | video_core: Drop OpenGL core in favor of OpenGL compatibility
| * gl_rasterizer: Use GL_QUADS to emulate quads renderingReinUsesLisp2019-05-301-2/+0
| |
* | shader: Move Node declarations out of the shader IR headerReinUsesLisp2019-06-071-0/+1
| | | | | | | | | | | | Analysis passes do not have a good reason to depend on shader_ir.h to work on top of nodes. This splits node-related declarations to their own file and leaves the IR in shader_ir.h
* | shader: Use shared_ptr to store nodes and move initialization to fileReinUsesLisp2019-06-061-0/+2
|/ | | | | | | | | Instead of having a vector of unique_ptr stored in a vector and returning star pointers to this, use shared_ptr. While changing initialization code, move it to a separate file when possible. This is a first step to allow code analysis and node generation beyond the ShaderIR class.
* Merge pull request #2429 from FernandoS27/computebunnei2019-05-091-0/+2
|\ | | | | Corrections and Implementation on GPU Engines
| * Revamp Kepler Memory to use a subegine to manage uploadsFernando Sahmkow2019-04-231-0/+2
| |
* | Merge pull request #2383 from ReinUsesLisp/aoffi-testbunnei2019-04-231-0/+2
|\ \ | |/ |/| gl_shader_decompiler: Disable variable AOFFI on unsupported devices
| * gl_device: Implement interface and add uniform offset alignmentReinUsesLisp2019-04-101-0/+2
| |
* | Merge pull request #2318 from ReinUsesLisp/sampler-cachebunnei2019-04-181-0/+4
|\ \ | | | | | | gl_sampler_cache: Port sampler cache to OpenGL
| * | gl_sampler_cache: Port sampler cache to OpenGLReinUsesLisp2019-04-021-0/+2
| | |
| * | video_core: Abstract vk_sampler_cache into a templated classReinUsesLisp2019-04-021-0/+2
| | |
* | | Merge pull request #2235 from ReinUsesLisp/spirv-decompilerbunnei2019-04-121-1/+6
|\ \ \ | | | | | | | | vk_shader_decompiler: Implement a SPIR-V decompiler
| * | | vk_shader_decompiler: Declare and stub interface for a SPIR-V decompilerReinUsesLisp2019-04-101-0/+2
| | | |
| * | | video_core: Add sirit as optional dependency with VulkanReinUsesLisp2019-04-101-1/+4
| | |/ | |/| | | | | | | sirit is a runtime assembler for SPIR-V
* | | Merge pull request #2278 from ReinUsesLisp/vc-texture-cachebunnei2019-04-111-0/+2
|\ \ \ | |/ / |/| | video_core: Implement API agnostic view based texture cache
| * | video_core: Implement API agnostic view based texture cacheReinUsesLisp2019-03-221-0/+2
| |/ | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Implements an API agnostic texture view based texture cache. Classes defined here are intended to be inherited by the API implementation and used in API-specific code. This implementation exposes protected virtual functions to be called from the implementer. Before executing any surface copies methods (defined in API-specific code) it tries to detect if the overlapping surface is a superset and if it is, it creates a view. Views are references of a subset of a surface, it can be a superset view (the same as referencing the whole texture). Current code manages 1D, 1D array, 2D, 2D array, cube maps and cube map arrays with layer and mipmap level views. Texture 3D slices views are not implemented. If the view attempt fails, the fast path is invoked with the overlapping textures (defined in the implementer). If that one fails (returning nullptr) it will flush and reload the texture.
* | Merge pull request #2093 from FreddyFunk/disk-cache-better-compressionbunnei2019-04-041-1/+1
|\ \ | | | | | | Better LZ4 compression utilization for the disk based shader cache and the yuzu build system
| * | data_compression: Move LZ4 compression from video_core/gl_shader_disk_cache to common/data_compressionunknown2019-03-291-1/+1
| |/
* / vk_swapchain: Implement a swapchain managerReinUsesLisp2019-03-291-1/+3
|/
* vk_sampler_cache: Implement a sampler cacheReinUsesLisp2019-03-131-0/+2
|
* Merge pull request #2147 from ReinUsesLisp/texture-cleanbunnei2019-03-101-0/+1
|\ | | | | shader_ir: Remove "extras" from the MetaTexture
| * shader/decode: Split memory and texture instructions decodingReinUsesLisp2019-02-261-0/+1
| |
* | Merge pull request #2191 from ReinUsesLisp/maxwell-to-vkbunnei2019-03-081-0/+2
|\ \ | | | | | | maxwell_to_vk: Initial implementation
| * | maxwell_to_vk: Initial implementationReinUsesLisp2019-03-041-0/+2
| | |
* | | Merge pull request #2055 from bunnei/gpu-threadbunnei2019-03-071-0/+6
|\ \ \ | | | | | | | | Asynchronous GPU command processing
| * | | gpu: Refactor a/synchronous implementations into their own classes.bunnei2019-03-071-0/+4
| | | |
| * | | gpu: Move command processing to another thread.bunnei2019-03-071-0/+2
| |/ /
* | | Merge pull request #2149 from ReinUsesLisp/decoders-stylebunnei2019-03-071-0/+2
|\ \ \ | |/ / |/| | gl_rasterizer_cache: Move format conversion functions to their own file
| * | gl_rasterizer_cache: Move format conversion to its own fileReinUsesLisp2019-02-271-0/+2
| |/
* | vk_buffer_cache: Implement a buffer cacheReinUsesLisp2019-03-011-0/+2
| | | | | | | | | | This buffer cache is just like OpenGL's buffer cache with some minor style changes. It uses VKStreamBuffer.
* | vk_stream_buffer: Implement a stream bufferReinUsesLisp2019-02-241-1/+3
|/ | | | | | | | | | | | | | This manages two kinds of streaming buffers: one for unified memory models and one for dedicated GPUs. The first one skips the copy from the staging buffer to the real buffer, since it creates an unified buffer. This implementation waits for all fences to finish their operation before "invalidating". This is suboptimal since it should allocate another buffer or start searching from the beginning. There is room for improvement here. This could also handle AMD's "pinned" memory (a heap with 256 MiB) that seems to be designed for buffer streaming.
* vk_scheduler: Implement a schedulerReinUsesLisp2019-02-221-1/+3
| | | | | | | | | | | | | The scheduler abstracts command buffer and fence management with an interface that's able to do OpenGL-like operations on Vulkan command buffers. It returns by value a command buffer and fence that have to be used for subsequent operations until Flush or Finish is executed, after that the current execution context (the pair of command buffers and fences) gets invalidated a new one must be fetched. Thankfully validation layers will quickly detect if this is skipped throwing an error due to modifications to a sent command buffer.
* vk_memory_manager: Implement memory managerReinUsesLisp2019-02-191-0/+2
| | | | | A memory manager object handles the memory allocations for a device. It allocates chunks of Vulkan memory objects and then suballocates.
* vk_resource_manager: Add VKResource interfaceReinUsesLisp2019-02-141-1/+3
| | | | | VKResource is an interface that gets signaled by a fence when it is free to be reused.
* Merge pull request #2113 from ReinUsesLisp/vulkan-basebunnei2019-02-141-0/+10
|\ | | | | vulkan: Add dependencies and device abstraction
| * vk_device: Abstract device handling into a classReinUsesLisp2019-02-131-1/+4
| | | | | | | | | | | | | | VKDevice contains all the data required to manage and initialize a physical device. Its intention is to be passed across Vulkan objects to query device-specific data (for example the logical device and the dispatch loader).
| * renderer_vulkan: Add declarations fileReinUsesLisp2019-02-121-0/+7
| | | | | | | | | | | | | | This file is intended to be included instead of vulkan/vulkan.hpp. It includes declarations of unique handlers using a dynamic dispatcher instead of a static one (which would require linking to a Vulkan library).
* | kepler_compute: Fixup assert and rename enginesReinUsesLisp2019-02-101-2/+2
|/ | | | | | | | | | When I originally added the compute assert I used the wrong documentation. This addresses that. The dispatch register was tested with homebrew against hardware and is triggered by some games (e.g. Super Mario Odyssey). What exactly is missing to get a valid program bound by this engine requires more investigation.
* gl_shader_disk_cache: Compress GLSL code using LZ4ReinUsesLisp2019-02-071-1/+1
|
* gl_shader_disk_cache: Add file and move BaseBindings declarationReinUsesLisp2019-02-071-0/+2
|
* shader_decode: Implement LDG and basic cbuf trackingReinUsesLisp2019-01-301-0/+1
|
* video_core: Rename glsl_decompiler to gl_shader_decompilerReinUsesLisp2019-01-151-2/+2
|
* shader_decode: Implement VMAD and VSETPReinUsesLisp2019-01-151-0/+1
|
* video_core: Replace gl_shader_decompilerReinUsesLisp2019-01-151-2/+0
|
* glsl_decompiler: ImplementationReinUsesLisp2019-01-151-0/+2
|
* shader_ir: Initial implementationReinUsesLisp2019-01-151-0/+27
|
* gl_global_cache: Add dummy global cache managerReinUsesLisp2019-01-081-0/+2
|
* gpu: Rewrite GPU command list processing with DmaPusher class.bunnei2018-11-271-2/+2
| | | | - More accurate impl., fixes Undertale (among other games).
* video_core: Move morton functions to their own fileReinUsesLisp2018-11-251-1/+2
|
* rasterizer_cache: Add missing virtual destructor to RasterizerCacheObjectLioncash2018-11-081-0/+1
| | | | Ensures that destruction will always do the right thing in any context.
* gl_resource_manager: Split implementations in .cpp file.Markus Wick2018-11-061-0/+1
| | | | | Those implementations are quite costly, so there is no need to inline them to the caller. Ressource deletion is often a performance bug, so in this way, we support to add breakpoints to them.
* video_core: Move surface declarations out of gl_rasterizer_cacheReinUsesLisp2018-10-301-0/+2
|
* video_core: Move OpenGL specific utils to its rendererReinUsesLisp2018-10-291-0/+2
|
* gl_rasterizer: Implement quads topologyReinUsesLisp2018-10-041-0/+2
|
* Merge pull request #1290 from FernandoS27/shader-headerbunnei2018-09-181-0/+1
|\ | | | | Implemented (Partialy) Shader Header
| * Implemented (Partialy) Shader HeaderFernandoS272018-09-111-0/+1
| |
* | GPU: Basic implementation of the Kepler Inline Memory engine (p2mf).Subv2018-09-121-0/+2
|/ | | | This engine writes data from a FIFO register into the configured address.
* video_core/CMakeLists: Add missing gl_buffer_cache.hLioncash2018-09-061-0/+1
| | | | | Without this, the header file won't show up by default within IDEs such as Visual Studio.
* renderer_opengl: Implement a buffer cache.Markus Wick2018-09-051-0/+1
| | | | | | | | | The idea of this cache is to avoid redundant uploads. So we are going to cache the uploaded buffers within the stream_buffer and just reuse the old pointers. The next step is to implement a VBO cache on GPU memory, but for now, I want to check the overhead of the cache management. Fetching the buffer over PCI-E should be quite fast.
* renderer_opengl: Implement a new shader cache.bunnei2018-08-281-0/+2
|
* video_core: Add RasterizerCache class for common cache management code.bunnei2018-08-281-0/+1
|
* gl_rasterizer: Implement texture format ASTC_2D_4X4.bunnei2018-06-181-0/+2
|
* GPU: Partially implemented the Maxwell DMA engine.Subv2018-06-121-0/+2
| | | | Only tiled->linear and linear->tiled copies that aren't offsetted are supported for now. Queries are not supported. Swizzled copies are not supported.
* renderer_opengl: Add gl_shader_manager class.bunnei2018-04-141-0/+2
|
* shader_bytecode: Add initial module for shader decoding.bunnei2018-04-141-0/+1
|
* GPU: Implemented a gpu macro interpreter.Subv2018-04-011-0/+2
| | | | | | The Ryujinx macro interpreter and envydis were used as reference. Macros are programs that are uploaded by the games during boot and can later be called by writing to their method id in a GPU command buffer.
* maxwell_to_gl: Add module and function for decoding VertexType.bunnei2018-03-271-0/+1
|
* Frontend: Ported the GPU breakpoints and surface viewer widgets from citra.Subv2018-03-241-0/+2
|
* GPU: Preliminary work for texture decoding.Subv2018-03-241-0/+3
|
* renderer_gl: Port boilerplate rasterizer code over from Citra.bunnei2018-03-201-0/+3
|
* renderer_gl: Port over gl_shader_gen module from Citra.bunnei2018-03-201-0/+2
|
* renderer_gl: Port over gl_shader_decompiler module from Citra.bunnei2018-03-201-0/+2
|
* renderer_gl: Port over gl_rasterizer_cache module from Citra.bunnei2018-03-201-0/+2
|
* renderer_gl: Port over gl_stream_buffer module from Citra.bunnei2018-03-201-0/+2
|
* GPU: Move the GPU's class constructor and destructors to a cpp file.Subv2018-03-181-0/+1
| | | | This should reduce recompile times when editing the Maxwell3D register structure.
* Make a GPU class in VideoCore to contain the GPU state.Subv2018-02-121-0/+3
| | | | Also moved the GPU MemoryManager class to video_core since it makes more sense for it to be there.
* GPU: Added a command processor to decode the GPU pushbuffers and forward the commands to their respective engines.Subv2018-02-121-0/+8
|
* CMakeLists: Derive the source directory grouping from targets themselvesLioncash2018-01-181-19/+15
| | | | | Removes the need to store to separate SRC and HEADER variables, and then construct the target in most cases.
* Remove references to PICA and rasterizers in video_coreJames Rowe2018-01-131-74/+1
|
* pica/command_processor: build geometry pipeline and run geometry shaderwwylele2017-08-191-0/+2
| | | | | | | | | | | | The geometry pipeline manages data transfer between VS, GS and primitive assembler. It has known four modes: - no GS mode: sends VS output directly to the primitive assembler (what citra currently does) - GS mode 0: sends VS output to GS input registers, and sends GS output to primitive assembler - GS mode 1: sends VS output to GS uniform registers, and sends GS output to primitive assembler. It also takes an index from the index buffer at the beginning of each primitive for determine the primitive size. - GS mode 2: similar to mode 1, but doesn't take the index and uses a fixed primitive size. hwtest shows that immediate mode also supports GS (at least for mode 0), so the geometry pipeline gets refactored into its own class for supporting both drawing mode. In the immediate mode, some games don't set the pipeline registers to a valid value until the first attribute input, so a geometry pipeline reset flag is set in `pipeline.vs_default_attributes_setup.index` trigger, and the actual pipeline reconfigure is triggered in the first attribute input. In the normal drawing mode with index buffer, the vertex cache is a little bit modified to support the geometry pipeline. Instead of OutputVertex, it now holds AttributeBuffer, which is the input to the geometry pipeline. The AttributeBuffer->OutputVertex conversion is done inside the pipeline vertex handler. The actual hardware vertex cache is believed to be implemented in a similar way (because this is the only way that makes sense). Both geometry pipeline and GS unit rely on states preservation across drawing call, so they are put into the global state. In the future, the other three vertex shader units should be also placed in the global state, and a scheduler should be implemented on top of the four units. Note that the current gs_unit already allows running VS on it in the future.
* SwRasterizer/Lighting: shorten file namewwylele2017-08-031-2/+2
|
* SwRasterizer/Lighting: move to its own filewwylele2017-08-021-0/+2
|
* CMake: Create INTERFACE targets for microprofile and nihstroYuri Kunde Schlesner2017-05-281-1/+1
|
* CMake: Use IMPORTED target for libpngYuri Kunde Schlesner2017-05-281-3/+2
|
* CMake: Correct inter-module dependencies and library visibilityYuri Kunde Schlesner2017-05-281-5/+7
| | | | | | | | | | Modules didn't correctly define their dependencies before, which relied on the frontends implicitly including every module for linking to succeed. Also changed every target_link_libraries call to specify visibility of dependencies to avoid leaking definitions to dependents when not necessary.
* pica/swrasterizer: implement procedural texturewwylele2017-05-201-0/+2
|
* SWRasterizer: Move texturing functions to their own fileYuri Kunde Schlesner2017-02-131-0/+2
|
* SWRasterizer: Move framebuffer operation functions to their own fileYuri Kunde Schlesner2017-02-131-0/+2
|
* VideoCore: Move software rasterizer files to sub-directoryYuri Kunde Schlesner2017-02-131-6/+6
|
* VideoCore: Move Regs to its own fileYuri Kunde Schlesner2017-02-041-0/+2
|
* VideoCore: Split shader regs from Regs structYuri Kunde Schlesner2017-02-041-0/+1
|
* VideoCore: Split geometry pipeline regs from Regs structYuri Kunde Schlesner2017-02-041-0/+1
|
* VideoCore: Split lighting regs from Regs structYuri Kunde Schlesner2017-02-041-0/+1
|
* VideoCore: Split framebuffer regs from Regs structYuri Kunde Schlesner2017-02-041-0/+1
|
* VideoCore: Split texturing regs from Regs structYuri Kunde Schlesner2017-02-041-0/+1
|
* VideoCore: Split rasterizer regs from Regs structYuri Kunde Schlesner2017-02-041-0/+1
|
* Pica/Texture: Move part of ETC1 decoding to new file and cleanupsYuri Kunde Schlesner2017-02-041-0/+2
|
* VideoCore: Move LookupTexture out of debug_utils.hYuri Kunde Schlesner2017-02-041-16/+18
|
* VideoCore/Shader: Split interpreter and JIT into separate ShaderEnginesYuri Kunde Schlesner2017-01-261-0/+2
|
* VideoCore/Shader: Rename shader_jit_x64{ => _compiler}.{cpp,h}Yuri Kunde Schlesner2017-01-261-2/+2
|
* VideoCore/Shader: Move DebugData to a separate fileYuri Kunde Schlesner2016-12-161-0/+1
|
* VideoCore: Convert x64 shader JIT to use Xbyak for assemblyYuri Kunde Schlesner2016-12-151-0/+3
|
* Remove TGA dumperJannik Vogel2016-04-301-1/+0
|
* Refactor: Extract VertexLoader from command_processor.cpp.Henrik Rydgard2016-04-281-0/+2
| | | | Preparation for a similar concept to Dolphin or PPSSPP. These can be JIT-ed and cached.
* Add immediate mode vertex submissionDwayne Slater2016-03-031-0/+1
|
* pica: Add pica_types module and move float24 definition.bunnei2016-02-051-0/+1
|
* VideoCore: Unify interface to OpenGL and SW rasterizersYuri Kunde Schlesner2015-12-081-1/+4
| | | | | | This removes explicit checks sprinkled all over the codebase to instead just have the SW rasterizer expose an implementation with no-ops for most operations.
* renderer_opengl: Refactor shader generation/caching to be more organized + various cleanups.bunnei2015-10-221-1/+2
|
* Replace the previous OpenGL loader with a glad-generated 3.3 oneYuri Kunde Schlesner2015-08-301-2/+1
| | | | | | The main advantage of switching to glad from glLoadGen is that, apart from being actively maintained, it supports a customizable entrypoint loader function, which makes it possible to also support OpenGL ES.
* Rename ARCHITECTURE_X64 definition to ARCHITECTURE_x86_64.bunnei2015-08-161-1/+1
|
* x64: Refactor to remove fake interfaces and general cleanups.bunnei2015-08-161-6/+4
|
* Shader: Initial implementation of x86_x64 JIT compiler for Pica vertex shaders.bunnei2015-08-161-0/+10
| | | | | - Config: Add an option for selecting to use shader JIT or interpreter. - Qt: Add a menu option for enabling/disabling the shader JIT.
* Shader: Define a common interface for running vertex shader programs.bunnei2015-08-151-0/+2
|
* Shader: Move shader code to its own subdirectory, "shader".bunnei2015-08-151-2/+2
|
* GPU: Refactor "VertexShader" namespace to "Shader".bunnei2015-08-151-2/+2
| | | | - Also renames "vertex_shader.*" to "shader_interpreter.*"
* OpenGL: Make OpenGL object resource wrappers fully inlineYuri Kunde Schlesner2015-07-261-1/+0
| | | | | The functions are so simple that having them separate only bloats the code and hinders optimization.
* Move video_core/color.h to common/color.harchshift2015-05-301-1/+0
|
* Move video_core/math.h to common/vector_math.harchshift2015-05-301-1/+0
| | | | The file only contained vector manipulation code, and such widely-useable code doesn't belong in video_core.
* Pica: Create 'State' structure and move state memory there.bunnei2015-05-231-0/+1
|
* OpenGL renderertfarley2015-05-231-1/+11
|
* GPU: Added RGB565/RGB8 framebuffer support and various cleanups.bunnei2015-03-041-0/+1
| | | | | | - Centralizes color format encode/decode functions. - Fixes endianness issues. - Implements remaining framebuffer formats in the debugger.
* CMake cleanupYuri Kunde Schlesner2014-09-011-13/+26
| | | | | | | | Several cleanups to the buildsystem: - Do better factoring of common libs between platforms. - Add support to building on Windows. - Remove Qt4 support. - Re-sort file lists and add missing headers.
* Replace GLEW with a glLoadGen loader.Yuri Kunde Schlesner2014-09-011-0/+2
| | | | | | | | | This should fix the GL loading errors that occur in some drivers due to the use of deprecated functions by GLEW. Side benefits are more accurate auto-completion (deprecated function and symbols don't exist) and faster pointer loading (less entrypoints to load). In addition it removes an external library depency, simplifying the build system a bit and eliminating one set of binary libraries for Windows.
* Rewrite of OpenGL renderer, including OS X supportKevin Hartman2014-08-261-4/+7
| | | | | | Screen contents are now displayed using textured quads. This can be updated to expose an FBO once an OpenGL backend for when Pica rendering is being worked on. That FBO's texture can then be applied to the quads. Previously, FBO blitting was used in order to display screen contents, which did not work on OS X. The new textured quad approach is less of a compatibility risk.
* Pica: Add debug utility functions for dumping geometry data.Tony Wasserka2014-08-251-0/+2
|
* Pica: Add basic rasterizer.Tony Wasserka2014-08-121-0/+2
|
* Pica: Add triangle clipper.Tony Wasserka2014-08-121-2/+4
|
* Pica: Add primitive assembly stage.Tony Wasserka2014-08-121-0/+2
|
* Pica: Add vertex shader implementation.Tony Wasserka2014-08-121-0/+2
|
* Pica: Add command processor.Tony Wasserka2014-08-121-2/+5
|
* Video core: Add utility class for vector operations.Tony Wasserka2014-08-121-1/+2
| | | | | I wrote most of this for ppsspp, so I hold full copyright over it. In addition to the original release in ppsspp, this provides functionality to easily extend e.g. two-dimensional vectors to three-dimensional vectors.
* CMakeLists: rename HEADS, improved commentsarchshift2014-05-201-2/+2
| | | | Changes for clarity of comments, removed redundant compiler flags.
* IT'S ALIVE!archshift2014-04-291-1/+6
|
* fixed a bunch of errors in CMakeListsbunnei2014-04-101-3/+3
|
* updated CMakeListsbunnei2014-04-101-16/+2
|
* added video_core project to solutionbunnei2014-04-051-0/+19