Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | VideoCore: Add option to dump the macros. | Fernando Sahmkow | 2022-05-09 | 1 | -0/+27 |
| | | | | Co-Authored-By: liamwhite <liamwhite@users.noreply.github.com> | ||||
* | video_core/macro_jit_x64: warn on invalid parameter access | Liam | 2022-05-08 | 1 | -3/+21 |
| | |||||
* | Merge pull request #8280 from Tachi107/spdx-fixup | Mai M | 2022-04-29 | 8 | -151/+17 |
|\ | | | | | chore: add missing SPDX tags | ||||
| * | chore: add missing SPDX tags | Andrea Pappacoda | 2022-04-28 | 8 | -151/+17 |
| | | | | | | | | Follow-up to 99ceb03a1cfcf35968cab589ea188a8c406cda52 | ||||
* | | Merge pull request #8282 from liamwhite/gcc-12 | Mai M | 2022-04-29 | 1 | -2/+2 |
|\ \ | |/ |/| | GCC 12 fixes | ||||
| * | GCC 12 fixes | Liam | 2022-04-28 | 1 | -2/+2 |
| | | |||||
* | | renderer_vulkan: Update screen info if the framebuffer size has changed | Morph | 2022-04-26 | 1 | -0/+5 |
|/ | |||||
* | general: Convert source file copyright comments over to SPDX | Morph | 2022-04-23 | 225 | -675/+450 |
| | | | | | This formats all copyright comments according to SPDX formatting guidelines. Additionally, this resolves the remaining GPLv2 only licensed files by relicensing them to GPLv2.0-or-later. | ||||
* | bootmanager: Don't create another screenshot request if previous one is not done yet | german77 | 2022-04-18 | 2 | -0/+7 |
| | |||||
* | video_core: implement formats for N64 emulation | Fernando Sahmkow | 2022-04-14 | 8 | -7/+102 |
| | |||||
* | buffer_cache: cap vertex buffer sizes | Liam | 2022-04-14 | 1 | -1/+14 |
| | |||||
* | maxwell3d: add small_index_2 register | Liam | 2022-04-14 | 2 | -1/+11 |
| | |||||
* | Merge pull request #8149 from liamwhite/front-face | bunnei | 2022-04-09 | 1 | -1/+8 |
|\ | | | | | OpenGL: flip front faces if Z scale is inverted | ||||
| * | OpenGL: propagate face flip condition | Liam | 2022-04-04 | 1 | -4/+10 |
| | | |||||
| * | OpenGL: flip front faces if Z scale is inverted | Liam | 2022-04-04 | 1 | -2/+3 |
| | | |||||
* | | video_core: Replace lock_guard with scoped_lock | Merry | 2022-04-07 | 11 | -18/+18 |
| | | |||||
* | | Merge pull request #8161 from liamwhite/gl-s8d24 | Fernando S | 2022-04-07 | 6 | -4/+58 |
|\ \ | | | | | | | OpenGL: fix S8D24 to ABGR8 conversions | ||||
| * | | OpenGL: fix S8D24 to ABGR8 conversions | Liam | 2022-04-07 | 6 | -4/+58 |
| | | | |||||
| * | | Revert "texture_cache/util: Remove unneeded ReadBlockUnsafe" | bunnei | 2022-04-05 | 1 | -0/+1 |
| | | | |||||
| * | | texture_cache/util: Remove unneeded ReadBlockUnsafe | ameerj | 2022-04-04 | 1 | -1/+0 |
| |/ | | | | | | | This call was reading GPU memory into the dst buffer, which is then overwritten by the SwizzleTexture call. | ||||
* | | Merge pull request #8152 from liamwhite/gl-crop | Fernando S | 2022-04-07 | 3 | -1/+10 |
|\ \ | | | | | | | OpenGL: fix cropping | ||||
| * | | OpenGL: fix cropping | Liam | 2022-04-04 | 3 | -1/+10 |
| |/ | |||||
* / | Vulkan: crop to screen dimensions if crop not explicitly requested | Liam | 2022-04-04 | 1 | -2/+3 |
|/ | |||||
* | fix: typos | Andrea Pappacoda | 2022-04-02 | 1 | -1/+1 |
| | |||||
* | Merge pull request #8128 from FernandoS27/gc-fixes | Fernando S | 2022-04-01 | 2 | -3/+1 |
|\ | | | | | GPU Garbage Collection: Fix regressions. | ||||
| * | GPU Garbage Collection: Fix regressions. | Fernando Sahmkow | 2022-04-01 | 2 | -3/+1 |
| | | |||||
* | | Merge pull request #8066 from ameerj/gpu-decode-fixes | Fernando S | 2022-04-01 | 1 | -14/+21 |
|\ \ | |/ |/| | codec: Fix GPU decoder detection on Windows and a memory leak | ||||
| * | codec: Plug GPU decoder memory leak | ameerj | 2022-03-22 | 1 | -0/+2 |
| | | |||||
| * | codec: Disable HW_FRAMES method check on Windows | ameerj | 2022-03-22 | 1 | -14/+19 |
| | | | | | | | | | | | | It was reported that this method causes crashes on certain Linux decoding backends, hence the check to avoid it. This subsequently caused Windows GPU decoders to never be selected and always fall back to CPU decoding, disable the check on Windows for now. | ||||
* | | Merge pull request #8076 from ameerj/nv-vk-msaa-scale | bunnei | 2022-03-31 | 3 | -7/+8 |
|\ \ | | | | | | | Vulkan: Use 3D helpers for MSAA scaling on NV drivers 510+ | ||||
| * | | Vulkan: Use 3D helpers for MSAA scaling on NV drivers 510+ | ameerj | 2022-03-24 | 3 | -7/+8 |
| | | | | | | | | | | | | Nvidia Vulkan drivers 510+ crash when blitting MSAA images. Fall-back to 3D scale helpers for MSAA image scaling. | ||||
* | | | gl_rasterizer: Avoid scenario locking already owned mutex | ameerj | 2022-03-29 | 1 | -3/+3 |
| | | | | | | | | | | | | | | | | | | gpu.TickWork() may lock the texture_cache and buffer_cache mutexes, which are owned by the thread prior to invoking TickWork(). Defer invoking gpu.TickWork() until the scope ends, where the owned mutexes are released. | ||||
* | | | Revert "Memory GPU <-> CPU: reduce infighting in the texture cache by adding CPU Cached memory." | bunnei | 2022-03-26 | 5 | -64/+3 |
| | | | |||||
* | | | Texture Cache: Add Cached CPU system. | Fernando Sahmkow | 2022-03-25 | 5 | -3/+64 |
| | | | |||||
* | | | Merge pull request #7720 from FernandoS27/yfc-gc | bunnei | 2022-03-25 | 20 | -43/+259 |
|\ \ \ | | | | | | | | | First Nugget: Reworked Garbage Collection to be smarter [originally from Project YFC] | ||||
| * | | | GC: Address Feedback. | Fernando Sahmkow | 2022-03-25 | 7 | -29/+37 |
| | | | | |||||
| * | | | Garbage Collection: Final tuning. | Fernando Sahmkow | 2022-03-25 | 6 | -24/+36 |
| | | | | |||||
| * | | | Buffer Cache: Tune to the levels of the new GC. | Fernando Sahmkow | 2022-03-25 | 6 | -6/+78 |
| | | | | |||||
| * | | | Garbage Collection: Redesign the algorithm to do a better use of memory. | Fernando Sahmkow | 2022-03-25 | 13 | -32/+156 |
| | | | | |||||
* | | | | hle: nvflinger: Migrate android namespace -> Service::android. | bunnei | 2022-03-25 | 6 | -18/+18 |
| | | | | |||||
* | | | | hle: vi: Integrate new NVFlinger and HosBinderDriverServer service. | bunnei | 2022-03-25 | 1 | -0/+1 |
| | | | | |||||
* | | | | hle: nvflinger: Move BufferTransformFlags to its own header. | bunnei | 2022-03-25 | 1 | -17/+2 |
| | | | | |||||
* | | | | hle: nvflinger: Move PixelFormat to its own header. | bunnei | 2022-03-25 | 6 | -23/+19 |
|/ / / | |||||
* / / | buffer_cache: reset cached write bits after flushing invalidations | Liam | 2022-03-24 | 1 | -1/+2 |
|/ / | |||||
* | | Merge pull request #8048 from ameerj/include-purge | bunnei | 2022-03-22 | 76 | -138/+14 |
|\ \ | |/ |/| | general: Reduce unused includes across the project | ||||
| * | general: Fix clang/gcc build errors | ameerj | 2022-03-20 | 2 | -0/+2 |
| | | |||||
| * | common: Reduce unused includes | ameerj | 2022-03-19 | 1 | -0/+1 |
| | | |||||
| * | video_core: Reduce unused includes | ameerj | 2022-03-19 | 75 | -139/+12 |
| | | |||||
* | | Merge pull request #7812 from FernandoS27/made-straight-from-the-nut | bunnei | 2022-03-20 | 1 | -6/+14 |
|\ \ | | | | | | | BufferCache: Find direction of the stream buffer increase. | ||||
| * | | BufferCache: Find direction of the stream buffer increase. | Fernando Sahmkow | 2022-03-20 | 1 | -6/+14 |
| | | | |||||
* | | | Merge pull request #8036 from ameerj/starbit-nv | Fernando S | 2022-03-20 | 1 | -5/+0 |
|\ \ \ | | | | | | | | | vk_texture_cache: Do not reinterpret DepthStencil source images | ||||
| * | | | vk_texture_cache: Do not reinterpret DepthStencil source images | ameerj | 2022-03-18 | 1 | -5/+0 |
| | | | | | | | | | | | | | | | | | | | | | | | | Fixes star pointer interactions in Super Mario Galaxy on some drivers, notably Nvidia. Co-Authored-By: Fernando S. <1731197+fernandos27@users.noreply.github.com> | ||||
* | | | | Merge pull request #7840 from lioncash/bitor | bunnei | 2022-03-20 | 1 | -15/+3 |
|\ \ \ \ | |_|_|/ |/| | | | texture_cache: Amend unintended bitwise OR in SynchronizeAliases | ||||
| * | | | texture_cache: Ensure has_blacklisted is always initialized | Lioncash | 2022-02-02 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | Resolves a -Wmaybe_uninitialized warning | ||||
| * | | | texture_cache: Remove dead code within SynchronizeAliases | Lioncash | 2022-02-02 | 1 | -13/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Since these were being copied by value, none of the changes applied in the loop would be reflected. However, from the looks of it, this would already be applied within CopyImage() anyways, so this can be removed. | ||||
| * | | | texture_cache: Amend unintended bitwise OR in SynchronizeAliases | Lioncash | 2022-02-02 | 1 | -1/+1 |
| | | | | |||||
* | | | | general: Reduce core.h includes | ameerj | 2022-03-18 | 4 | -5/+0 |
| |/ / |/| | | |||||
* | | | Merge pull request #8030 from liamwhite/s8d24-conversion | Fernando S | 2022-03-18 | 5 | -2/+41 |
|\ \ \ | | | | | | | | | Vulkan: convert S8D24 <-> ABGR8 | ||||
| * | | | Address review comments | Liam | 2022-03-16 | 2 | -2/+2 |
| | | | | |||||
| * | | | Vulkan: convert S8D24 <-> ABGR8 | Liam | 2022-03-16 | 5 | -2/+41 |
| | | | | |||||
* | | | | Merge pull request #8023 from ameerj/kirby-pop-in | Fernando S | 2022-03-16 | 2 | -70/+12 |
|\ \ \ \ | |/ / / |/| | | | maxwell_3d: Implement a safer CB data upload | ||||
| * | | | maxwell_3d: Implement a safer CB data upload | ameerj | 2022-03-15 | 2 | -70/+12 |
| | | | | | | | | | | | | | | | | This makes constant buffer uploads safer and more accurate by updating the GPU memory as soon as the CB Data method is invoked. The previous implementation was deferring the updates until a different maxwell 3d method was detected, then writing all CB data at once. | ||||
* | | | | Merge pull request #8000 from liamwhite/hagi | Fernando S | 2022-03-15 | 3 | -3/+77 |
|\ \ \ \ | |/ / / |/| | | | Initial support for Wii Hagi emulator | ||||
| * | | | Maxwell3D: Link to override constant definition in nouveau | byte[] | 2022-03-14 | 1 | -0/+2 |
| | | | | |||||
| * | | | Maxwell3D: restore original topology when topology overrides are disabled | byte[] | 2022-03-14 | 1 | -0/+2 |
| | | | | |||||
| * | | | Maxwell3D: Use override constants from nouveau | Liam | 2022-03-14 | 2 | -2/+37 |
| | | | | | | | | | | | | | | | | This fixes some incorrect rendering in Sunshine | ||||
| * | | | Maxwell3D: Restrict topology override effect to after the register is set | Liam | 2022-03-12 | 2 | -1/+5 |
| | | | | |||||
| * | | | Maxwell3D: mark index buffers as dirty after updating counts | Liam | 2022-03-11 | 1 | -0/+2 |
| | | | | |||||
| * | | | TextureCacheRuntime: allow converting D24S8 to ABGR8 | Liam | 2022-03-11 | 1 | -1/+2 |
| | | | | | | | | | | | | | | | | I can't see how this would be useful, but Galaxy uses it. | ||||
| * | | | Maxwell3D: read small-index draw and primitive topology override registers | Liam | 2022-03-11 | 2 | -2/+30 |
| | | | | | | | | | | | | | | | | This allows Galaxy and Sunshine to render for the first time. | ||||
* | | | | emit_spirv, vk_compute_pass: Resolve VS2022 compiler errors | ameerj | 2022-03-12 | 1 | -1/+2 |
|/ / / | |||||
* | | | Merge pull request #7986 from lat9nq/vk-callback | bunnei | 2022-03-08 | 1 | -0/+1 |
|\ \ \ | | | | | | | | | core, video_core: Fix two crashes when failing to create the emulated GPU instance | ||||
| * | | | video_core: Cancel Scoped's exit call on GPU failure | lat9nq | 2022-03-08 | 1 | -0/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | When CreateRenderer fails, the GraphicsContext that was std::move'd into it is destroyed before the Scoped that was created to manage its currency. In that case, the GraphicsContext::Scoped will still call its destructor at the ending of the function. And because the context is destroyed, the Scoped will cause a crash as it attempts to call a destroyed object's DoneCurrent function. Since we know when the call would be invalid, call the Scoped's Cancel method. This prevents it from calling a method on a destroyed object. | ||||
* | | | | Merge pull request #7930 from asLody/dma-semaphore | Fernando S | 2022-03-07 | 2 | -1/+21 |
|\ \ \ \ | |/ / / |/| | | | MaxwellDMA: Implement semaphore operations | ||||
| * | | | MaxwellDMA: Implement semaphore operations | Lody | 2022-03-07 | 2 | -1/+21 |
| | | | | |||||
* | | | | gl_graphics_pipeline: Improve shader builder synchronization using fences (#7969) | Ameer J | 2022-03-06 | 2 | -21/+32 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * gl_graphics_pipeline: Improve shader builder synchronization Make use of GLsync objects to ensure better synchronization between shader builder threads and the main context * gl_graphics_pipeline: Make built_fence access threadsafe * gl_graphics_pipeline: Use GLsync objects only when building in parallel * gl_graphics_pipeline: Replace GetSync calls with non-blocking waits The spec states that a ClientWait on a Fence object ensures the changes propagate to the calling context | ||||
* | | | | gl_fence_manager: Minor optimization to signal querying | ameerj | 2022-02-27 | 1 | -2/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | Per the spec, bufSize is the number of integers that will be written, in this case, 1. Also, the length argument is optional if the information of the number of elements written is not needed. | ||||
* | | | | Merge pull request #7953 from ameerj/radv-rdna2-crash | bunnei | 2022-02-26 | 1 | -4/+21 |
|\ \ \ \ | | | | | | | | | | | vulkan_device: Blacklist RADV on RDNA2 from VK_EXT_vertex_input_dynamic_state | ||||
| * | | | | vulkan_device: Blacklist RADV on RDNA2 from VK_EXT_vertex_input_dynamic_state | Ameer J | 2022-02-26 | 1 | -4/+21 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | RDNA2 devices running under the RADV driver were crashing when VK_EXT_vertex_input_dynamic_state was enabled. Blacklisting these devices until a proper fix is established. | ||||
* | | | | | Merge pull request #7948 from Morph1984/11-11-10-float | Mai M | 2022-02-26 | 2 | -0/+4 |
|\ \ \ \ \ | | | | | | | | | | | | | maxwell_to_(gl/vk): Add 11_11_10 float vertex format | ||||
| * | | | | | maxwell_to_(gl/vk): Add 11_11_10 float vertex format | Morph | 2022-02-25 | 2 | -0/+4 |
| |/ / / / | | | | | | | | | | | | | | | | - Used by パワプロクンポケットR | ||||
* / / / / | vk_blit_screen: Add missing format bgra8 | Lody | 2022-02-24 | 1 | -0/+2 |
|/ / / / | |||||
* / / / | vulkan_device: fix missing format in ANV | voidanix | 2022-02-21 | 3 | -2/+21 |
|/ / / | | | | | | | | | | | | | Currently Mesa's ANV driver does not support VK_FORMAT_B5G6R5_UNORM_PACK16, implement an alternative for it. | ||||
* | | | Merge pull request #7814 from FernandoS27/another-bug-in-my-schedule | bunnei | 2022-02-03 | 2 | -4/+6 |
|\ \ \ | | | | | | | | | Vulkan: Fix Scheduler Chunks when their FuncType is 0. | ||||
| * | | | Vulkan: Fix Scheduler Chunks when their FuncType is 0. | Fernando Sahmkow | 2022-01-31 | 2 | -4/+6 |
| | |/ | |/| | |||||
* | | | general: Replace NonCopyable struct with equivalents | Lioncash | 2022-02-02 | 2 | -15/+43 |
| |/ |/| | |||||
* | | Merge pull request #7824 from lioncash/scache | Morph | 2022-02-01 | 2 | -4/+3 |
|\ \ | | | | | | | video_core/shader_cache: Take std::span in RemoveShadersFromStorage() | ||||
| * | | video_core/shader_cache: Remove unused algorithm include | Lioncash | 2022-02-01 | 1 | -1/+0 |
| | | | |||||
| * | | video_core/shader_cache: Take std::span in RemoveShadersFromStorage() | Lioncash | 2022-02-01 | 2 | -3/+3 |
| |/ | | | | | | | | | Same behavior, but without the need to move into the function to avoid an allocation. | ||||
* | | Rasterizer: Refactor inlineToMemory. | Fernando Sahmkow | 2022-02-01 | 9 | -15/+16 |
| | | |||||
* | | GPU: Improve syncing. | Fernando Sahmkow | 2022-01-29 | 1 | -3/+10 |
| | | |||||
* | | Rasterizer: Implement Inline2Memory Acceleration. | Fernando Sahmkow | 2022-01-29 | 14 | -6/+122 |
| | | |||||
* | | Inline2Memory: Flush before writting buffer. | Fernando Sahmkow | 2022-01-29 | 2 | -2/+3 |
|/ | |||||
* | buffer_cache: Reduce stream buffer allocations when expanding from the left | ameerj | 2022-01-27 | 1 | -0/+2 |
| | | | | | The existing stream buffer optimization accounts for size increases at the end of the allocated buffer. This adds the same optimization, increasing the size from the beginning of the buffer as well to reduce buffer allocations when expanding the same buffer from the left. | ||||
* | common/xbyak_api: Make BuildRegSet() constexpr | Lioncash | 2022-01-26 | 1 | -1/+1 |
| | | | | | This allows us to eliminate any static constructors that would have been emitted due to the function not being constexpr. | ||||
* | Merge pull request #7780 from lioncash/macro | bunnei | 2022-01-26 | 9 | -213/+204 |
|\ | | | | | video_core/macro: Move impl classes into their cpp files | ||||
| * | video_core/macro: Add missing <cstring> header | Lioncash | 2022-01-25 | 1 | -2/+3 |
| | | | | | | | | Necessary since memcpy is used. | ||||
| * | video_core/macro_interpreter: Move impl class to the cpp file | Lioncash | 2022-01-25 | 2 | -84/+86 |
| | | | | | | | | | | Keeps the implementation hidden from the intended API and lessens the header dependencies on the interpreter's header. | ||||
| * | video_core/macro_hle: Return unique_ptr directly from GetHLEProgram() | Lioncash | 2022-01-25 | 3 | -7/+7 |
| | | | | | | | | Same behavior, but less code and header dependencies. | ||||
| * | video_core/macro: Remove unused parameter from Execute() | Lioncash | 2022-01-25 | 3 | -4/+3 |
| | | | | | | | | Simplifies the function interface. | ||||
| * | video_core/macro_jit_x64: Remove unused impl class member | Lioncash | 2022-01-25 | 1 | -1/+0 |
| | | | | | | | | Reduces the size of the impl class a tiny bit. | ||||
| * | video_core/macro_jit_x64: Decouple PersistentCallerSavedRegs() from impl | Lioncash | 2022-01-25 | 1 | -5/+4 |
| | | | | | | | | This doesn't depend on class state and can just be a regular function. | ||||
| * | video_core/macro_jit_x64: Move impl class into cpp file | Lioncash | 2022-01-25 | 2 | -87/+86 |
| | | | | | | | | | | | | | | Keeps the implementation internalized and also reduces API-facing header dependencies. Notably, this fully internalizes all of the xbyak externals. | ||||
| * | video_core/macro_hle: Move impl class into cpp file | Lioncash | 2022-01-25 | 2 | -27/+19 |
| | | | | | | | | | | | | | | | | Given it's intended to be an internal implementation class, we can move it into the cpp file to ensure that. This also lets us move some header dependencies into the cpp file as well. | ||||
* | | Merge pull request #7779 from lioncash/gpu-iface | Morph | 2022-01-25 | 1 | -16/+0 |
|\ \ | | | | | | | gpu: Remove obsoleted CDmaPusher() accessors | ||||
| * | | gpu: Tidy up forward declarations | Lioncash | 2022-01-25 | 1 | -10/+0 |
| | | | | | | | | | | | | | | | Over time a few forward declarations became unnecessary, so we can remove these to tidy up the header a little bit. | ||||
| * | | gpu: Remove obsoleted CDMAPusher() accessors | Lioncash | 2022-01-25 | 1 | -6/+0 |
| |/ | | | | | | | | | These were obsoleted in 2c47f8aa1886522898b5b3a73185b5662be3e9f3 but were accidentally overlooked. | ||||
* / | vk_fsr: Replace comma operator with semicolon | Lioncash | 2022-01-25 | 1 | -1/+1 |
|/ | | | | | | Generally, we should be ending statements with a semicolon not a comma Resolves a clang diagnostic. | ||||
* | video_core: constify AVCodec for ffmpeg >= 5.0 | Jan Beich | 2022-01-20 | 1 | -1/+1 |
| | | | | | | src/video_core/command_classes/codecs/codec.cpp:177:16: error: assigning to 'AVCodec *' from 'const AVCodec *' discards qualifiers av_codec = avcodec_find_decoder(codec); ^~~~~~~~~~~~~~~~~~~~~~~~~~~ | ||||
* | vulkan_device: Fix sType for VkPhysicalDeviceShaderAtomicInt64Features | Georg Lehmann | 2022-01-19 | 1 | -1/+1 |
| | |||||
* | astc_decoder: Combine FastReplicate functions to work around new NV driver bug | ameerj | 2022-01-16 | 1 | -34/+46 |
| | | | | | | The new Nvidia drivers have a bug where the FastReplicateTo6 function produces a lookup into the REPLICATE_TO_8 table rather than the REPLICATE_TO_6 table. This seems to be an optimization gone wrong. Combining the logic of the FastReplicate functions seems to address the bug. | ||||
* | Merge pull request #7658 from ameerj/sparse-fixes | Fernando S | 2022-01-06 | 3 | -61/+44 |
|\ | | | | | video_core/memory_manager: Fixes for sparse memory management | ||||
| * | video_core/memory_manager: Fixes for sparse memory management | ameerj | 2021-12-31 | 2 | -14/+12 |
| | | |||||
| * | video_core/memory_manager: Deduplicate Read/WriteBlock | ameerj | 2021-12-31 | 2 | -47/+32 |
| | | |||||
* | | Merge pull request #7673 from german77/no_return | Mai M | 2022-01-05 | 1 | -1/+1 |
|\ \ | | | | | | | glsl: Remove unreachable return | ||||
| * | | video_core: Remove unnecesary maybe_unused flag | Narr the Reg | 2022-01-05 | 1 | -1/+1 |
| | | | |||||
* | | | Merge pull request #7636 from vonchenplus/buffer_queue_query | bunnei | 2022-01-04 | 1 | -1/+0 |
|\ \ \ | | | | | | | | | core:hle:service:nvflinger Implement few type in bufferqueue query method | ||||
| * | | | Remove invalid header include | Feng Chen | 2021-12-28 | 1 | -1/+0 |
| | |/ | |/| | |||||
* | | | Merge pull request #7670 from ameerj/vsync-block | Fernando S | 2022-01-04 | 2 | -8/+27 |
|\ \ \ | |_|/ |/| | | gpu: Add shut down method to synchronize threads before destruction | ||||
| * | | gpu: Add shut down method to synchronize threads before destruction | ameerj | 2022-01-04 | 2 | -0/+13 |
| | | | |||||
| * | | Revert "Merge pull request #7668 from ameerj/fence-stop-token" | ameerj | 2022-01-04 | 2 | -8/+14 |
| | | | | | | | | | | | | | | | This reverts commit e7733544779f2706d108682dd027d44e7fa5ff4b, reversing changes made to abbbdc2bc027ed7af236625ae8427a46df63f7e7. | ||||
* | | | Merge pull request #7251 from FernandoS27/shader-dump | bunnei | 2022-01-04 | 4 | -1/+79 |
|\ \ \ | |/ / |/| | | ShaderDecompiler: Add a debug option to dump the game's shaders. | ||||
| * | | ShaderDecompiler: Add a debug option to dump the game's shaders. | Fernando Sahmkow | 2022-01-04 | 4 | -1/+79 |
| | | | |||||
* | | | gpu: Use std::stop_token in WaitFence for VSync thread | ameerj | 2022-01-03 | 2 | -14/+8 |
|/ / | | | | | | | Fixes a hang that may occur when stopping emulation and the VSync thread is blocked on the syncpoint condition variable. | ||||
* | | Merge pull request #7624 from ameerj/intel-msaa-scale | Fernando S | 2022-01-03 | 4 | -20/+35 |
|\ \ | | | | | | | vk_texture_cache: Use 3D scale helpers for MSAA texture scaling on Intel Windows drivers | ||||
| * | | vk_texture_cache: Use 3D scale helpers for MSAA texture scaling on Intel Windows drivers | ameerj | 2021-12-24 | 4 | -20/+35 |
| | | | | | | | | | | | | Fixes a crash when scaling MSAA textures in titles such as Sonic Colors Ultimate. | ||||
* | | | Merge pull request #7629 from ameerj/nv-driver-fixes | Fernando S | 2022-01-03 | 3 | -7/+15 |
|\ \ \ | | | | | | | | | shaders: Add fixes for NVIDIA drivers 495+ | ||||
| * | | | glsl: Add boolean reference workaround | ameerj | 2021-12-30 | 3 | -0/+7 |
| | | | | |||||
| * | | | glsl_context_get_set: Add alternative cbuf type for broken drivers | ameerj | 2021-12-30 | 3 | -7/+8 |
| | | | | | | | | | | | | | | | | some drivers have a bug bitwise converting floating point cbuf values to uint variables. This adds a workaround for these drivers to make all cbufs uint and convert to floating point as needed. | ||||
* | | | | texture_cache/util: Fix s32 overflow when resolving overlaps | ameerj | 2022-01-01 | 1 | -5/+5 |
| |_|/ |/| | | |||||
* | | | Merge pull request #7622 from ameerj/vk-rescale-invalid-ptr | bunnei | 2021-12-28 | 5 | -8/+21 |
|\ \ \ | |/ / |/| | | vk_texture_cache: Fix invalidated pointer access | ||||
| * | | vk_texture_cache: Fix invalidated pointer access | ameerj | 2021-12-24 | 5 | -8/+21 |
| |/ | | | | | | | | | The vulkan ImageView held a reference to its source image for rescale status checking. This pointer is sometimes invalidated when the texture cache slot_images container is resized. To avoid an invalid pointer dereference, the ImageView now holds a reference to the container itself. | ||||
* / | blit_image: Remove unused function | ameerj | 2021-12-24 | 2 | -50/+0 |
|/ | |||||
* | Merge pull request #7375 from vonchenplus/convert_legacy | Fernando S | 2021-12-22 | 2 | -0/+5 |
|\ | | | | | Convert all legacy attributes to generic attributes | ||||
| * | Address format clang | vonchenplus | 2021-12-18 | 2 | -2/+2 |
| | | |||||
| * | Merge branch 'yuzu-emu:master' into convert_legacy | Feng Chen | 2021-12-18 | 28 | -130/+698 |
| |\ | |||||
| * | | Implement convert legacy to generic | Feng Chen | 2021-11-19 | 2 | -0/+5 |
| | | | |||||
* | | | Merge pull request #7599 from FernandoS27/primrestart-vulkan | bunnei | 2021-12-22 | 3 | -5/+50 |
|\ \ \ | | | | | | | | | Vulkan: Fix Primitive Restart and implement Logical Operations | ||||
| * | | | Vulkan: Fix the checks for primitive restart extension. | Fernando Sahmkow | 2021-12-18 | 3 | -21/+28 |
| | | | | |||||
| * | | | Vulkan: implement Logical Operations. | Fernando Sahmkow | 2021-12-18 | 2 | -3/+3 |
| | | | | |||||
| * | | | Vulkan: Implement VK_EXT_primitive_topology_list_restart | Fernando Sahmkow | 2021-12-18 | 3 | -2/+40 |
| | |/ | |/| | |||||
* | | | video_core/codecs: re-enable VAAPI/VDPAU on BSDs after 72aa418b0b41 | Jan Beich | 2021-12-18 | 1 | -1/+1 |
| | | | |||||
* | | | Merge pull request #7587 from liushuyu/fix-linux-decoding | bunnei | 2021-12-18 | 1 | -0/+6 |
|\ \ \ | | | | | | | | | [Patch v2] externals/ffmpeg: refactor ffmpeg searching and handling in cmake | ||||
| * | | | video_core/codecs: (re-spin) refactor ffmpeg searching and handling | liushuyu | 2021-12-16 | 1 | -0/+6 |
| | | | | |||||
* | | | | Merge pull request #7399 from ameerj/art-refactor | Fernando S | 2021-12-18 | 8 | -152/+147 |
|\ \ \ \ | |_|/ / |/| | | | video_core: Refactoring post A.R.T. merge | ||||
| * | | | vk_texture_cache: Add ABGR src format check for D24S8 conversions | ameerj | 2021-12-05 | 1 | -1/+5 |
| | | | | |||||
| * | | | renderer_opengl: Minor refactoring of filter selection | ameerj | 2021-12-05 | 1 | -30/+20 |
| | | | | |||||
| * | | | texture_cache: Fix image convert dimensions assertion | ameerj | 2021-12-05 | 1 | -1/+12 |
| | | | | |||||
| * | | | blit_image: Refactor upscale factors usage | ameerj | 2021-12-05 | 6 | -62/+53 |
| | | | | | | | | | | | | | | | | The image view itself can be queried to see if it is being rescaled or not, removing the need to pass the upscale/down shift factors from the texture cache. | ||||
| * | | | vk_texture_cache: Add a function to ImageView to check if src image is rescaled | ameerj | 2021-12-05 | 2 | -4/+22 |
| | | | | |||||
| * | | | blit_image: Refactor ConvertPipeline functions | ameerj | 2021-12-05 | 2 | -29/+15 |
| | | | | |||||
| * | | | blit_image: Refactor ConvertPipelineEx functions | ameerj | 2021-12-05 | 2 | -33/+18 |
| | | | | | | | | | | | | | | | | reduces much of the duplication between the color/depth variants | ||||
| * | | | vk_blit_screen: Minor refactor of filter pipeline selection | ameerj | 2021-12-05 | 1 | -21/+16 |
| | | | | |||||
| * | | | Revert "Merge pull request #7395 from Morph1984/resolve-comments" | ameerj | 2021-12-05 | 3 | -16/+31 |
| | | | | | | | | | | | | | | | | | | | | This reverts commit d20f91da11fe7c5d5f1bd4f63cc3b4d221be67a4, reversing changes made to 5082712b4e44ebfe48bd587ea2fa38767b7339cb. | ||||
* | | | | Merge pull request #7551 from vonchenplus/fix_blit_image_view_mismatching | bunnei | 2021-12-16 | 1 | -1/+6 |
|\ \ \ \ | |_|/ / |/| | | | Fix blit image/view not compatible | ||||
| * | | | Fix blit image/view not compatible | Feng Chen | 2021-12-10 | 1 | -1/+6 |
| | | | | |||||
* | | | | Revert "video_core/codecs: refactor ffmpeg searching and handling in cmake" | bunnei | 2021-12-15 | 1 | -6/+0 |
| | | | | |||||
* | | | | Merge pull request #7565 from liushuyu/fix-linux-decoding | bunnei | 2021-12-15 | 1 | -0/+6 |
|\ \ \ \ | | | | | | | | | | | video_core/codecs: refactor ffmpeg searching and handling in cmake | ||||
| * | | | | CI: fix CI on Linux | liushuyu | 2021-12-14 | 1 | -3/+0 |
| | | | | | |||||
| * | | | | video_core/codecs: skip decoders that use hw frames ... | liushuyu | 2021-12-14 | 1 | -0/+9 |
| |/ / / | | | | | | | | | | | | | | | | | ... this would resolve some edge-cases where multiple devices are present and ffmpeg is unable to auto-supply the hw surfaces | ||||
* | | | | maxwell_to_vk: Add ASTC_2D_5X4_UNORM | Morph | 2021-12-11 | 1 | -1/+1 |
| | | | | |||||
* | | | | maxwell_to_vk: Add ASTC_2D_8X5_UNORM | Morph | 2021-12-09 | 1 | -1/+1 |
|/ / / | | | | | | | | | | - Used by Lego City Undercover | ||||
* | | | Merge pull request #7495 from FernandoS27/text-blit-fix-again | Morph | 2021-12-09 | 1 | -3/+6 |
|\ \ \ | | | | | | | | | Texture Cache: Fix mismatching image/views on blits | ||||
| * | | | Texture Cache: Fix crashes on NVIDIA. | Fernando Sahmkow | 2021-12-04 | 1 | -3/+6 |
| |/ / | |||||
* | | | renderer_vulkan: Add R16G16_UINT | Morph | 2021-12-08 | 2 | -1/+2 |
| | | | | | | | | | | | | - Used by Immortals Fenyx Rising | ||||
* | | | Merge pull request #7488 from vonchenplus/support_multiple_videos_playing | bunnei | 2021-12-08 | 2 | -32/+16 |
|\ \ \ | |/ / |/| | | Support multiple videos playing | ||||
| * | | Address feedback | Feng Chen | 2021-12-04 | 1 | -4/+5 |
| | | | |||||
| * | | Support multiple videos playing | Feng Chen | 2021-12-02 | 2 | -32/+15 |
| | | | |||||
* | | | Merge pull request #7467 from liushuyu/fix-linux-decoding | bunnei | 2021-12-04 | 2 | -66/+50 |
|\ \ \ | | | | | | | | | video_core/codecs: more robust ffmpeg hwdecoder selection logic | ||||
| * | | | video_core/cmake: link against libva explicitly ... | liushuyu | 2021-12-03 | 1 | -0/+1 |
| | | | | | | | | | | | | | | | | ... to fix build on Flatpak (and self-builds) | ||||
| * | | | video_core/codecs: more fixes for VAAPI detection ... | liushuyu | 2021-12-03 | 1 | -63/+25 |
| | | | | | | | | | | | | | | | | | | | | * skip impersonated VAAPI implementaions ("imposter detection") * place VAAPI priority below CUDA/NVDEC/CUVID | ||||
| * | | | video_core/codec: address comments | liushuyu | 2021-12-03 | 1 | -8/+12 |
| | | | | |||||
| * | | | video_core/codecs: more robust ffmpeg hwdecoder selection logic | liushuyu | 2021-12-03 | 1 | -10/+27 |
| |/ / | |||||
* / / | general: Replace high_resolution_clock with steady_clock | Morph | 2021-12-02 | 2 | -2/+2 |
|/ / | | | | | | | On some OSes, high_resolution_clock is an alias to system_clock and is not monotonic in nature. Replace this with steady_clock. | ||||
* | | Add missing pixel format mapping | Feng Chen | 2021-11-29 | 1 | -0/+2 |
| | | |||||
* | | Merge pull request #7396 from FernandoS27/blit-this-mf | Fernando S | 2021-11-28 | 14 | -223/+168 |
|\ \ | | | | | | | TextureCache: Eliminate format deduction as full depth conversion has been supported. | ||||
| * | | Texture Cache: Secure insertions against deletions. | Fernando Sahmkow | 2021-11-28 | 1 | -3/+13 |
| | | | |||||
| * | | Texture Cache: Redesigning the blitting system (again). | Fernando Sahmkow | 2021-11-27 | 3 | -23/+64 |
| | | | |||||
| * | | Texture Cache: Further fix regressions. | Fernando Sahmkow | 2021-11-26 | 1 | -11/+15 |
| | | | |||||
| * | | Texture Cache: Fix issue with blitting 3D textures. | Fernando Sahmkow | 2021-11-22 | 1 | -2/+4 |
| | | | |||||
| * | | Texture Cache: Correct conversion shaders. | Fernando Sahmkow | 2021-11-22 | 2 | -2/+2 |
| | | | |||||
| * | | Texture Cache: Always copy on NVIDIA. | Fernando Sahmkow | 2021-11-22 | 1 | -0/+5 |
| | | | |||||
| * | | TextureCache: Simplify blitting of D24S8 formats and fix bugs. | Fernando Sahmkow | 2021-11-22 | 10 | -195/+73 |
| | | | |||||
| * | | VulkanTexturECache: Use reinterpret on D32_S8 formats. | Fernando Sahmkow | 2021-11-21 | 1 | -2/+7 |
| | | | |||||
| * | | HostShaders: Fix D24S8 convertion shaders. | Fernando Sahmkow | 2021-11-21 | 6 | -23/+47 |
| | | | |||||
| * | | TextureCache: Eliminate format deduction as full depth conversion has been supported. | Fernando Sahmkow | 2021-11-21 | 2 | -29/+5 |
| | | | |||||
* | | | video_core/codec: address comments | liushuyu | 2021-11-25 | 1 | -17/+11 |
| | | | |||||
* | | | video_core/codecs: fix multiple decoding issues on Linux ... | liushuyu | 2021-11-25 | 1 | -2/+47 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * when someone installed Intel video drivers on an AMD system, the decoder will select the Intel VA-API decoding driver and yuzu will crash due to incorrect driver selection; the fix will check if the currently about-to-use driver is loaded in the kernel * when using NVIDIA driver on Linux with a ffmpeg that does not have CUDA capability enabled, the decoder will crash; the fix simply making the decoder prefers the VDPAU driver over CUDA on Linux | ||||
* | | | Merge pull request #7395 from Morph1984/resolve-comments | bunnei | 2021-11-21 | 3 | -31/+16 |
|\ \ \ | | | | | | | | | general: Resolve comments in PR #7368 | ||||
| * | | | vk_texture_cache: Mark VkBufferUsageFlags as static constexpr | Morph | 2021-11-21 | 1 | -3/+3 |
| | | | | |||||
| * | | | vk_blit_image: Consolidate CreatePipelineTargetEx functions | Morph | 2021-11-21 | 2 | -28/+13 |
| |/ / | |||||
* | | | Merge pull request #7389 from ameerj/screenshot-1x | bunnei | 2021-11-21 | 2 | -8/+0 |
|\ \ \ | |/ / |/| | | Fix screenshot dimensions when at 1x scale | ||||
| * | | Fix screenshot dimensions when at 1x scale | ameerj | 2021-11-20 | 2 | -8/+0 |
| | | | | | | | | | | | | | | | | | | | | | This was regressed by ART. Prior to ART, the screenshots were saved at the title's framebuffer resolution. A misunderstanding of the existing logic led to screenshot dimensions becoming dependent on the host render window size. This changes the behavior to match how it was prior to ART at 1x, with screenshots now always being the title's framebuffer dimensions scaled by the resolution scaling factor. | ||||
* | | | Merge pull request #7368 from FernandoS27/vulkan-conv | bunnei | 2021-11-21 | 17 | -23/+588 |
|\ \ \ | |/ / |/| | | Fix ART Blit detection regression and add D24S8 <-> RGBA8 conv to Vulkan | ||||
| * | | TextureCache: Refactor and fix linux compiling. | Fernando Sahmkow | 2021-11-20 | 2 | -9/+4 |
| | | | |||||
| * | | TextureCache: Assure full conversions on depth/stencil write shaders. | Fernando Sahmkow | 2021-11-20 | 3 | -6/+6 |
| | | | |||||
| * | | TextureCache: Implement buffer copies on Vulkan. | Fernando Sahmkow | 2021-11-20 | 6 | -9/+193 |
| | | | |||||
| * | | TextureCache: Add R16G16 to D24S8 converter. | Fernando Sahmkow | 2021-11-20 | 5 | -0/+38 |
| | | | |||||
| * | | TextureCache: Add B10G11R11 to D24S8 converter. | Fernando Sahmkow | 2021-11-19 | 5 | -13/+84 |
| | | | |||||
| * | | TextureCache: Further fixes on resolve algorithm. | Fernando Sahmkow | 2021-11-19 | 2 | -16/+17 |
| | | | |||||
| * | | TextureCache: Implement additional D24S8 convertions. | Fernando Sahmkow | 2021-11-19 | 6 | -0/+86 |
| | | | |||||
| * | | TextureCache: force same image format when resolving an image. | Fernando Sahmkow | 2021-11-19 | 2 | -2/+9 |
| | | | |||||
| * | | TextureCache: Fix regression caused by ART and improve blit detection algorithm to be smarter. | Fernando Sahmkow | 2021-11-19 | 2 | -10/+27 |
| | | | |||||
| * | | Vulkan: implement D24S8 <-> RGBA8 convertions. | Fernando Sahmkow | 2021-11-19 | 6 | -0/+166 |
| |/ | |||||
* | | Merge pull request #7294 from vonchenplus/fix_image_update_error_when_width_too_small | bunnei | 2021-11-20 | 2 | -10/+18 |
|\ \ | | | | | | | Fix image update/download error when width too small | ||||
| * | | Fix image update/download error when width too small | Feng Chen | 2021-11-17 | 2 | -10/+18 |
| | | | |||||
* | | | Merge pull request #7357 from Morph1984/s8_uint | bunnei | 2021-11-19 | 10 | -9/+64 |
|\ \ \ | |_|/ |/| | | video_core: Implement S8_UINT format | ||||
| * | | renderer_vulkan: Implement S8_UINT stencil format | Morph | 2021-11-18 | 3 | -0/+18 |
| | | | | | | | | | | | | It should be noted that on Windows, only nvidia gpus support this format natively as of this commit. | ||||
| * | | renderer_opengl: Implement S8_UINT stencil format | Morph | 2021-11-17 | 3 | -6/+25 |
| | | | |||||
| * | | video_core: Add S8_UINT stencil format | Morph | 2021-11-17 | 4 | -3/+21 |
| |/ | |||||
* | | gl_texture_cache: Round format conversion PBO to next power of 2 | ameerj | 2021-11-18 | 1 | -1/+5 |
| | | |||||
* | | texture_cache: Use pixel format conversion when supported by the runtime | ameerj | 2021-11-17 | 5 | -0/+15 |
| | | |||||
* | | gl_texture_cache: Make FormatConversionPass more generic | ameerj | 2021-11-17 | 1 | -7/+12 |
| | | | | | | | | This allows the usage of the FormatConversionPass to be applied to more than the previously used BGR conversion scenarios. | ||||
* | | gl_texture_cache: Rename BGRCopyPass to FormatConversionPass | ameerj | 2021-11-17 | 2 | -21/+18 |
|/ | |||||
* | TextureCache: Fix Automatic Anisotropic. | Fernando Sahmkow | 2021-11-17 | 1 | -6/+5 |
| | |||||
* | TextureCache: OGL query device memory if possible. | FernandoS27 | 2021-11-17 | 2 | -2/+14 |
| | |||||
* | TextureCache: Fix OGL cleaning | Fernando Sahmkow | 2021-11-17 | 5 | -0/+43 |
| | |||||
* | TextureCache: Add automatic anisotropic filtering and refactor code. | Fernando Sahmkow | 2021-11-16 | 3 | -15/+16 |
| | |||||
* | TextureCache: Make a better Anisotropic setter. | Fernando Sahmkow | 2021-11-16 | 3 | -20/+17 |
| | |||||
* | Texture Cache: revert Image changes. | Fernando Sahmkow | 2021-11-16 | 1 | -0/+4 |
| | |||||
* | HostShader: fix Gaussian filter. | FernandoS27 | 2021-11-16 | 1 | -2/+2 |
| | |||||
* | Texture Cahe/Shader decompiler: Resize PointSize on rescaling, refactor and make reaper more agressive on 4Gb GPUs. | FernandoS27 | 2021-11-16 | 4 | -22/+8 |
| | |||||
* | texture_cache: Refactor Render Target scaling function | ameerj | 2021-11-16 | 2 | -14/+24 |
| | |||||
* | gl_resource_manager: Ensure non EXT_framebuffer objects are created | ameerj | 2021-11-16 | 2 | -13/+8 |
| | |||||
* | Texture Cache: Fix memory usage on ScaleDown. | FernandoS27 | 2021-11-16 | 1 | -4/+0 |
| | |||||
* | OpenGL: Fix viewport/Scissor scaling on downscaling. | FernandoS27 | 2021-11-16 | 1 | -6/+28 |
| | |||||
* | Vulkan: fix regression. | FernandoS27 | 2021-11-16 | 1 | -14/+17 |
| | |||||
* | host_shaders: Misc copyright/style changes | ameerj | 2021-11-16 | 4 | -10/+12 |
| | |||||
* | FSR: Fix GCC build errors | ameerj | 2021-11-16 | 3 | -43/+50 |
| | |||||
* | Vulkan: Reimplement FSR constant generation functions to avoid GCC warnings | Marshall Mohror | 2021-11-16 | 2 | -9/+145 |
| | |||||
* | vk_blit_screen: Fix AA destruction order | ameerj | 2021-11-16 | 1 | -9/+10 |
| | |||||
* | Presentation: Only use FP16 in scaling shaders on supported devices in Vulkan | Marshall Mohror | 2021-11-16 | 14 | -116/+197 |
| | |||||
* | renderer_vulkan/blit_image: Use generic color state on Depth to Color blits | ameerj | 2021-11-16 | 1 | -1/+1 |
| | | | | Fixes Bayonetta 2 on AMD | ||||
* | vk_texture_cache: Refactor 3D scaling helpers | ameerj | 2021-11-16 | 2 | -113/+74 |
| | |||||
* | gl_rasterizer: Fix ScissorTest and Clear when scaling | ameerj | 2021-11-16 | 1 | -10/+6 |
| | |||||
* | gl_texture_cache: Simplify scaling procedures | ameerj | 2021-11-16 | 2 | -57/+28 |
| | |||||
* | OpenGlTextureCache: Fix state invalidation on rescaling. | Fernando Sahmkow | 2021-11-16 | 3 | -2/+17 |
| | |||||
* | VulkanBufferCache: Avoid adding barriers between multiple copies. | Fernando Sahmkow | 2021-11-16 | 3 | -5/+43 |
| | |||||
* | HostShader: Fix gaussian and add attribution. | Fernando Sahmkow | 2021-11-16 | 1 | -23/+19 |
| | |||||
* | Vulkan: Fix FXAA in AMD. | Fernando Sahmkow | 2021-11-16 | 1 | -2/+40 |
| | |||||
* | Texture Cache: Fix blitting. | Fernando Sahmkow | 2021-11-16 | 1 | -2/+2 |
| | |||||
* | Vulkan: Implement FXAA | FernandoS27 | 2021-11-16 | 3 | -22/+387 |
| | |||||
* | OpenGL: fix FXAA with scaling | Marshall Mohror | 2021-11-16 | 2 | -9/+31 |
| | |||||
* | OpenGL: Implement FXAA | Marshall Mohror | 2021-11-16 | 6 | -35/+194 |
| | |||||
* | QtGUI: Add buttton to toggle the filter. | FernandoS27 | 2021-11-16 | 1 | -0/+1 |
| | |||||
* | VideoCore: Add gaussian filtering. | FernandoS27 | 2021-11-16 | 6 | -0/+132 |
| | |||||
* | TextureCache: Improve Reaper. | FernandoS27 | 2021-11-16 | 2 | -14/+26 |
| | |||||
* | Vulkan: fix waiting on semaphore. | FernandoS27 | 2021-11-16 | 1 | -1/+3 |
| | |||||
* | Update scaleforce to use FP16 | Marshall Mohror | 2021-11-16 | 1 | -88/+55 |
| | |||||
* | TextureCache: fix rescaling in aliases and overlap joins. | FernandoS27 | 2021-11-16 | 4 | -23/+48 |
| | |||||
* | Presentation: Fix turning FSR on and off in settings | Marshall Mohror | 2021-11-16 | 1 | -0/+11 |
| | |||||
* | Video Core: fix building for GCC. | Fernando Sahmkow | 2021-11-16 | 4 | -22/+40 |
| | |||||
* | Vulkan Rasterizer: Fix clears on integer textures. | FernandoS27 | 2021-11-16 | 3 | -1/+84 |
| | |||||
* | Texture cache: fix Intel with rescaler. | FernandoS27 | 2021-11-16 | 1 | -2/+2 |
| | |||||
* | TextureCache: Fix blitting filter in Vulkan and correct viewport/scissor calculation when downscaling. | FernandoS27 | 2021-11-16 | 2 | -20/+44 |
| | |||||
* | Texture Cache: fix memory managment and optimize scaled downloads, uploads. | Fernando Sahmkow | 2021-11-16 | 7 | -28/+57 |
| | |||||
* | Texture Cache: ease the requirements of textures being blacklisted. | Fernando Sahmkow | 2021-11-16 | 2 | -22/+7 |
| | |||||
* | Vulkan: Fix Blit Depth Stencil | Fernando Sahmkow | 2021-11-16 | 2 | -14/+20 |
| | |||||
* | Texture Cache: Fix downscaling and correct memory comsumption. | Fernando Sahmkow | 2021-11-16 | 8 | -36/+147 |
| | |||||
* | Presentation: add Nearest Neighbor filter. | Fernando Sahmkow | 2021-11-16 | 4 | -9/+56 |
| | |||||
* | vulkan: Implement FidelityFX Super Resolution | Marshall Mohror | 2021-11-16 | 9 | -17/+637 |
| | |||||
* | Texture Cache: Rescale conversions between depth and color | FernandoS27 | 2021-11-16 | 6 | -25/+37 |
| | |||||
* | Texture cache: Fix memory consumption and ignore rating when a depth texture is rendered. | Fernando Sahmkow | 2021-11-16 | 3 | -7/+19 |
| | |||||
* | vulkan: Fix rescaling push constant usage | ameerj | 2021-11-16 | 4 | -35/+42 |
| | |||||
* | Texture Cahe: Fix downscaling on SMO. | Fernando Sahmkow | 2021-11-16 | 3 | -0/+8 |
| | |||||
* | texture_cache_base: Remove unused function declarations | ameerj | 2021-11-16 | 1 | -8/+0 |
| | |||||
* | vk_texture_cache: Use 3D to scale images when blit is unsupported | ameerj | 2021-11-16 | 4 | -29/+87 |
| | |||||
* | texture_cache: Fix infinitely recursive ImageCanRescale check | ameerj | 2021-11-16 | 3 | -10/+13 |
| | |||||
* | vk_texture_cache: Fix BlitScale of non-2D images | ameerj | 2021-11-16 | 1 | -10/+9 |
| | |||||
* | video_core: Refactor resolution scale function | ameerj | 2021-11-16 | 3 | -46/+20 |
| | |||||
* | texture_cache: Fix image resolves when src/dst are not both scaled | ameerj | 2021-11-16 | 1 | -5/+8 |
| | |||||
* | video_core,yuzu: Move UpdateRescalingInfo call to video_core | lat9nq | 2021-11-16 | 1 | -0/+2 |
| | | | | | | This only needs to happen once per game boot, so we can just call it during CreateGPU and be done with it, avoiding the need to call it in the frontends. | ||||
* | gl_texture_cache: Disable scissor test when scaling textures | ameerj | 2021-11-16 | 1 | -0/+8 |
| | | | | Fixes a bug on BOTW where some objects were no longer being rendered after blitting | ||||
* | vk_texture_cache: Fix unsupported blit format error checking | ameerj | 2021-11-16 | 2 | -9/+9 |
| | |||||
* | vk_texture_cache: Fix early returns on unsupported scales | ameerj | 2021-11-16 | 2 | -19/+11 |
| | |||||
* | video_core: Misc resolution scaling related refactoring | ameerj | 2021-11-16 | 7 | -46/+50 |
| | |||||
* | texture_cache: Refactor scaled image size calculation | ameerj | 2021-11-16 | 2 | -12/+13 |
| | |||||
* | Texture Cache: Fix calculations when scaling. | Fernando Sahmkow | 2021-11-16 | 1 | -0/+12 |
| | |||||
* | gl_texture_cache: Fix BGR pbo size for scaled textures | ameerj | 2021-11-16 | 1 | -11/+10 |
| | |||||
* | Texture Cache: Fix Rescaling on Multisample | Fernando Sahmkow | 2021-11-16 | 3 | -8/+21 |
| | |||||
* | TextureCache: Base fixes on rescaling. | Fernando Sahmkow | 2021-11-16 | 2 | -4/+6 |
| | |||||
* | vk_texture_cache: Simplify scaled image management | ameerj | 2021-11-16 | 2 | -107/+34 |
| | |||||
* | gl_texture_cache: Fix scaling backup logic | ameerj | 2021-11-16 | 2 | -20/+16 |
| | |||||
* | vk_rasterizer: Fix scaling on Y_NEGATE | ameerj | 2021-11-16 | 1 | -3/+9 |
| | |||||
* | vk_texture_cache: Use nearest neighbor scaling when available | ameerj | 2021-11-16 | 4 | -29/+36 |
| | |||||
* | gl_texture_cache: Fix depth and integer format scaling blits | ameerj | 2021-11-16 | 2 | -16/+61 |
| | |||||
* | gl_texture_cache/rescaling_pass: minor cleanup | ameerj | 2021-11-16 | 2 | -4/+2 |
| | |||||
* | vk_texture_cache: Minor cleanup | ameerj | 2021-11-16 | 2 | -11/+8 |
| | |||||
* | image_info: Mark MSAA textures as non-rescalable | ameerj | 2021-11-16 | 1 | -2/+2 |
| | | | | Blitting or resolving multisampled images requires the dimensions of the src and dst to be equal for valid usage, making them difficult for resolution scaling using the current implementation. | ||||
* | gl_texture_cache: Simplify scaling | ameerj | 2021-11-16 | 2 | -31/+39 |
| | | | | | We don't need to reconstruct new textures every time we ScaleUp/ScaleDown. We can scale up once, and revert to the original texture whenever scaling down. Fixes memory leaks due to glDeleteTextures being deferred for later handling on some drivers | ||||
* | Renderers: Unify post processing filter shaders | ameerj | 2021-11-16 | 7 | -211/+36 |
| | |||||
* | gl_texture_cache: fix scaling on upload | ameerj | 2021-11-16 | 1 | -0/+7 |
| | |||||
* | Renderer: Implement Bicubic and ScaleForce filters. | Fernando Sahmkow | 2021-11-16 | 9 | -13/+537 |
| | |||||
* | Texture Cache: fix scaling on upload and stop scaling on base resolution. | Fernando Sahmkow | 2021-11-16 | 1 | -14/+32 |
| | |||||
* | shader, video_core: Fix GCC build errors | ameerj | 2021-11-16 | 2 | -10/+3 |
| | |||||
* | emit_spirv: Fix RescalingLayout alignment | ameerj | 2021-11-16 | 2 | -4/+7 |
| | |||||
* | TextureCache: Fix Buffer Views Scaling. | Fernando Sahmkow | 2021-11-16 | 2 | -5/+9 |
| | |||||
* | Texture Cache: Correctly fix Blits Rescaling. | Fernando Sahmkow | 2021-11-16 | 1 | -9/+12 |
| | |||||
* | texture_cache: Disable dst_image scaling in BlitImage | ameerj | 2021-11-16 | 1 | -5/+7 |
| | | | | Fixes scaling in Super Mario Party | ||||
* | emit_spirv: Fix RescalingLayout alignment | ameerj | 2021-11-16 | 1 | -1/+1 |
| | |||||
* | shader: Properly scale image reads and add GL SPIR-V support | ReinUsesLisp | 2021-11-16 | 5 | -26/+57 |
| | | | | Thanks for everything! | ||||
* | shader: Properly blacklist and scale image loads | ReinUsesLisp | 2021-11-16 | 4 | -8/+12 |
| | |||||
* | texture_cache: Add getter to query if image view is rescaled | ReinUsesLisp | 2021-11-16 | 5 | -22/+12 |
| | |||||
* | vk_rasterizer: Minor style change | ReinUsesLisp | 2021-11-16 | 1 | -2/+2 |
| | |||||
* | gl_texture_cache: Fix scaling blits | ReinUsesLisp | 2021-11-16 | 1 | -20/+12 |
| | |||||
* | glsl/glasm: Pass and use scaling parameters in shaders | ReinUsesLisp | 2021-11-16 | 3 | -21/+40 |
| | |||||
* | gl_rasterizer: Properly scale viewports and scissors | ReinUsesLisp | 2021-11-16 | 1 | -23/+24 |
| | |||||
* | gl_texture_cache: Fix multi layered texture Scale | ameerj | 2021-11-16 | 1 | -11/+15 |
| | |||||
* | gl_compute_pipeline: Add downscale factor to shader uniforms | ameerj | 2021-11-16 | 1 | -0/+9 |
| | |||||
* | gl_rasterizer: Fix rescale dirty state checking | ameerj | 2021-11-16 | 1 | -4/+9 |
| | |||||
* | gl_graphics_pipeline: Add downscale factor to shader uniforms | ameerj | 2021-11-16 | 1 | -1/+14 |
| | |||||
* | texture_cache: Fix blacklists on compute | ReinUsesLisp | 2021-11-16 | 1 | -1/+1 |
| | |||||
* | texture_cache: Simplify image view queries and blacklisting | ReinUsesLisp | 2021-11-16 | 16 | -192/+192 |
| | |||||
* | Vulkan: Fix downscaling Blit. | Fernando Sahmkow | 2021-11-16 | 1 | -14/+18 |
| | |||||
* | Texture Cache: Implement Rating System. | Fernando Sahmkow | 2021-11-16 | 5 | -15/+47 |
| | |||||
* | OpenGL: set linear mag filter when blitting a downscaled image. | Fernando Sahmkow | 2021-11-16 | 1 | -0/+1 |
| | |||||
* | Vulkan: Fix AA when rescaling. | Fernando Sahmkow | 2021-11-16 | 1 | -1/+1 |
| | |||||
* | Texture Cache: Implement Blacklisting. | Fernando Sahmkow | 2021-11-16 | 5 | -4/+90 |
| | |||||
* | vulkan: Implement rescaling shader patching | ReinUsesLisp | 2021-11-16 | 8 | -27/+103 |
| | |||||
* | vk_texture_cache: Properly scale blit source images | ReinUsesLisp | 2021-11-16 | 1 | -2/+2 |
| | |||||
* | vk_graphics_pipeline: Use Shader::NumDescriptors when possible | ReinUsesLisp | 2021-11-16 | 1 | -18/+6 |
| | |||||
* | opengl: Use Shader::NumDescriptors when possible | ReinUsesLisp | 2021-11-16 | 3 | -46/+20 |
| | |||||
* | texture_cache: Add image getters | ReinUsesLisp | 2021-11-16 | 2 | -0/+16 |
| | |||||
* | gl_texture_cache: Simplify rescaling | ameerj | 2021-11-16 | 2 | -19/+15 |
| | |||||
* | texture_cache: Fix typo in aliased image rescaling | ameerj | 2021-11-16 | 1 | -1/+1 |
| | |||||
* | vk_texture_cache: Simplify and optimize scaling blits | ReinUsesLisp | 2021-11-16 | 1 | -106/+62 |
| | |||||
* | vk_texture_cache: Fix scaling blit validation errors | ReinUsesLisp | 2021-11-16 | 1 | -81/+78 |
| | |||||
* | gl_texture_cache: Implement ScaleDown | ameerj | 2021-11-16 | 2 | -26/+36 |
| | |||||
* | gl_texture_cache: Rescale fixes for multi-layered textures | ameerj | 2021-11-16 | 2 | -16/+32 |
| | |||||
* | Texture Cache: Implement Rescaling on Aliases and Blits. | Fernando Sahmkow | 2021-11-16 | 1 | -5/+53 |
| | |||||
* | Fix blits with mips | ReinUsesLisp | 2021-11-16 | 1 | -12/+16 |
| | |||||
* | Fix blits | ReinUsesLisp | 2021-11-16 | 1 | -10/+10 |
| | |||||
* | renderer_gl: Resolution scaling fixes | ameerj | 2021-11-16 | 3 | -61/+107 |
| | |||||
* | TextureCache: Fix rescaling of ImageCopies | Fernando Sahmkow | 2021-11-16 | 3 | -18/+67 |
| | |||||
* | TextureCache: Modify Viewports/Scissors according to Rescale. | Fernando Sahmkow | 2021-11-16 | 5 | -35/+89 |
| | |||||
* | Settings: eliminate rescaling_factor. | Fernando Sahmkow | 2021-11-16 | 2 | -6/+5 |
| | |||||
* | Texture Cache: More rescaling fixes. | Fernando Sahmkow | 2021-11-16 | 4 | -84/+96 |
| | |||||
* | gl_texture_cache: WIP texture rescale | ameerj | 2021-11-16 | 2 | -3/+69 |
| | |||||
* | Texture Cache: Implement Vulkan UpScaling & DownScaling | Fernando Sahmkow | 2021-11-16 | 6 | -42/+327 |
| | |||||
* | VideoCore: Initial Setup for the Resolution Scaler. | Fernando Sahmkow | 2021-11-16 | 9 | -18/+236 |
| | |||||
* | codes: Rename ComposeFrameHeader to ComposeFrame | ameerj | 2021-11-13 | 7 | -14/+14 |
| | | | | These functions were composing the entire frame, not just the headers. Rename to more accurately describe them. | ||||
* | vp8: Implement header composition | ameerj | 2021-11-13 | 4 | -6/+90 |
| | | | | Enables frame decoding with FFmpeg | ||||
* | codecs: Add VP8 codec class | ameerj | 2021-11-13 | 9 | -20/+90 |
| | |||||
* | vulkan_device: Add missing vulkan image format R5G6B5 in GetFormatProperties | Feng Chen | 2021-11-05 | 1 | -0/+1 |
| | | | - Used by Dragon Quest Builders | ||||
* | gl_rasterizer: Remove unused includes | Morph | 2021-11-01 | 1 | -4/+2 |
| | | | | This removes unused includes, especially the core includes which were causing this file to be recompiled every time files included by those headers are modified. | ||||
* | gl_device: Force GLASM on NVIDIA drivers 495-496 | lat9nq | 2021-10-29 | 1 | -0/+15 |
| | | | | | | | | | GLSL shaders currently do not render correctly on the recent NVIDIA drivers. This adds a check that forces assembly shaders for these drivers since they seem unaffected and adds a warning informing of the decision. Developers can disable the check by enabling graphics debugging. | ||||
* | Merge pull request #7206 from vonchenplus/fix_vulkan_viewport_issue | Fernando S | 2021-10-24 | 1 | -0/+1 |
|\ | | | | | Vulkan Rasterizer: Fix viewport issue | ||||
| * | Fix vulkan viewport issue | Feng Chen | 2021-10-22 | 1 | -0/+1 |
| | | |||||
* | | Merge pull request #7070 from FernandoS27/want-you-bad | Ameer J | 2021-10-24 | 6 | -3/+31 |
|\ \ | | | | | | | Vulkan Rasterizer: Correct DepthBias/PolygonOffset on Vulkan. | ||||
| * | | Vulran Rasterizer: address feedback. | Fernando Sahmkow | 2021-10-23 | 1 | -3/+5 |
| | | | |||||
| * | | Vulkan Rasterizer: Correct DepthBias/PolygonOffset on Vulkan. | Fernando Sahmkow | 2021-09-23 | 6 | -3/+29 |
| | | | |||||
* | | | Merge pull request #7198 from ameerj/settings-chrono | bunnei | 2021-10-19 | 1 | -0/+1 |
|\ \ \ | |_|/ |/| | | settings: Remove std::chrono usage | ||||
| * | | settings: Remove std::chrono usage | ameerj | 2021-10-17 | 1 | -0/+1 |
| | | | | | | | | | | | | Alleviates the dependency on chrono for all files that include settings.h | ||||
* | | | Merge pull request #7077 from FernandoS27/face-down | Ameer J | 2021-10-17 | 2 | -5/+6 |
|\ \ \ | | | | | | | | | A series of fixes to queries and indexed samplers. | ||||
| * | | | Vulkan Query Cache: make sure to wait for the query result. | Fernando Sahmkow | 2021-09-24 | 1 | -1/+2 |
| | | | | |||||
| * | | | QueryCache: Flush queries in order of running. | Fernando Sahmkow | 2021-09-24 | 1 | -4/+4 |
| | |/ | |/| | |||||
* | | | Merge pull request #7127 from FernandoS27/i-saw-a-wabbit | Ameer J | 2021-10-17 | 2 | -5/+14 |
|\ \ \ | |_|/ |/| | | A few fixes on Vulkan and Rasterizer Caching | ||||
| * | | Vulkan: Fix failing barrier on refresh. | Fernando Sahmkow | 2021-10-04 | 1 | -1/+2 |
| | | | |||||
| * | | RasterizerInterface: Correct size of CPU addresses to cache. | FernandoS27 | 2021-10-04 | 1 | -1/+1 |
| | | | |||||
| * | | Vulkan: Fix the master Semaphore | FernandoS27 | 2021-10-04 | 1 | -4/+12 |
| | | | |||||
* | | | Merge pull request #7157 from ameerj/vic-surface-size | Morph | 2021-10-14 | 1 | -16/+15 |
|\ \ \ | | | | | | | | | vic: Use the minimum of surface/frame dimensions when writing the final frame to the GPU | ||||
| * | | | vic: Use the minimum of surface/frame dimensions when writing the final frame to the GPU | ameerj | 2021-10-11 | 1 | -16/+15 |
| | | | | | | | | | | | | | | | | Addresses possible buffer overflow behavior. | ||||
* | | | | Merge pull request #7109 from vonchenplus/fix_h264_max__reference_num_error | Ameer J | 2021-10-12 | 1 | -1/+6 |
|\ \ \ \ | |/ / / |/| | | | h264: Use max allowed max_num_ref_frames when using CPU decoding | ||||
| * | | | h264: Use max allowed max_num_ref_frames when using CPU decoding | Feng Chen | 2021-10-10 | 1 | -1/+6 |
| | |/ | |/| | |||||
* | | | vic: Allow surface to be higher than frame | Valeri | 2021-10-09 | 1 | -2/+3 |
| | | | | | | | | | | | | | | | Touhou Genso Wanderer Lotus Labyrinth R decodes 1920x1080 videos into 1920x1088 surface. Only allow mismatch for height, since larger width would result in increasingly offset rows and somewhat defeat entire purpose of this check. | ||||
* | | | vic: Avoid memory corruption when multiple streams with different dimensions are decoded | ameerj | 2021-10-08 | 1 | -0/+9 |
| | | | | | | | | | | | | This is a work around to avoid buffer overflow errors until multi channel/multi stream decoding is supported. | ||||
* | | | vic: Refactor frame writing methods | ameerj | 2021-10-07 | 2 | -138/+146 |
| | | | |||||
* | | | vic: Implement RGBX frame format | ameerj | 2021-10-07 | 2 | -3/+15 |
| | | | |||||
* | | | Merge pull request #7118 from ameerj/vc-gpu-impl | Fernando S | 2021-10-06 | 16 | -639/+854 |
|\ \ \ | | | | | | | | | gpu: Migrate implementation to the cpp file | ||||
| * | | | nvhost_ctrl: Refactor usage of gpu.LockSync() | ameerj | 2021-10-03 | 2 | -20/+1 |
| | | | | | | | | | | | | | | | | This seems to only be used to protect a later gpu function call. So we can move the lock into that call instead. | ||||
| * | | | gpu: Migrate implementation to the cpp file | ameerj | 2021-10-03 | 17 | -627/+862 |
| | | | | |||||
* | | | | Merge pull request #7115 from ameerj/log-compile | bunnei | 2021-10-05 | 1 | -0/+1 |
|\ \ \ \ | | | | | | | | | | | common/logging: Reduce dependent header include overhead | ||||
| * | | | | common/logging: Move Log::Entry declaration to a separate header | ameerj | 2021-10-02 | 1 | -0/+1 |
| |/ / / | | | | | | | | | | | | | This reduces the load of requiring to include std::chrono in all files which include log.h | ||||
* | | | | Merge pull request #7101 from ameerj/vk-tess-topology | bunnei | 2021-10-05 | 1 | -1/+10 |
|\ \ \ \ | |/ / / |/| | | | vk_graphics_pipeline: Force patch list topology when tessellation is used | ||||
| * | | | vk_graphics_pipeline: Force patch list topology when tessellation is used | ameerj | 2021-09-28 | 1 | -1/+10 |
| | | | | | | | | | | | | | | | | Fixes a crash on some drivers when tessellation is used but the IA topology is not patch list. | ||||
* | | | | Merge pull request #7061 from ameerj/dma-buffer-misc | bunnei | 2021-09-30 | 4 | -39/+37 |
|\ \ \ \ | | | | | | | | | | | buffer_cache, maxwell_dma: Minor refactoring and code fixes | ||||
| * | | | | maxwell_dma: Minor refactoring | ameerj | 2021-09-20 | 2 | -33/+33 |
| | | | | | |||||
| * | | | | buffer_cache: Minor fixes | ameerj | 2021-09-20 | 2 | -6/+4 |
| | | | | | | | | | | | | | | | | | | | | Loop through the tmp_intervals by reference, rather than by copy, and fix gl clear buffer size calculation. | ||||
* | | | | | Merge pull request #7036 from ameerj/ogl-bgr-v2 | bunnei | 2021-09-30 | 7 | -118/+59 |
|\ \ \ \ \ | |_|/ / / |/| | | | | gl_texture_cache: Unify BGR copy passes using PBOs | ||||
| * | | | | host_shaders: Remove opengl_copy_bgra.comp | ameerj | 2021-09-17 | 4 | -19/+0 |
| | | | | | |||||
| * | | | | gl_texture_cache: Migrate BGRCopyPass from util_shaders | ameerj | 2021-09-17 | 4 | -42/+48 |
| | | | | | | | | | | | | | | | | | | | | The BGR copies no longer use shaders. | ||||
| * | | | | util_shaders: Unify BGRA copy passes | ameerj | 2021-09-16 | 5 | -82/+36 |
| | | | | | |||||
* | | | | | Merge pull request #7076 from ameerj/amd-botw | bunnei | 2021-09-28 | 3 | -11/+22 |
|\ \ \ \ \ | | | | | | | | | | | | | vk_texture_cache: Disable cube compatibility flag on non-mesa AMD GCN4 and earlier | ||||
| * | | | | | vk_texture_cache: Disable cube compatibility flag on non-mesa AMD GCN4 and earlier | ameerj | 2021-09-24 | 3 | -11/+22 |
| | |_|/ / | |/| | | | | | | | | | | | | | Fixes rainbow textures on BOTW. | ||||
* | | | | | Merge pull request #7078 from ameerj/vc-jthread-fixes | Morph | 2021-09-26 | 1 | -1/+1 |
|\ \ \ \ \ | | | | | | | | | | | | | video_core: Fix jthread related hangs when stopping emulation | ||||
| * | | | | | video_core: Fix jthread related hangs when stopping emulation | ameerj | 2021-09-24 | 1 | -1/+1 |
| |/ / / / | | | | | | | | | | | | | | | | jthread on some compilers is more picky when it comes to the order in which objects are destroyed. | ||||
* / / / / | general: Update style to clang-format-12 | ameerj | 2021-09-24 | 3 | -13/+9 |
|/ / / / | |||||
* | | | | Merge pull request #7003 from ameerj/unlocked-present-mode | bunnei | 2021-09-20 | 3 | -4/+38 |
|\ \ \ \ | |_|/ / |/| | | | vk_swapchain: Use immediate present mode when mailbox is unavailable and FPS is unlocked | ||||
| * | | | vk_swapchain: Use immediate present mode when mailbox is unavailable and FPS is unlocked | ameerj | 2021-09-13 | 3 | -4/+38 |
| | | | | | | | | | | | | | | | | Allows drivers that do not support VK_PRESENT_MODE_MAILBOX_KHR the ability to present at a framerate higher than the monitor's refresh rate when the FPS is unlocked. | ||||
* | | | | Merge pull request #7019 from ameerj/videocore-jthread | bunnei | 2021-09-19 | 6 | -81/+26 |
|\ \ \ \ | |_|_|/ |/| | | | videocore: Use std::jthread for worker threads | ||||
| * | | | vk_scheduler: Use std::jthread | ameerj | 2021-09-16 | 2 | -17/+9 |
| | | | | |||||
| * | | | gpu: Use std::jthread for async gpu thread | ameerj | 2021-09-16 | 4 | -64/+17 |
| | |/ | |/| | |||||
* | | | Merge pull request #7011 from ameerj/vk-validation-0x0 | bunnei | 2021-09-17 | 1 | -0/+1 |
|\ \ \ | | | | | | | | | vulkan_debug_callback: Ignore InvalidCommandBuffer-VkDescriptorSet errors | ||||
| * | | | vulkan_debug_callback: Ignore InvalidCommandBuffer-VkDescriptorSet errors | ameerj | 2021-09-14 | 1 | -0/+1 |
| |/ / | | | | | | | | | | | | | | | | This validation error is spammed on some titles, asserting that VkDescriptorSet 0x0[] was destroyed. This is likely a validation layer bug when using VK_KHR_push_descriptor, which can avoid using traditional VkDescriptorSet. It should be safe to ignore for now. | ||||
* | | | Merge pull request #7027 from ameerj/sorry-amd | Fernando S | 2021-09-16 | 1 | -14/+3 |
|\ \ \ | | | | | | | | | vulkan_device: Reorder Float16Int8 declaration | ||||
| * | | | vulkan_device: Reorder Float16Int8 declaration | ameerj | 2021-09-16 | 1 | -1/+2 |
| | | | | | | | | | | | | | | | | This variable was going out of scope before its usage in the vulkan device creation, causing a crash on very specific drivers. | ||||
| * | | | Revert "Merge pull request #7006 from FernandoS27/a-motherfucking-driver" | ameerj | 2021-09-16 | 1 | -13/+1 |
| |/ / | | | | | | | | | | | | | This reverts commit 62e88d0e7455e37840db7e2a8e199bc6ca176966, reversing changes made to edf3da346f4ec0ca492b427f4f693d56e84abc52. | ||||
* / / | renderers: Log total pipeline count | Morph | 2021-09-14 | 2 | -0/+4 |
|/ / | |||||
* | | Merge pull request #6943 from FernandoS27/omae-wa-mou-shindeiru | Morph | 2021-09-13 | 1 | -6/+20 |
|\ \ | | | | | | | Vulkan: Disable VK_EXT_SAMPLER_FILTER_MINMAX in GCN AMD | ||||
| * | | Vulkan: Disable VK_EXT_SAMPLER_FILTER_MINMAX in GCN AMD since it's broken. | Fernando Sahmkow | 2021-09-13 | 1 | -6/+20 |
| | | | |||||
* | | | Merge pull request #7006 from FernandoS27/a-motherfucking-driver | Morph | 2021-09-13 | 1 | -1/+13 |
|\ \ \ | |/ / |/| | | Vulkan: Blacklist Int8Float16 Extension on AMD on driver 21.9.1 | ||||
| * | | Vulkan: Blacklist Int8Float16 Extension on AMD on driver 21.9.1 | Fernando Sahmkow | 2021-09-13 | 1 | -1/+13 |
| | | | |||||
* | | | Merge pull request #6944 from FernandoS27/dear-drunk-me | Morph | 2021-09-13 | 3 | -3/+14 |
|\ \ \ | | | | | | | | | Vulkan/Descriptors: Increase sets per pool on AMD propietary driver. | ||||
| * | | | Vulkan/Descriptors: Increase sets per pool on AMFD propietary driver. | Fernando Sahmkow | 2021-09-13 | 3 | -3/+14 |
| | | | | |||||
* | | | | Merge pull request #7001 from ameerj/wario-fix | Fernando S | 2021-09-13 | 1 | -6/+8 |
|\ \ \ \ | |_|_|/ |/| | | | vk_rasterizer: Fix dynamic StencilOp updating when two faces are enabled | ||||
| * | | | vk_rasterizer: Fix dynamic StencilOp updating when two faces are enabled | ameerj | 2021-09-12 | 1 | -6/+8 |
| | | | | | | | | | | | | | | | | This function was incorrectly using the stencil_two_side_enable register when dynamically updating the StencilOp. | ||||
* | | | | Merge pull request #7002 from ameerj/vk-state-unused | Mai M | 2021-09-12 | 1 | -4/+0 |
|\ \ \ \ | | | | | | | | | | | vk_state_tracker: Remove unused function | ||||
| * | | | | vk_state_tracker: Remove unused function | ameerj | 2021-09-12 | 1 | -4/+0 |
| |/ / / | |||||
* | | | | shader_environment: Add missing <algorithm> include | Morph | 2021-09-11 | 1 | -0/+1 |
| | | | | |||||
* | | | | vk_descriptor_pool: Add missing <algorithm> include | Morph | 2021-09-11 | 1 | -0/+1 |
| | | | | |||||
* | | | | slot_vector: Add missing <algorithm> include | Morph | 2021-09-11 | 1 | -0/+1 |
| | | | | |||||
* | | | | video_core/memory_manager: Add missing <algorithm> include | Morph | 2021-09-11 | 1 | -0/+2 |
| | | | | |||||
* | | | | codec: Add missing <string_view> include | Morph | 2021-09-11 | 1 | -0/+1 |
|/ / / | |||||
* | | | Merge pull request #6846 from ameerj/nvdec-gpu-decode | Fernando S | 2021-09-11 | 5 | -96/+144 |
|\ \ \ | | | | | | | | | nvdec: Add GPU video decoding for all capable drivers and platforms | ||||
| * | | | h264: Lower max_num_ref_frames | ameerj | 2021-08-16 | 1 | -1/+2 |
| | | | | | | | | | | | | | | | | GPU decoding seems to be more picky when it comes to the maximum number of reference frames. | ||||
| * | | | configure_graphics: Add GPU nvdec decoding as an option | ameerj | 2021-08-16 | 2 | -2/+7 |
| | | | | | | | | | | | | | | | | | | | | | | | | Some system configurations may see visual regressions or lower performance using GPU decoding compared to CPU decoding. This setting provides the option for users to specify their decoding preference. Co-Authored-By: yzct12345 <87620833+yzct12345@users.noreply.github.com> | ||||
| * | | | codec: Improve libav memory alloc and cleanup | ameerj | 2021-08-16 | 2 | -14/+19 |
| | | | | |||||
| * | | | codec: Fallback to CPU decoding if no compatible GPU format is found | ameerj | 2021-08-16 | 2 | -22/+32 |
| | | | | |||||
| * | | | cmake: Add VDPAU and NVDEC support to FFmpeg | lat9nq | 2021-08-16 | 1 | -0/+1 |
| | | | | | | | | | | | | | | | | Adds {h264_,vp9_}{nvdec,vdpau} hwaccels. | ||||
| * | | | codec: Replace deprecated av_init_packet usage | ameerj | 2021-08-12 | 1 | -9/+13 |
| | | | | |||||
| * | | | nvdec: Implement GPU accelerated decoding for all platforms | ameerj | 2021-08-12 | 2 | -70/+92 |
| | | | | | | | | | | | | | | | | Supplements the VAAPI intel gpu decoder by implementing the D3D11VA decoder for Windows, and CUVID/VDPAU for Nvidia and AMD on drivers linux respectively. | ||||
* | | | | Merge pull request #6901 from ameerj/vk-clear-bits | Fernando S | 2021-09-11 | 3 | -6/+24 |
|\ \ \ \ | | | | | | | | | | | vk_rasterizer: Only clear depth/stencil buffers when specified in attachment aspect mask | ||||
| * | | | | vk_rasterizer: Only clear depth and stencil buffers when set in attachment aspect mask | ameerj | 2021-08-21 | 3 | -6/+24 |
| | | | | | | | | | | | | | | | | | | | | Silences validation errors for clearing the depth/stencil buffers of framebuffer attachments that were not specified to have depth/stencil usage. | ||||
* | | | | | Merge pull request #6941 from ameerj/swapchain-srgb | Fernando S | 2021-09-11 | 5 | -11/+37 |
|\ \ \ \ \ | | | | | | | | | | | | | vk_swapchain: Prefer linear swapchain format when presenting sRGB images | ||||
| * | | | | | vulkan_device: Enable VK_KHR_swapchain_mutable_format if available | ameerj | 2021-08-29 | 3 | -0/+27 |
| | | | | | | | | | | | | | | | | | | | | | | | | Silences validation errors when creating sRGB image views of linear swapchain images | ||||
| * | | | | | vk_swapchain: Prefer linear swapchain format when presenting sRGB images | ameerj | 2021-08-29 | 3 | -11/+10 |
| | |_|/ / | |/| | | | | | | | | | | | | | Fixes broken sRGB when presenting from a secondary GPU. | ||||
* | | | | | Merge pull request #6953 from ameerj/anv-semaphore | Fernando S | 2021-09-11 | 5 | -26/+33 |
|\ \ \ \ \ | | | | | | | | | | | | | renderer_vulkan: Wait on present semaphore at queue submit | ||||
| * | | | | | renderer_vulkan: Wait on present semaphore at queue submit | ameerj | 2021-09-02 | 5 | -26/+33 |
| | |_|_|/ | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The present semaphore is being signalled by the call to acquire the swapchain image. This semaphore is meant to be waited on when rendering to the swapchain image. Currently it is waited on when presenting, but moving its usage to be waited on in the command buffer submission allows for proper usage of this semaphore. Fixes the device lost when launching titles on the Intel Linux Mesa driver. | ||||
* / | | | | Fix blend equation enum error | Feng Chen | 2021-09-07 | 1 | -4/+4 |
|/ / / / | |||||
* | | | | Merge pull request #6900 from ameerj/attr-reorder | bunnei | 2021-09-02 | 3 | -0/+7 |
|\ \ \ \ | | | | | | | | | | | structured_control_flow: Add DemoteCombinationPass | ||||
| * | | | | structured_control_flow: Conditionally invoke demote reorder pass | ameerj | 2021-08-30 | 3 | -0/+7 |
| |/ / / | | | | | | | | | | | | | This is only needed on select drivers when a fragment shader discards/demotes. | ||||
* | | | | Merge pull request #6897 from FernandoS27/pineapple-does-not-belong-in-pizza | bunnei | 2021-08-31 | 5 | -103/+80 |
|\ \ \ \ | | | | | | | | | | | Project <tentative title>: Rework Garbage Collection. | ||||
| * | | | | Garbage Collection: Make it more agressive on high priority mode. | Fernando Sahmkow | 2021-08-29 | 3 | -5/+5 |
| | | | | | |||||
| * | | | | Garbage Collection: Adress Feedback. | Fernando Sahmkow | 2021-08-29 | 3 | -5/+12 |
| | | | | | |||||
| * | | | | Garbage Collection: enable as default, eliminate option. | Fernando Sahmkow | 2021-08-28 | 2 | -3/+2 |
| | | | | | |||||
| * | | | | VideoCore: Rework Garbage Collection. | Fernando Sahmkow | 2021-08-28 | 5 | -101/+72 |
| |/ / / | |||||
* | | | | Merge pull request #6879 from ameerj/decoder-assert | bunnei | 2021-08-31 | 2 | -9/+3 |
|\ \ \ \ | | | | | | | | | | | vk_blit_screen: Fix non-accelerated texture size calculation | ||||
| * | | | | vk_blit_screen: Fix non-accelerated texture size calculation | ameerj | 2021-08-16 | 2 | -9/+3 |
| | | | | | | | | | | | | | | | | | | | | Addresses the potential OOB access in UnswizzleTexture. | ||||
* | | | | | vp9_types: Minor refactor of VP9 info structs. | ameerj | 2021-08-26 | 1 | -32/+29 |
| | | | | | |||||
* | | | | | vp9_types: Remove unused Vp9PictureInfo members | ameerj | 2021-08-26 | 2 | -24/+1 |
| |/ / / |/| | | | |||||
* | | | | Merge pull request #6919 from ameerj/vk-int8-capability | Fernando S | 2021-08-25 | 3 | -9/+19 |
|\ \ \ \ | | | | | | | | | | | vulkan_device: Add a check for int8 support | ||||
| * | | | | vulkan_device: Add a check for int8 support | ameerj | 2021-08-25 | 3 | -9/+19 |
| | |/ / | |/| | | | | | | | | | | Silences validation errors when shaders use int8 without specifying its support to the API | ||||
* | | | | Merge pull request #6894 from FernandoS27/bunneis-left-ear | Ameer J | 2021-08-25 | 1 | -0/+1 |
|\ \ \ \ | |/ / / |/| | | | GPU_MemoryManger: Fix GetSubmappedRange. | ||||
| * | | | GPU_MemoryManger: Fix GetSubmappedRange. | Fernando Sahmkow | 2021-08-19 | 1 | -0/+1 |
| | | | | |||||
* | | | | Merge pull request #6888 from v1993/patch-3 | Ameer J | 2021-08-21 | 1 | -1/+1 |
|\ \ \ \ | | | | | | | | | | | video_core: eliminate constant ternary | ||||
| * | | | | video_core: eliminate constant ternary | Valeri | 2021-08-19 | 1 | -1/+1 |
| |/ / / | | | | | | | | | `via_header_index` is already checked above, so it would never be true in this branch | ||||
* / / / | xbyak: Update include path | Merry | 2021-08-15 | 1 | -1/+1 |
|/ / / | |||||
* | | | Merge pull request #6861 from yzct12345/const-mempy-is-all-the-speed | bunnei | 2021-08-15 | 1 | -57/+116 |
|\ \ \ | |/ / |/| | | decoders: Optimize memcpy for the other functions | ||||
| * | | decoders: Templates allow memcpy optimizations | yzct12345 | 2021-08-12 | 1 | -57/+116 |
| |/ | |||||
* | | Merge pull request #6838 from ameerj/sws-align | bunnei | 2021-08-12 | 1 | -3/+2 |
|\ \ | |/ |/| | vic: Specify sws_scale height stride. | ||||
| * | vic: Specify sws_scale height stride. | ameerj | 2021-08-10 | 1 | -3/+2 |
| | | | | | | | | Silences a sws_scale runtime warning about unaligned strides. | ||||
* | | Merge pull request #6820 from yzct12345/split-cache | Fernando S | 2021-08-10 | 13 | -427/+420 |
|\ \ | | | | | | | texture_cache: Split out template definitions | ||||
| * | | texture_cache: Address ameerj's review | yzct12345 | 2021-08-08 | 3 | -7/+4 |
| | | | |||||
| * | | texture_cache: Address ameerj's review | yzct12345 | 2021-08-07 | 4 | -10/+5 |
| | | | |||||
| * | | texture_cache: Don't change copyright year | yzct12345 | 2021-08-05 | 4 | -4/+4 |
| | | | |||||
| * | | texture_cache: Address ameerj's review | yzct12345 | 2021-08-05 | 12 | -1821/+1821 |
| | | | |||||
| * | | texture_cache: Split templates out | yzct12345 | 2021-08-05 | 7 | -1532/+1533 |
| | | | |||||
* | | | Merge pull request #6844 from ameerj/vp9-empty-frame | Mai M | 2021-08-09 | 2 | -3/+3 |
|\ \ \ | | | | | | | | | vp9: Ensure the first frame is complete | ||||
| * | | | vp9: Ensure the first frame is complete | ameerj | 2021-08-08 | 2 | -3/+3 |
| | |/ | |/| | | | | | | | Silences a runtime error due to the first frame missing the frame data, and being set to hidden despite being a key-frame. | ||||
* | | | Merge pull request #6834 from K0bin/buffer-image-granularity | Fernando S | 2021-08-08 | 2 | -2/+8 |
|\ \ \ | |/ / |/| | | Respect Vulkan bufferImageGranularity | ||||
| * | | vulkan_memory_allocator: Respect bufferImageGranularity | Robin Kertels | 2021-08-07 | 2 | -2/+8 |
| | | | |||||
* | | | Merge pull request #6830 from ameerj/nvdec-unimpld-codec | bunnei | 2021-08-07 | 1 | -1/+1 |
|\ \ \ | |/ / |/| | | nvdec: Better logging for unimplemented codecs | ||||
| * | | nvdec: Better logging for unimplemented codecs | ameerj | 2021-08-07 | 1 | -1/+1 |
| | | | |||||
* | | | Merge pull request #6791 from ameerj/astc-opt | bunnei | 2021-08-07 | 7 | -421/+251 |
|\ \ \ | |/ / |/| | | astc_decoder: Various performance and memory optimizations | ||||
| * | | astc_decoder: Reduce workgroup size | ameerj | 2021-08-01 | 3 | -5/+5 |
| | | | | | | | | | | | | This reduces the amount of over dispatching when there are odd dimensions (i.e. ASTC 8x5), which rarely evenly divide into 32x32. | ||||
| * | | astc_decoder: Compute offset swizzles in-shader | ameerj | 2021-08-01 | 4 | -109/+25 |
| | | | | | | | | | | | | Alleviates the dependency on the swizzle table and a uniform which is constant for all ASTC texture sizes. | ||||
| * | | astc_decoder: Make use of uvec4 for payload data | ameerj | 2021-08-01 | 1 | -79/+43 |
| | | | |||||
| * | | astc_decoder: Simplify Select2DPartition | ameerj | 2021-08-01 | 1 | -38/+19 |
| | | | |||||
| * | | astc_decoder: Optimize the use EncodingData | ameerj | 2021-08-01 | 6 | -138/+108 |
| | | | | | | | | | | | | | | | | | | | | | This buffer was a list of EncodingData structures sorted by their bit length, with some duplication from the cpu decoder implementation. We can take advantage of its sorted property to optimize its usage in the shader. Thanks to wwylele for the optimization idea. | ||||
| * | | astc.h: Move data to cpp implementation | ameerj | 2021-08-01 | 2 | -64/+63 |
| | | | | | | | | | | | | Moves leftover values that are no longer used by the gpu decoder back to the cpp implementation. | ||||
* | | | Merge pull request #6799 from ameerj/vp9-fixes | bunnei | 2021-08-07 | 3 | -106/+48 |
|\ \ \ | |_|/ |/| | | nvdec: Fix VP9 reference frame refreshes | ||||
| * | | vp9: Cleanup unused variables | ameerj | 2021-08-07 | 3 | -58/+17 |
| | | | | | | | | | | | | | | | With reference frames refreshes fix, we no longer need to buffer two frames in advance. We can also remove other unused or otherwise unneeded variables. | ||||
| * | | vp9: Fix reference frame refreshes | ameerj | 2021-08-07 | 2 | -48/+31 |
| |/ | | | | | | | This resolves the artifacting when decoding VP9 streams. | ||||
* | | nvdec: Implement VA-API hardware video acceleration (#6713) | yzct12345 | 2021-08-04 | 5 | -72/+175 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * nvdec: VA-API * Verify formatting * Forgot a semicolon for Windows * Clarify comment about AV_PIX_FMT_NV12 * Fix assert log spam from missing negation * vic: Remove forgotten debug code * Address lioncash's review * Mention VA-API is Intel/AMD * Address v1993's review * Hopefully fix CMakeLists style this time * vic: Improve cache locality * vic: Fix off-by-one error * codec: Async * codec: Forgot the GetValue() * nvdec: Address ameerj's review * codec: Fallback to CPU without VA-API support * cmake: Address lat9nq's review * cmake: Make VA-API optional * vaapi: Multiple GPU * Apply suggestions from code review Co-authored-by: Ameer J <52414509+ameerj@users.noreply.github.com> * nvdec: Address ameerj's review * codec: Use anonymous instead of static * nvdec: Remove enum and fix memory leak * nvdec: Address ameerj's review * codec: Remove preparation for threading Co-authored-by: Ameer J <52414509+ameerj@users.noreply.github.com> | ||||
* | | decoders: Optimize swizzle copy performance (#6790) | yzct12345 | 2021-08-02 | 1 | -9/+43 |
| | | | | | | This makes UnswizzleTexture up to two times faster. It is the main bottleneck in NVDEC video decoding. | ||||
* | | Merge pull request #6720 from ameerj/vk-screenshot | Fernando S | 2021-08-01 | 7 | -73/+245 |
|\ \ | | | | | | | renderer_vulkan: Implement screenshots | ||||
| * | | renderers: Add explicit invert_y bool to screenshot callback | ameerj | 2021-07-29 | 4 | -5/+5 |
| | | | | | | | | | | | | OpenGL and Vulkan images render in different coordinate systems. This allows us to specify the coordinate system of the screenshot within each renderer | ||||
| * | | renderer_vulkan: Implement screenshots | ameerj | 2021-07-29 | 2 | -0/+152 |
| | | | |||||
| * | | vk_blit_screen: Add public CreateFramebuffer method | ameerj | 2021-07-29 | 2 | -14/+18 |
| | | | |||||
| * | | vk_blit_screen: Make Draw method more generic | ameerj | 2021-07-29 | 3 | -55/+71 |
| | | | | | | | | | | | | Allows specifying the framebuffer and render area dimensions, rather than being hard coded for the render window. | ||||
* | | | Merge pull request #6765 from ReinUsesLisp/y-negate-vk | Ameer J | 2021-08-01 | 1 | -2/+7 |
|\ \ \ | |_|/ |/| | | vk_rasterizer: Flip viewport on Y_NEGATE | ||||
| * | | vk_rasterizer: Flip viewport on Y_NEGATE | ReinUsesLisp | 2021-07-29 | 1 | -2/+7 |
| |/ | | | | | | | | | Matches OpenGL's behavior. I don't believe this register flips geometry, but we have to try to match behavior on both backends. | ||||
* | | Merge pull request #6759 from ReinUsesLisp/pipeline-statistics | bunnei | 2021-07-30 | 13 | -19/+278 |
|\ \ | |/ |/| | renderer_vulkan: Add setting to log pipeline statistics | ||||
| * | renderer_vulkan: Add setting to log pipeline statistics | ReinUsesLisp | 2021-07-28 | 13 | -19/+278 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Use VK_KHR_pipeline_executable_properties when enabled and available to log statistics about the pipeline cache in a game. For example, this is on Turing GPUs when generating a pipeline cache from Super Smash Bros. Ultimate: Average pipeline statistics ========================================== Code size: 6433.167 Register count: 32.939 More advanced results could be presented, at the moment it's just an average of all 3D and compute pipelines. | ||||
* | | Merge pull request #6749 from lioncash/rtarget | bunnei | 2021-07-28 | 1 | -3/+3 |
|\ \ | |/ |/| | render_target: Add missing initializer for size extent | ||||
| * | render_target: Add missing initializer for size extent | Lioncash | 2021-07-27 | 1 | -3/+3 |
| | | | | | | | | | | | | | | | | | | | | Everything else has a default constructor that does the straightforward thing of initializing most members to a default value, except for the size. We explicitly initialize the size (and others, for consistency), to prevent potential uninitialized reads from occurring. Particularly given the largeish surface area that this struct is used in. | ||||
* | | Merge pull request #6748 from lioncash/engine-init | Rodrigo Locatti | 2021-07-27 | 2 | -2/+2 |
|\ \ | | | | | | | video_core/engine: Consistently initialize rasterizer pointers | ||||
| * | | video_core/engine: Consistently initialize rasterizer pointers | Lioncash | 2021-07-27 | 2 | -2/+2 |
| |/ | | | | | | | | | Ensures all of the engines have consistent and deterministic initialization of the rasterizer pointers. | ||||
* | | Merge pull request #6745 from lioncash/copies | bunnei | 2021-07-27 | 3 | -5/+2 |
|\ \ | | | | | | | video_core: Remove some unused variables | ||||
| * | | buffer_cache: Remove unused small_vector in CommitAsyncFlushesHigh() | Lioncash | 2021-07-27 | 1 | -1/+0 |
| | | | | | | | | | | | | | | | Given this is non-trivial, the constructor is required to execute, so this removes a bit of redundant codegen. | ||||
| * | | gl_shader_cache: Remove unused variable | Lioncash | 2021-07-27 | 1 | -1/+0 |
| | | | |||||
| * | | vk_compute_pass: Remove unused captures | Lioncash | 2021-07-27 | 1 | -3/+2 |
| |/ | | | | | | | Resolves two compiler warnings. | ||||
* / | vulkan_wrapper: Fix SetObjectName() always indicating objects as images | Lioncash | 2021-07-27 | 1 | -1/+1 |
|/ | | | | We should be using the passed in object type instead. | ||||
* | Merge pull request #6696 from ameerj/speed-limit-rename | bunnei | 2021-07-27 | 3 | -12/+4 |
|\ | | | | | general: Rename "Frame Limit" references to "Speed Limit" | ||||
| * | renderer_base: Removed redundant settings | ameerj | 2021-07-24 | 3 | -12/+4 |
| | | | | | | | | | | use_framelimiter was not being used internally by the renderers. set_background_color was always set to true as there is no toggle for the renderer background color, instead users directly choose the color of their choice. | ||||
| * | general: Rename "Frame Limit" references to "Speed Limit" | ameerj | 2021-07-24 | 1 | -1/+1 |
| | | | | | | | | | | This setting is best referred to as a speed limit, as it involves the limits of all timing based aspects of the emulator, not only framerate. This allows us to differentiate it from the fps unlocker setting. | ||||
* | | Merge pull request #6741 from ReinUsesLisp/stream-remove | Rodrigo Locatti | 2021-07-27 | 2 | -244/+0 |
|\ \ | | | | | | | vk_stream_buffer: Remove unused stream buffer | ||||
| * | | vk_stream_buffer: Remove unused stream buffer | ReinUsesLisp | 2021-07-26 | 2 | -244/+0 |
| | | | | | | | | | | | | Remove unused file. | ||||
* | | | Merge pull request #6740 from K0bin/hvv-fallback | Rodrigo Locatti | 2021-07-27 | 1 | -8/+21 |
|\ \ \ | | | | | | | | | Handle allocation failure in Staging buffer | ||||
| * | | | vk_staging_buffer_pool: Fall back to host memory when allocation fails | Robin Kertels | 2021-07-26 | 1 | -8/+21 |
| |/ / | |||||
* | | | Merge pull request #6728 from ReinUsesLisp/null-buffer-usage | Rodrigo Locatti | 2021-07-26 | 1 | -3/+7 |
|\ \ \ | | | | | | | | | vk_buffer_cache: Add transform feedback usage to null buffer | ||||
| * | | | vk_buffer_cache: Add transform feedback usage to null buffer | ReinUsesLisp | 2021-07-26 | 1 | -3/+7 |
| |/ / | | | | | | | | | | Fixes bad API usages on Vulkan. | ||||
* / / | vk_compute_pass: Fix pipeline barrier for indexed quads | ReinUsesLisp | 2021-07-26 | 1 | -1/+1 |
|/ / | | | | | | | Use an index buffer barrier instead of a vertex input read barrier. | ||||
* | | Merge pull request #6585 from ameerj/hades | bunnei | 2021-07-25 | 163 | -26949/+7358 |
|\ \ | | | | | | | Shader Decompiler Rewrite | ||||
| * | | vulkan/blit_image: Commit descriptor sets within worker thread | ReinUsesLisp | 2021-07-23 | 1 | -9/+7 |
| | | | | | | | | | | | | | | | Fixes race condition caused. The descriptor pool is not thread safe, so we have to commit descriptor sets within the same thread. | ||||
| * | | vulkan_device: Blacklist Volta and older from VK_KHR_push_descriptor | ReinUsesLisp | 2021-07-23 | 1 | -4/+39 |
| | | | | | | | | | | | | | | | Causes crashes on Link's Awakening intro. It's hard to debug if it's our fault due to bugs in validation layers. | ||||
| * | | Revert "renderers: Disable async shader compilation" | ReinUsesLisp | 2021-07-23 | 2 | -5/+3 |
| | | | | | | | | | | | | This reverts commit 4a152767286717fa69bfc94846a124a366f70065. | ||||
| * | | opengl: Fix asynchronous shaders | ReinUsesLisp | 2021-07-23 | 2 | -4/+33 |
| | | | | | | | | | | | | | | | Wait for shader to build before configuring it, and wait for the shader to build before sharing it with other contexts. | ||||
| * | | shader_environment: Receive cache version from outside | ReinUsesLisp | 2021-07-23 | 4 | -16/+23 |
| | | | | | | | | | | | | This allows us invalidating OpenGL and Vulkan separately in the future. | ||||
| * | | shader: Fix disabled attribute default values | ameerj | 2021-07-23 | 1 | -1/+1 |
| | | | |||||
| * | | gl_device: Simplify GLASM setting logic | ameerj | 2021-07-23 | 1 | -15/+8 |
| | | | |||||
| * | | renderer_opengl: Use ARB_separate_shader_objects | ReinUsesLisp | 2021-07-23 | 9 | -116/+154 |
| | | | | | | | | | | | | | | | Ensures that states set for a particular stage are not attached to other stages which may not need them. | ||||
| * | | glsl: Clamp shared mem size to GL_MAX_COMPUTE_SHARED_MEMORY_SIZE | ameerj | 2021-07-23 | 1 | -0/+1 |
| | | | |||||
| * | | gl_shader_cache: Properly implement asynchronous shaders | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | | | |||||
| * | | shader_recompiler, video_core: Resolve clang errors | lat9nq | 2021-07-23 | 1 | -3/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Silences the following warnings-turned-errors: -Wsign-conversion -Wunused-private-field -Wbraced-scalar-init -Wunused-variable And some other errors | ||||
| * | | renderers: Fix clang formatting | ameerj | 2021-07-23 | 4 | -9/+13 |
| | | | |||||
| * | | renderers: Disable async shader compilation | ameerj | 2021-07-23 | 2 | -3/+5 |
| | | | | | | | | | | | | The current implementation is prone to causing graphical issues. Disable until a better solution is implemented. | ||||
| * | | maxwell_to_vk: Add R16_SNORM | ReinUsesLisp | 2021-07-23 | 2 | -1/+2 |
| | | | |||||
| * | | shader: Ignore global memory ops on devices lacking int64 support | ameerj | 2021-07-23 | 2 | -0/+2 |
| | | | |||||
| * | | vulkan_device: Add missing include algorithm | lat9nq | 2021-07-23 | 1 | -0/+1 |
| | | | |||||
| * | | vulkan_device: Blacklist ampere devices from float16 math | ameerj | 2021-07-23 | 2 | -12/+23 |
| | | | |||||
| * | | gl_shader_cache: Fixes for async shaders | ameerj | 2021-07-23 | 2 | -2/+25 |
| | | | |||||
| * | | vulkan_device: Enable VK_EXT_extended_dynamic_state on RADV 21.2 onward | ReinUsesLisp | 2021-07-23 | 1 | -4/+7 |
| | | | |||||
| * | | emit_spirv: Workaround VK_KHR_shader_float_controls on fp16 Nvidia | ReinUsesLisp | 2021-07-23 | 2 | -0/+2 |
| | | | | | | | | | | | | Fix regression on Fire Emblem: Three Houses when using native fp16. | ||||
| * | | vk_rasterizer: Workaround bug in VK_EXT_vertex_input_dynamic_state | ReinUsesLisp | 2021-07-23 | 4 | -19/+20 |
| | | | | | | | | | | | | | | | Workaround potential bug on Nvidia's driver where only updating high attributes leaves low attributes out dated. | ||||
| * | | shader: Fix disabled and unwritten attributes and varyings | ReinUsesLisp | 2021-07-23 | 1 | -15/+20 |
| | | | |||||
| * | | vk_graphics_pipeline: Implement smooth lines | ReinUsesLisp | 2021-07-23 | 5 | -5/+65 |
| | | | |||||
| * | | vk_graphics_pipeline: Implement line width | ReinUsesLisp | 2021-07-23 | 8 | -8/+36 |
| | | | |||||
| * | | video_core: Enable GL SPIR-V shaders | lat9nq | 2021-07-23 | 7 | -38/+105 |
| | | | |||||
| * | | general: Add setting shader_backend | lat9nq | 2021-07-23 | 1 | -4/+6 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | GLASM is getting good enough that we can move it out of advanced graphics settings. This removes the setting `use_assembly_shaders`, opting for a enum class `shader_backend`. This comes with the benefits that it is extensible for additional shader backends besides GLSL and GLASM, and this will work better with a QComboBox. Qt removes the related assembly shader setting from the Advanced Graphics section and places it as a new QComboBox in the API Settings group. This will replace the Vulkan device selector when OpenGL is selected. Additionally, mark all of the custom anisotropic filtering settings as "WILL BREAK THINGS", as that is the case with a select few games. | ||||
| * | | glasm: Add passthrough geometry shader support | ReinUsesLisp | 2021-07-23 | 3 | -1/+7 |
| | | | |||||
| * | | shader: Rework varyings and implement passthrough geometry shaders | ReinUsesLisp | 2021-07-23 | 7 | -15/+43 |
| | | | | | | | | | | | | | | | | | | Put all varyings into a single std::bitset with helpers to access it. Implement passthrough geometry shaders using host's. | ||||
| * | | vk_graphics_pipeline: Implement conservative rendering | ReinUsesLisp | 2021-07-23 | 6 | -10/+44 |
| | | | |||||
| * | | shader: Unify shader stage types | ReinUsesLisp | 2021-07-23 | 14 | -53/+28 |
| | | | |||||
| * | | shader: Emulate 64-bit integers when not supported | ReinUsesLisp | 2021-07-23 | 5 | -2/+13 |
| | | | | | | | | | | | | Useful for mobile and Intel Xe devices. | ||||
| * | | gl_graphics_pipeline: Fix assembly shaders check for transform feedbacks | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | | | |||||
| * | | gl_graphics_pipeline: Inline hash and operator== key functions | ReinUsesLisp | 2021-07-23 | 2 | -12/+8 |
| | | | |||||
| * | | gl_shader_cache: Check previous pipeline before checking hash map | ReinUsesLisp | 2021-07-23 | 5 | -29/+41 |
| | | | | | | | | | | | | Port optimization from Vulkan. | ||||
| * | | gl_graphics_pipeline: Port optimizations from Vulkan pipelines | ReinUsesLisp | 2021-07-23 | 2 | -57/+141 |
| | | | |||||
| * | | buffer_cache: Fix debugging leftover | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | | | |||||
| * | | buffer_cache: Fix size reductions not having in mind bind sizes | ReinUsesLisp | 2021-07-23 | 1 | -7/+23 |
| | | | | | | | | | | | | | | | A buffer binding can change between shaders without changing the shaders. This lead to outdated bindings on OpenGL. | ||||
| * | | shaders: Allow shader notify when async shaders is disabled | ameerj | 2021-07-23 | 2 | -11/+9 |
| | | | |||||
| * | | vk_graphics_pipeline: Use VK_KHR_push_descriptor when available | ReinUsesLisp | 2021-07-23 | 8 | -36/+88 |
| | | | | | | | | | | | | ~51% faster on Nvidia compared to previous method. | ||||
| * | | shader: Properly manage attributes not written from previous stages | ReinUsesLisp | 2021-07-23 | 2 | -5/+22 |
| | | | |||||
| * | | shader: Split profile and runtime info headers | ReinUsesLisp | 2021-07-23 | 2 | -1/+2 |
| | | | |||||
| * | | shader: Add support for native 16-bit floats | ReinUsesLisp | 2021-07-23 | 5 | -10/+24 |
| | | | |||||
| * | | shader: Rename maxwell/program.h to translate_program.h | ReinUsesLisp | 2021-07-23 | 2 | -2/+2 |
| | | | |||||
| * | | vulkan_device: Blacklist VK_EXT_vertex_input_dynamic_state on Intel | ReinUsesLisp | 2021-07-23 | 1 | -0/+4 |
| | | | |||||
| * | | glsl: Address rest of feedback | ameerj | 2021-07-23 | 4 | -17/+22 |
| | | | |||||
| * | | glsl: Conditionally use fine/coarse derivatives based on device support | ameerj | 2021-07-23 | 1 | -0/+1 |
| | | | |||||
| * | | glsl: Cleanup/Address feedback | ameerj | 2021-07-23 | 1 | -0/+2 |
| | | | |||||
| * | | gl_shader_cache: Implement async shaders | ameerj | 2021-07-23 | 7 | -107/+154 |
| | | | |||||
| * | | glsl: Add stubs for sparse queries and variable aoffi when not supported | ameerj | 2021-07-23 | 3 | -0/+8 |
| | | | |||||
| * | | gl_shader_cache: Remove const from pipeline source arguments | ameerj | 2021-07-23 | 4 | -6/+6 |
| | | | |||||
| * | | gl_shader_cache: Move OGL shader compilation to the respective Pipeline constructor | ameerj | 2021-07-23 | 5 | -76/+79 |
| | | | |||||
| * | | glsl: Address more feedback. Implement indexed texture reads | ameerj | 2021-07-23 | 1 | -3/+3 |
| | | | |||||
| * | | gl_rasterizer: Add texture fetch barrier for fragments | ameerj | 2021-07-23 | 1 | -1/+1 |
| | | | | | | | | | | | | Fixes flicker seen in XC2 | ||||
| * | | glsl: Implement fswzadd | ameerj | 2021-07-23 | 1 | -0/+1 |
| | | | | | | | | | | | | and wip nv thread shuffle impl | ||||
| * | | glsl: Rebase fixes | ameerj | 2021-07-23 | 2 | -3/+5 |
| | | | |||||
| * | | glsl: Use textureGrad fallback when EXT_texture_shadow_lod is unsupported | ameerj | 2021-07-23 | 1 | -0/+1 |
| | | | |||||
| * | | glsl: skip gl_ViewportIndex write if device does not support it | ameerj | 2021-07-23 | 1 | -0/+1 |
| | | | |||||
| * | | glsl: Implement transform feedback | ameerj | 2021-07-23 | 1 | -5/+13 |
| | | | |||||
| * | | glsl: Implement VOTE for subgroup size potentially larger | ameerj | 2021-07-23 | 3 | -1/+7 |
| | | | |||||
| * | | glsl: Implement some attribute getters and setters | ameerj | 2021-07-23 | 1 | -1/+0 |
| | | | |||||
| * | | glsl: Query GL Device for FP16 extension support | ameerj | 2021-07-23 | 3 | -0/+14 |
| | | | |||||
| * | | glsl: Fixup build issues | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | | | |||||
| * | | glsl: Initial backend | ameerj | 2021-07-23 | 1 | -2/+5 |
| | | | |||||
| * | | vk_rasterizer: Exit render passes on fragment barriers | ReinUsesLisp | 2021-07-23 | 1 | -0/+1 |
| | | | |||||
| * | | vk_graphics_pipeline: Fix path with no VK_EXT_extended_dynamic_state | Rodrigo Locatti | 2021-07-23 | 1 | -1/+1 |
| | | | |||||
| * | | buffer_cache: Invalidate fast buffers on compute | ReinUsesLisp | 2021-07-23 | 1 | -0/+1 |
| | | | |||||
| * | | shader: Add shader loop safety check settings | lat9nq | 2021-07-23 | 1 | -2/+2 |
| | | | | | | | | | | | | Also add a setting for enable Nsight Aftermath. | ||||
| * | | vulkan_device: Enable VK_EXT_vertex_input_dynamic_state | ReinUsesLisp | 2021-07-23 | 1 | -0/+28 |
| | | | |||||
| * | | vk_pipeline_cache: Skip cached pipelines with different dynamic state | ReinUsesLisp | 2021-07-23 | 1 | -0/+6 |
| | | | |||||
| * | | vulkan: Add VK_EXT_vertex_input_dynamic_state support | ReinUsesLisp | 2021-07-23 | 11 | -116/+291 |
| | | | | | | | | | | | | | | | Reduces the number of total pipelines generated on Vulkan. Tested on Super Smash Bros. Ultimate. | ||||
| * | | shader: Reorder shader cache directories | ReinUsesLisp | 2021-07-23 | 2 | -18/+12 |
| | | | |||||
| * | | vk_rasterizer: Implement first index | ReinUsesLisp | 2021-07-23 | 1 | -2/+5 |
| | | | |||||
| * | | vulkan: Use VK_EXT_provoking_vertex when available | ReinUsesLisp | 2021-07-23 | 6 | -4/+52 |
| | | | |||||
| * | | gl_buffer_cache: Use unorm internal formats for snorm texture buffer views | ameerj | 2021-07-23 | 1 | -1/+24 |
| | | | | | | | | | | | | Fixes black textures in UE4 games | ||||
| * | | shader_environment: Fix local memory size calculations | ReinUsesLisp | 2021-07-23 | 1 | -3/+3 |
| | | | |||||
| * | | buffer_cache: Fix copy based uniform bindings tracking | ReinUsesLisp | 2021-07-23 | 2 | -9/+22 |
| | | | |||||
| * | | shader_environment: Add shader_local_memory_crs_size to local memory size | ameerj | 2021-07-23 | 1 | -1/+1 |
| | | | | | | | | | | | | Fixes DOOM 2016 missing local memory | ||||
| * | | gl_texture_cache: Create image storage views | ReinUsesLisp | 2021-07-23 | 4 | -38/+126 |
| | | | | | | | | | | | | Fixes SULD.D tests. | ||||
| * | | gl_shader_util: Move shader utility code to a separate file | ReinUsesLisp | 2021-07-23 | 7 | -245/+106 |
| | | | |||||
| * | | gl_shader_cache: Store workers in shader cache object | ReinUsesLisp | 2021-07-23 | 2 | -58/+78 |
| | | | |||||
| * | | vk_pipeline_cache,shader_notify: Add shader notifications | ReinUsesLisp | 2021-07-23 | 9 | -90/+121 |
| | | | |||||
| * | | vk_pipeline_cache: Add asynchronous shaders | ReinUsesLisp | 2021-07-23 | 3 | -3/+33 |
| | | | |||||
| * | | vk_rasterizer: Flush work on clear and dispatches | ReinUsesLisp | 2021-07-23 | 1 | -0/+3 |
| | | | |||||
| * | | DMA: Restrict optimised path for BlockToLinear further. | FernandoS27 | 2021-07-23 | 1 | -1/+2 |
| | | | |||||
| * | | vk_swapchain: Handle outdated swapchains | ReinUsesLisp | 2021-07-23 | 3 | -17/+34 |
| | | | | | | | | | | | | Fixes pixelated presentation on Intel devices. | ||||
| * | | shader: Fix VertexA Shaders. | FernandoS27 | 2021-07-23 | 1 | -5/+21 |
| | | | |||||
| * | | vk_buffer_cache: Handle null texture buffers | ReinUsesLisp | 2021-07-23 | 1 | -0/+4 |
| | | | | | | | | | | | | Fixes a crash on Age of Calamity cutscenes. | ||||
| * | | nsight_aftermath_tracker: Fix SPIR-V module writes | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | | | |||||
| * | | vk_pipeline_cache: Set support_derivative_control to true | ReinUsesLisp | 2021-07-23 | 1 | -0/+1 |
| | | | |||||
| * | | glasm: Use ARB_derivative_control conditionally | ReinUsesLisp | 2021-07-23 | 3 | -0/+7 |
| | | | |||||
| * | | buffer_cache: Reduce uniform buffer size from shader usage | ReinUsesLisp | 2021-07-23 | 9 | -35/+61 |
| | | | | | | | | | | | | Increases performance significantly on certain titles. | ||||
| * | | transform_feedback: Read buffer stride from index instead of layout | ReinUsesLisp | 2021-07-23 | 1 | -1/+2 |
| | | | |||||
| * | | fixed_pipeline_state: Use regular for loop instead of ranges for perf | ReinUsesLisp | 2021-07-23 | 1 | -2/+3 |
| | | | | | | | | | | | | MSVC generates better code for it. | ||||
| * | | vk_swapchain: Avoid recreating the swapchain on each frame | ReinUsesLisp | 2021-07-23 | 2 | -15/+9 |
| | | | | | | | | | | | | | | | Recreate only when requested (or sRGB is changed) instead of tracking the frontend's size. That size is still used as a hint. | ||||
| * | | vulkan: Conditionally use shaderInt16 | ReinUsesLisp | 2021-07-23 | 3 | -2/+9 |
| | | | | | | | | | | | | Add support for Polaris AMD devices. | ||||
| * | | vulkan: Enable depth bounds and use it conditionally | ReinUsesLisp | 2021-07-23 | 4 | -2/+17 |
| | | | | | | | | | | | | Intel devices pre-Xe don't support this. | ||||
| * | | vk_buffer_cache: Add transform feedback usage to buffers | ReinUsesLisp | 2021-07-23 | 1 | -15/+22 |
| | | | |||||
| * | | opengl: Declare fragment outputs even if they are not used | ReinUsesLisp | 2021-07-23 | 2 | -0/+9 |
| | | | | | | | | | | | | | | | | | | Fixes Ori and the Blind Forest's menu on GLASM. For some reason (probably high level optimizations) it is not sanitized on SPIR-V for OpenGL. Vulkan is unaffected by this change. | ||||
| * | | buffer_cache: Mark uniform buffers as dirty if any enable bit changes | ReinUsesLisp | 2021-07-23 | 5 | -7/+17 |
| | | | |||||
| * | | vulkan_device: Enable float64 and int64 conditionally | ReinUsesLisp | 2021-07-23 | 2 | -2/+6 |
| | | | | | | | | | | | | Add Intel Xe support. | ||||
| * | | texture_cache: Reduce invalid image/sampler error severity | ReinUsesLisp | 2021-07-23 | 1 | -7/+7 |
| | | | |||||
| * | | shader: Handle host exceptions | ReinUsesLisp | 2021-07-23 | 4 | -32/+55 |
| | | | |||||
| * | | glasm: Prepare XFB from state instead of global registers | ReinUsesLisp | 2021-07-23 | 1 | -4/+2 |
| | | | |||||
| * | | glasm: Use storage buffers instead of global memory when possible | ReinUsesLisp | 2021-07-23 | 11 | -67/+120 |
| | | | |||||
| * | | gl_shader_cache: Add disk shader cache | ReinUsesLisp | 2021-07-23 | 3 | -11/+116 |
| | | | |||||
| * | | video_core,shader: Clang-format fixes | ReinUsesLisp | 2021-07-23 | 2 | -5/+10 |
| | | | |||||
| * | | gl_shader_cache: Rename Program abstractions into Pipeline | ReinUsesLisp | 2021-07-23 | 10 | -104/+104 |
| | | | |||||
| * | | gl_shader_cache: Do not flip tessellation on OpenGL | ReinUsesLisp | 2021-07-23 | 1 | -2/+1 |
| | | | |||||
| * | | gl_graphics_program: Fix texture buffer bindings | ReinUsesLisp | 2021-07-23 | 1 | -24/+35 |
| | | | |||||
| * | | gl_shader_cache: Conditionally use viewport mask | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | | | |||||
| * | | gl_shader_cache,glasm: Conditionally use typeless image reads extension | ReinUsesLisp | 2021-07-23 | 2 | -37/+39 |
| | | | |||||
| * | | gl_shader_cache: Improve GLASM error print logic | ReinUsesLisp | 2021-07-23 | 1 | -7/+10 |
| | | | |||||
| * | | glasm: Implement forced early Z | ReinUsesLisp | 2021-07-23 | 1 | -2/+2 |
| | | | |||||
| * | | glasm: Set transform feedback state | ReinUsesLisp | 2021-07-23 | 5 | -113/+132 |
| | | | |||||
| * | | video_core: Abstract transform feedback translation utility | ReinUsesLisp | 2021-07-23 | 6 | -111/+145 |
| | | | |||||
| * | | gl_shader_cache: Pass shader runtime information | ReinUsesLisp | 2021-07-23 | 1 | -2/+74 |
| | | | |||||
| * | | shader: Split profile and runtime information in separate structs | ReinUsesLisp | 2021-07-23 | 3 | -237/+212 |
| | | | |||||
| * | | gl_shader_manager: Zero initialize current assembly programs | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | | | |||||
| * | | gl_shader_manager: Remove unintentionally committed #pragma | ReinUsesLisp | 2021-07-23 | 1 | -2/+0 |
| | | | |||||
| * | | renderer_opengl: State track compute assembly programs | ReinUsesLisp | 2021-07-23 | 3 | -4/+21 |
| | | | |||||
| * | | renderer_opengl: State track assembly programs | ReinUsesLisp | 2021-07-23 | 3 | -23/+56 |
| | | | |||||
| * | | HACK: Bind stages before and after bindings | ReinUsesLisp | 2021-07-23 | 1 | -0/+11 |
| | | | | | | | | | | | | | | | | | | | | | Works around a bug where program parameters are only applied to the current stage, and this one wasn't bound at the moment. Affects all SSBO usages on GLASM. | ||||
| * | | glasm: Support textures used in more than one stage | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | | | |||||
| * | | opengl: Initial (broken) support to GLASM shaders | ReinUsesLisp | 2021-07-23 | 3 | -14/+53 |
| | | | |||||
| * | | vk_update_descriptor: Properly initialize payload on the update descriptor queue | ReinUsesLisp | 2021-07-23 | 1 | -1/+3 |
| | | | |||||
| * | | vk_pipeline_cache: Enable int8 and int16 types on Vulkan | ReinUsesLisp | 2021-07-23 | 1 | -0/+2 |
| | | | |||||
| * | | gl_rasterizer: Flush L2 caches before glFlush on GLASM | ReinUsesLisp | 2021-07-23 | 1 | -0/+8 |
| | | | |||||
| * | | glasm: Initial GLASM compute implementation for testing | ReinUsesLisp | 2021-07-23 | 3 | -14/+47 |
| | | | |||||
| * | | vk_scheduler: Use locks instead of SPSC a queue | ReinUsesLisp | 2021-07-23 | 2 | -32/+42 |
| | | | | | | | | | | | | This tries to fix a data race where we'd wait forever for the GPU. | ||||
| * | | vk_query_cache: Wait before reading queries | ReinUsesLisp | 2021-07-23 | 1 | -9/+2 |
| | | | |||||
| * | | vk_master_semaphore: Use fetch_add to increase master semaphore tick | ReinUsesLisp | 2021-07-23 | 2 | -6/+4 |
| | | | |||||
| * | | gl_shader_cache: Remove code unintentionally committed | ReinUsesLisp | 2021-07-23 | 1 | -3/+0 |
| | | | |||||
| * | | Move SPIR-V emission functions to their own header | ReinUsesLisp | 2021-07-23 | 2 | -7/+6 |
| | | | |||||
| * | | shader: Initial OpenGL implementation | ReinUsesLisp | 2021-07-23 | 35 | -705/+1415 |
| | | | |||||
| * | | spirv: Support OpenGL uniform buffers and change bindings | ReinUsesLisp | 2021-07-23 | 1 | -2/+5 |
| | | | |||||
| * | | shader: Address feedback | FernandoS27 | 2021-07-23 | 1 | -9/+9 |
| | | | |||||
| * | | shader: Implement VertexA stage | FernandoS27 | 2021-07-23 | 1 | -3/+14 |
| | | | |||||
| * | | vk_graphics_pipeline: Fix texture buffer descriptors | ReinUsesLisp | 2021-07-23 | 1 | -7/+8 |
| | | | |||||
| * | | vk_scheduler: Allow command submission on worker thread | ReinUsesLisp | 2021-07-23 | 8 | -182/+200 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This changes how Scheduler::Flush works. It queues the current command buffer to be sent to the GPU but does not do it immediately. The Vulkan worker thread takes care of that. Users will have to use Scheduler::Flush + Scheduler::WaitWorker to get the previous behavior. Scheduler::Finish is unchanged. To avoid waiting on work never queued, Scheduler::Wait sends the current command buffer if that's what the caller wants to wait. | ||||
| * | | vk_compute_pass: Fix -Wshadow warning | ReinUsesLisp | 2021-07-23 | 1 | -3/+3 |
| | | | |||||
| * | | shader: Move pipeline cache logic to separate files | ReinUsesLisp | 2021-07-23 | 12 | -824/+1095 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | Move code to separate files to be able to reuse it from OpenGL. This greatly simplifies the pipeline cache logic on Vulkan. Transform feedback state is not yet abstracted and it's still intrusively stored inside vk_pipeline_cache. It will be moved when needed on OpenGL. | ||||
| * | | vulkan: Defer descriptor set work to the Vulkan thread | ReinUsesLisp | 2021-07-23 | 8 | -79/+69 |
| | | | | | | | | | | | | | | | | | | | | | Move descriptor lookup and update code to a separate thread. Delaying this removes work from the main GPU thread and allows creating descriptor layouts on another thread. This reduces a bit the workload of the main thread when new pipelines are encountered. | ||||
| * | | vulkan: Rework descriptor allocation algorithm | ReinUsesLisp | 2021-07-23 | 15 | -197/+314 |
| | | | | | | | | | | | | | | | | | | Create multiple descriptor pools on demand. There are some degrees of freedom what is considered a compatible pool to avoid wasting large pools on small descriptors. | ||||
| * | | vk_graphics_pipeline: Generate specialized pipeline config functions and improve code | ReinUsesLisp | 2021-07-23 | 2 | -31/+230 |
| | | | |||||
| * | | shader: Accelerate pipeline transitions and use dirty flags for shaders | ReinUsesLisp | 2021-07-23 | 9 | -64/+114 |
| | | | |||||
| * | | vk_compute_pipeline: Fix index comparison oversight on compute texture buffers | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | | | |||||
| * | | vulkan_device: Require shaderClipDistance and shaderCullDistance features | ReinUsesLisp | 2021-07-23 | 1 | -2/+4 |
| | | | |||||
| * | | vk_graphics_pipeline: Guard against non-tessellation pipelines using patches | ReinUsesLisp | 2021-07-23 | 1 | -2/+8 |
| | | | |||||
| * | | shader: Fix bugs and build issues on GCC | Rodrigo Locatti | 2021-07-23 | 3 | -4/+4 |
| | | | |||||
| * | | shader: Fix render targets with null attachments | ReinUsesLisp | 2021-07-23 | 2 | -26/+34 |
| | | | |||||
| * | | shader: Require dual source blending | ReinUsesLisp | 2021-07-23 | 1 | -1/+2 |
| | | | |||||
| * | | shader: Implement indexed textures | ReinUsesLisp | 2021-07-23 | 3 | -64/+95 |
| | | | |||||
| * | | shader: Move microinstruction header to the value header | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | | | |||||
| * | | shader: Implement D3D samplers | ReinUsesLisp | 2021-07-23 | 3 | -37/+51 |
| | | | |||||
| * | | shader: Implement SR_Y_DIRECTION | FernandoS27 | 2021-07-23 | 3 | -0/+4 |
| | | | |||||
| * | | shader: Implement PIXLD.MY_INDEX | ReinUsesLisp | 2021-07-23 | 1 | -1/+2 |
| | | | |||||
| * | | spirv: Implement ViewportMask with NV_viewport_array2 | ReinUsesLisp | 2021-07-23 | 3 | -0/+12 |
| | | | |||||
| * | | shader: Implement tessellation shaders, polygon mode and invocation id | ReinUsesLisp | 2021-07-23 | 6 | -3/+50 |
| | | | |||||
| * | | vk_pipeline_cache: Silence GCC warnings | lat9nq | 2021-07-23 | 1 | -0/+2 |
| | | | | | | | | | | | | | | | Silences `-Werror=missing-field-initializers` due to missing initializers. | ||||
| * | | spirv: Implement image buffers | ReinUsesLisp | 2021-07-23 | 4 | -26/+56 |
| | | | |||||
| * | | spirv: Implement alpha test | ameerj | 2021-07-23 | 1 | -0/+36 |
| | | | |||||
| * | | shader: Implement transform feedbacks and define file format | ReinUsesLisp | 2021-07-23 | 3 | -7/+156 |
| | | | |||||
| * | | shader: Implement early Z tests | ReinUsesLisp | 2021-07-23 | 1 | -0/+1 |
| | | | |||||
| * | | spirv: Rework storage buffers and shader memory | ReinUsesLisp | 2021-07-23 | 1 | -1/+28 |
| | | | |||||
| * | | shader: Implement geometry shaders | ReinUsesLisp | 2021-07-23 | 2 | -7/+56 |
| | | | |||||
| * | | pipeline_helper: Simplify descriptor objects initialization | ReinUsesLisp | 2021-07-23 | 1 | -33/+25 |
| | | | |||||
| * | | shader: Implement ATOM/S and RED | ameerj | 2021-07-23 | 3 | -0/+21 |
| | | | |||||
| * | | nsight_aftermath_tracker: Report used shaders to Nsight Aftermath | ReinUsesLisp | 2021-07-23 | 6 | -16/+20 |
| | | | |||||
| * | | spirv: Guard against typeless image reads on unsupported devices | ReinUsesLisp | 2021-07-23 | 1 | -0/+1 |
| | | | |||||
| * | | vk_rasterizer: Request outside render pass execution context for compute | ReinUsesLisp | 2021-07-23 | 1 | -0/+1 |
| | | | |||||
| * | | pipeline_helper: Add missing [[maybe_unused]] | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | | | |||||
| * | | shader: Implement SULD and SUST | ReinUsesLisp | 2021-07-23 | 8 | -65/+135 |
| | | | |||||
| * | | shader: Address feedback + clang format | lat9nq | 2021-07-23 | 1 | -2/+2 |
| | | | |||||
| * | | shader_recompiler,video_core: Cleanup some GCC and Clang errors | lat9nq | 2021-07-23 | 5 | -16/+16 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Mostly fixing unused *, implicit conversion, braced scalar init, fpermissive, and some others. Some Clang errors likely remain in video_core, and std::ranges is still a pertinent issue in shader_recompiler shader_recompiler: cmake: Force bracket depth to 1024 on Clang Increases the maximum fold expression depth thread_worker: Include condition_variable Don't use list initializers in control flow Co-authored-by: ReinUsesLisp <reinuseslisp@airmail.cc> | ||||
| * | | shader: Interact texture buffers with buffer cache | ReinUsesLisp | 2021-07-23 | 14 | -119/+304 |
| | | | |||||
| * | | shader: Implement texture buffers | ReinUsesLisp | 2021-07-23 | 4 | -12/+29 |
| | | | |||||
| * | | vk_pipeline_cache: Fix num of pipeline workers on weird platforms | ReinUsesLisp | 2021-07-23 | 1 | -1/+1 |
| | | | |||||
| * | | shader: Fix ShadowCube declaration type, set number of pipeline threads based on hardware | FernandoS27 | 2021-07-23 | 1 | -1/+3 |
| | | | |||||
| * | | vk_compute_pass: Fix compute passes | ReinUsesLisp | 2021-07-23 | 3 | -23/+19 |
| | | | |||||
| * | | shader: Remove atomic flags and use mutex + cond variable for pipelines | ReinUsesLisp | 2021-07-23 | 4 | -11/+32 |
| | | | |||||
| * | | vk_pipeline_cache: Remove unnecesary scope in pipeline cache locking | ReinUsesLisp | 2021-07-23 | 1 | -15/+12 |
| | | | |||||
| * | | vk_pipeline_cache: Small fixes to the pipeline cache | FernandoS27 | 2021-07-23 | 1 | -10/+14 |
| | | | |||||
| * | | shader: Mark SSBOs as written when they are | FernandoS27 | 2021-07-23 | 2 | -2/+2 |
| | | | |||||
| * | | shader: Implement ViewportIndex | FernandoS27 | 2021-07-23 | 1 | -0/+1 |
| | | | |||||
| * | | vulkan: Serialize pipelines on a separate thread | ReinUsesLisp | 2021-07-23 | 2 | -67/+64 |
| | | | |||||
| * | | vulkan: Create pipeline layouts in separate threads | ReinUsesLisp | 2021-07-23 | 7 | -63/+65 |
| | | | |||||
| * | | vulkan: Build pipelines in parallel at runtime | ReinUsesLisp | 2021-07-23 | 9 | -165/+197 |
| | | | | | | | | | | | | | | | Wait from the worker thread for a pipeline to build before binding it to the command buffer. This allows queueing pipelines to multiple threads. | ||||
| * | | vk_pipeline_cache: Name SPIR-V modules | ReinUsesLisp | 2021-07-23 | 1 | -1/+11 |
| | | | |||||
| * | | shader: Address feedback | FernandoS27 | 2021-07-23 | 1 | -1/+1 |
| | | | |||||
| * | | shader: Implement TLD | FernandoS27 | 2021-07-23 | 1 | -2/+1 |
| | | | |||||
| * | | spirv: Add fixed pipeline point size | ReinUsesLisp | 2021-07-23 | 1 | -0/+3 |
| | | | |||||
| * | | shader: Implement BRX | FernandoS27 | 2021-07-23 | 1 | -1/+49 |
| | | | |||||
| * | | vk_pipeline_cache: Fix size hashing of shaders | ReinUsesLisp | 2021-07-23 | 1 | -8/+7 |
| | | | |||||
| * | | shader: Implement LDS, STS, LDL, and STS and use SPIR-V 1.4 when available | ReinUsesLisp | 2021-07-23 | 3 | -19/+104 |
| | | | |||||
| * | | shader: Better interpolation and disabled attributes support | ReinUsesLisp | 2021-07-23 | 2 | -2/+5 |
| | | | |||||
| * | | spirv: Remove dependencies on Environment when generating SPIR-V | ReinUsesLisp | 2021-07-23 | 1 | -7/+3 |
| | | | |||||
| * | | vk_pipeline_cache: Fix pipeline and shader caches | ReinUsesLisp | 2021-07-23 | 2 | -6/+21 |
| | | | |||||
| * | | shader: Fix rasterizer integration order issues | ReinUsesLisp | 2021-07-23 | 3 | -7/+6 |
| | | | |||||
| * | | shader: Implement TXQ and fix FragDepth | ReinUsesLisp | 2021-07-23 | 1 | -0/+92 |
| | | | |||||
| * | | shader: Implement NDC [-1, 1], attribute types and default varying initialization | ReinUsesLisp | 2021-07-23 | 3 | -3/+37 |
| | | | |||||
| * | | shader: Implement VOTE | ameerj | 2021-07-23 | 4 | -1/+15 |
| | | | |||||
| * | | vk_pipeline_cache: Fix ReleaseContents order | ReinUsesLisp | 2021-07-23 | 1 | -2/+2 |
| | | | |||||
| * | | vk_pipeline_cache: Add pipeline cache | ReinUsesLisp | 2021-07-23 | 2 | -0/+7 |
| | | | |||||
| * | | vk_pipeline_cache: Add pipeline cache | ReinUsesLisp | 2021-07-23 | 4 | -98/+332 |
| | | | |||||
| * | | shader: Implement DMNMX, DSET, DSETP | ameerj | 2021-07-23 | 1 | -0/+2 |
| | | | |||||
| * | | spirv: Implement VertexId and InstanceId, refactor code | ReinUsesLisp | 2021-07-23 | 1 | -0/+1 |
| | | | |||||
| * | | shader: Implement I2F | ReinUsesLisp | 2021-07-23 | 1 | -1/+2 |
| | | | |||||
| * | | shader: Add partial rasterizer integration | ReinUsesLisp | 2021-07-23 | 20 | -410/+1298 |
| | | | |||||
| * | | spirv: Add SignedZeroInfNanPreserve logic | ameerj | 2021-07-23 | 1 | -0/+4 |
| | | | |||||
| * | | shader: Initial support for textures and TEX | ReinUsesLisp | 2021-07-23 | 4 | -1/+111 |
| | | | |||||
| * | | spirv: Fixes and Intel specific workarounds | ReinUsesLisp | 2021-07-23 | 1 | -0/+1 |
| | | | |||||
| * | | shader: Rename, implement FADD.SAT and P2R (imm) | ReinUsesLisp | 2021-07-23 | 1 | -2/+2 |
| | | | |||||
| * | | shader: Add denorm flush support | ReinUsesLisp | 2021-07-23 | 5 | -33/+50 |
| | | | |||||
| * | | spirv: Add lower fp16 to fp32 pass | ReinUsesLisp | 2021-07-23 | 4 | -9/+14 |
| | | | |||||
| * | | shader: Primitive Vulkan integration | ReinUsesLisp | 2021-07-23 | 15 | -2538/+430 |
| | | | |||||
| * | | shader: Remove old shader management | ReinUsesLisp | 2021-07-23 | 79 | -19513/+53 |
| | | | |||||
| * | | spirv: Initial SPIR-V support | ReinUsesLisp | 2021-07-23 | 2 | -3265/+0 |
| |/ | |||||
* | | Merge pull request #6690 from ReinUsesLisp/dma-clear-fixups | bunnei | 2021-07-24 | 2 | -7/+3 |
|\ \ | |/ |/| | buffer_cache: Misc fixups related to buffer clears | ||||
| * | gl_buffer_cache: Use glClearNamedBufferSubData:GL_RED instead of GL_RGBA | ReinUsesLisp | 2021-07-20 | 1 | -1/+1 |
| | | | | | | | | Avoids reading out of bounds from the stack. | ||||
| * | buffer_cache: Simplify clear logic | ReinUsesLisp | 2021-07-20 | 1 | -6/+2 |
| | | | | | | | | | | Use existing helper functions and avoid looping when only one buffer has to be active. | ||||
* | | Merge pull request #6686 from ReinUsesLisp/vk-optimal-copy | bunnei | 2021-07-22 | 1 | -21/+35 |
|\ \ | |/ |/| | vk_texture_cache: Use VK_IMAGE_LAYOUT_TRANSFER_SRC_OPTIMAL when possible | ||||
| * | vk_texture_cache: Use VK_IMAGE_LAYOUT_TRANSFER_SRC_OPTIMAL when possible | ReinUsesLisp | 2021-07-20 | 1 | -21/+35 |
| | | | | | | | | Silences performance warnings generated from validation layers on each frame. | ||||
* | | Merge pull request #6629 from FernandoS27/accel-dma-2 | bunnei | 2021-07-20 | 11 | -12/+136 |
|\ \ | | | | | | | DMAEngine: Accelerate BufferClear [accelerateDMA Part 2] | ||||
| * | | Buffer cache: Fixes, Clang and Feedback. | Fernando Sahmkow | 2021-07-15 | 3 | -11/+10 |
| | | | |||||
| * | | GPUMemoryManager: Force inmediate invalidation when writting block. | Fernando Sahmkow | 2021-07-14 | 1 | -1/+1 |
| | | | |||||
| * | | Buffer Cache: Fixes to DMA Copy. | Fernando Sahmkow | 2021-07-14 | 1 | -6/+7 |
| | | | |||||
| * | | DMAEngine: Revert flushing from Pitch to BlpockLinear. | Fernando Sahmkow | 2021-07-14 | 1 | -2/+7 |
| | | | |||||
| * | | BufferCache: fix clearing on forced download. | Fernando Sahmkow | 2021-07-14 | 1 | -10/+20 |
| | | | |||||
| * | | DMAEngine: Accelerate BufferClear | Fernando Sahmkow | 2021-07-13 | 11 | -6/+115 |
| | | | |||||
* | | | Merge pull request #6685 from ReinUsesLisp/radeonsi-client | Fernando S | 2021-07-20 | 1 | -1/+1 |
|\ \ \ | |_|/ |/| | | gl_texture_cache: Workaround slow PBO downloads on radeonsi | ||||
| * | | gl_texture_cache: Workaround slow PBO downloads on radeonsi | ReinUsesLisp | 2021-07-20 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | There's an optimization bug on non-git mesa versions where not specifying GL_CLIENT_STORAGE_BIT causes very slow reads on the CPU side. Add this bit for all vendors. | ||||
* | | | vk_buffer_cache: Fix quad index array with 0 vertices (#6627) | Fernando S | 2021-07-20 | 1 | -0/+7 |
|/ / | |||||
* | | Merge pull request #6580 from ReinUsesLisp/xfb-radv | Rodrigo Locatti | 2021-07-20 | 2 | -11/+19 |
|\ \ | | | | | | | vk_buffer_cache: Use emulated null buffers for transform feedback | ||||
| * | | vk_buffer_cache: Use emulated null buffers for transform feedback | ReinUsesLisp | 2021-07-09 | 2 | -11/+19 |
| | | | | | | | | | | | | | | | | | | Vulkan does not support null buffers on transform feedback bindings. Emulate these using the same null buffer we were using for index buffers. | ||||
* | | | Merge pull request #6679 from yzct12345/fix-lets-go | Fernando S | 2021-07-19 | 1 | -1/+4 |
|\ \ \ | | | | | | | | | Fix Pokemon Let's Go on Vulkan | ||||
| * | | | Update src/video_core/renderer_vulkan/vk_texture_cache.cpp | yzct12345 | 2021-07-19 | 1 | -1/+1 |
| | | | | | | | | | | | | Co-authored-by: Vitor K <vitor-kiguchi@hotmail.com> | ||||
| * | | | Update src/video_core/renderer_vulkan/vk_texture_cache.cpp | yzct12345 | 2021-07-19 | 1 | -1/+1 |
| | | | | | | | | | | | | Co-authored-by: Vitor K <vitor-kiguchi@hotmail.com> | ||||
| * | | | Ignore wrong blit format | yzct12345 | 2021-07-18 | 1 | -1/+4 |
| | | | | |||||
* | | | | Merge pull request #6670 from ReinUsesLisp/prepare-rt | Fernando S | 2021-07-19 | 1 | -0/+6 |
|\ \ \ \ | | | | | | | | | | | texture_cache: Always prepare image views on render targets | ||||
| * | | | | texture_cache: Always prepare image views on render targets | ReinUsesLisp | 2021-07-18 | 1 | -0/+6 |
| |/ / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Images used as render targets were not being "prepared", causing desynchronizations on the texture cache. Needs #6669 to avoid performance regressions on certain cooking titles. - Fixes black shadows on Age of Calamity. | ||||
* | | | | Merge pull request #6669 from ReinUsesLisp/fix-samples-sizes | Fernando S | 2021-07-19 | 1 | -53/+33 |
|\ \ \ \ | | | | | | | | | | | texture_cache/util: Fix size calculations of multisampled images | ||||
| * | | | | texture_cache/util: Fix size calculations of multisampled images | ReinUsesLisp | 2021-07-18 | 1 | -53/+33 |
| |/ / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | On the texture cache we handle multisampled images by keeping their real size in samples (e.g. 1920x1080 with 4 samples is 3840x2160). This works nicely with size matches and other comparisons, but the calculation for guest sizes was not having this in mind, and the size was being multiplied (again) by the number of samples per dimension. For example a 3840x2160 texture cache image had its width and height multiplied by 2, resulting in a much larger texture. Fix this issue. - Fixes performance regression on cooking related titles when an unrelated bug was fixed. | ||||
* | | | | vk_texture_cache: Finalize renderpass when downloading images | ReinUsesLisp | 2021-07-18 | 1 | -0/+1 |
| | | | | |||||
* | | | | vk_compute_pass: Fix pipeline barriers on non-initialized ASTC images | ReinUsesLisp | 2021-07-18 | 1 | -2/+3 |
| | | | | |||||
* | | | | vk_compute_pass: Fix ASTC buffer setup synchronization | ReinUsesLisp | 2021-07-18 | 1 | -14/+14 |
|/ / / | |||||
* | | | Merge pull request #6579 from ameerj/float-settings | bunnei | 2021-07-16 | 2 | -6/+7 |
|\ \ \ | | | | | | | | | settings: Eliminate usage of float-point setting values | ||||
| * | | | configure_graphics: Use u8 for bg_color values | ameerj | 2021-07-09 | 2 | -6/+7 |
| |/ / | |||||
* | | | Merge pull request #6635 from ameerj/intel-vk-sm3dw | Fernando S | 2021-07-15 | 1 | -2/+4 |
|\ \ \ | | | | | | | | | vk_rasterizer: Only clear valid color attachments | ||||
| * | | | vk_rasterizer: Only clear valid color attachments | ameerj | 2021-07-13 | 1 | -2/+4 |
| | |/ | |/| | |||||
* | | | Merge pull request #6525 from ameerj/nvdec-fixes | Fernando S | 2021-07-15 | 1 | -11/+10 |
|\ \ \ | |/ / |/| | | nvdec: Fix Submit Ioctl data source, vic frame dimension computations | ||||
| * | | vic: Fix dimension compuation of YUV frames | ameerj | 2021-07-15 | 1 | -11/+10 |
| | | | | | | | | | | | | Fixes out of bound memory crashes in Mario Golf | ||||
* | | | accelerateDMA: Fixes and feedback. | Fernando Sahmkow | 2021-07-12 | 3 | -88/+62 |
| | | | |||||
* | | | accelerateDMA: Accelerate Buffer Copies. | Fernando Sahmkow | 2021-07-11 | 9 | -13/+176 |
| | | | |||||
* | | | Buffer Cache: Address Feedback. | Fernando Sahmkow | 2021-07-10 | 2 | -4/+9 |
| | | | |||||
* | | | Buffer Cache: Fix GCC copmpile error | Fernando Sahmkow | 2021-07-09 | 1 | -1/+0 |
| | | | |||||
* | | | Fence Manager: remove reference fencing. | Fernando Sahmkow | 2021-07-09 | 3 | -31/+6 |
| | | | |||||
* | | | BufferCache: Additional download fixes. | Fernando Sahmkow | 2021-07-09 | 2 | -23/+107 |
| | | | |||||
* | | | Buffer Cache: Revert unnecessary range reduction. | Fernando Sahmkow | 2021-07-09 | 1 | -29/+13 |
| | | | |||||
* | | | Fence Manager: Force ordering on WFI. | Fernando Sahmkow | 2021-07-09 | 4 | -38/+71 |
| | | | |||||
* | | | Buffer Cache: Eliminate the AC Hack as the base game is fixed in Hades. | Fernando Sahmkow | 2021-07-09 | 1 | -14/+4 |
| | | | |||||
* | | | Fence Manager: Add fences on Reference Count. | Fernando Sahmkow | 2021-07-09 | 8 | -6/+57 |
| | | | |||||
* | | | Videocore: Address Feedback & CLANG Format. | Fernando Sahmkow | 2021-07-09 | 2 | -78/+75 |
| | | | |||||
* | | | Buffer Cache: Fix High Downloads and don't predownload on Extreme. | Fernando Sahmkow | 2021-07-09 | 3 | -91/+122 |
| |/ |/| | |||||
* | | Merge pull request #6539 from lat9nq/default-setting | Ameer J | 2021-07-08 | 1 | -1/+1 |
|\ \ | | | | | | | general: Move most settings' defaults and labels into their definition | ||||
| * | | yuzu qt: Make most UISettings a BasicSetting | lat9nq | 2021-06-29 | 1 | -4/+9 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | For simple primitive settings, moves their defaults and labels to definition time. Also fixes typo and clang-format yuzu qt: config: Fix rng_seed | ||||
| * | | general: Make most settings a BasicSetting | lat9nq | 2021-06-28 | 1 | -10/+5 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Creates a new BasicSettings class in common/settings, and forces setting a default and label for each setting that uses it in common/settings. Moves defaults and labels from both frontends into common settings. Creates a helper function in each frontend to facillitate reading the settings now with the new default and label properties. Settings::Setting is also now a subclass of Settings::BasicSetting. Also adds documentation for both Setting and BasicSetting. | ||||
* | | | Out of bound blit (#6531) | Feng Chen | 2021-07-08 | 2 | -58/+35 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | * Fix out of bound blit error * Fix code read * Fix ci error Co-authored-by: Feng Chen <chen.feng@gloritysolutions.com> | ||||
* | | | util_shaders: Fix BindImageTexture | lat9nq | 2021-07-07 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | According to https://gitlab.freedesktop.org/mesa/mesa/-/issues/3820#note_753371 we need to set these to true for use with 3D textures. Fixes BOTW teleporting on RadeonSI and iris. | ||||
* | | | Merge pull request #6497 from FernandoS27/scotty-doesnt-know | bunnei | 2021-07-07 | 13 | -59/+581 |
|\ \ \ | | | | | | | | | GPU Memory Manager - Correct handling of non continuous backing memory. | ||||
| * | | | Texture Cache: Fix collision with multiple overlaps of the same sparse texture. | Fernando Sahmkow | 2021-07-04 | 1 | -1/+6 |
| | | | | |||||
| * | | | Texture Cache: Fix GCC & Clang. | Fernando Sahmkow | 2021-07-04 | 2 | -11/+11 |
| | | | | |||||
| * | | | Texture Cache: Address feedback. | Fernando Sahmkow | 2021-07-04 | 5 | -18/+37 |
| | | | | |||||
| * | | | Texture Cache: Improve accuracy of sparse texture detection. | Fernando Sahmkow | 2021-07-04 | 6 | -131/+342 |
| | | | | |||||
| * | | | Texture Cache: Initial Implementation of Sparse Textures. | Fernando Sahmkow | 2021-07-04 | 12 | -23/+310 |
| | | | | |||||
* | | | | Merge pull request #6537 from Morph1984/warnings | bunnei | 2021-07-06 | 7 | -23/+8 |
|\ \ \ \ | | | | | | | | | | | general: Enforce multiple warnings in MSVC | ||||
| * | | | | video_core: Remove #pragma warning directives for external headers | Morph | 2021-06-28 | 2 | -15/+0 |
| | | | | | |||||
| * | | | | video_core: Enforce C4242 | Morph | 2021-06-28 | 1 | -3/+2 |
| | | | | | |||||
| * | | | | video_core: Silence signed/unsigned mismatch warnings | Morph | 2021-06-28 | 4 | -5/+6 |
| | | | | | |||||
* | | | | | Merge pull request #6540 from Kelebek1/nvdec | Ameer J | 2021-07-05 | 10 | -356/+522 |
|\ \ \ \ \ | |_|/ / / |/| | | | | Slightly refactor NVDEC and codecs for readability and safety | ||||
| * | | | | Slightly refactor NVDEC and codecs for readability and safety | Kelebek1 | 2021-07-01 | 10 | -356/+522 |
| |/ / / | |||||
* | | | | TextureCacheOGL: Implement Image Copies for 1D and 1D Array. | Fernando Sahmkow | 2021-07-03 | 1 | -0/+26 |
| | | | | |||||
* | | | | TextureCache: Fix 1D to 2D overlapps. | Fernando Sahmkow | 2021-07-03 | 1 | -3/+0 |
| | | | | |||||
* | | | | Merge pull request #6459 from lat9nq/ubuntu-fixes | Ameer J | 2021-07-01 | 1 | -1/+4 |
|\ \ \ \ | | | | | | | | | | | cmake: Improve Linux dependency checking for externals | ||||
| * | | | | cmake: Fix find_program usage for 3.15 | lat9nq | 2021-06-13 | 1 | -1/+4 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | yuzu requires CMake 3.15 yet find_program was using REQUIRED, which is only available on 3.18 and later. Instead, we check for "<VAR>-NOTFOUND". In addition, check for additional requirements before building libusb or FFmpeg with autotools. Otherwise, CMake configuration will pass yet compilation will fail. | ||||
* | | | | | Merge pull request #6502 from ameerj/vendor-title | Morph | 2021-06-28 | 7 | -4/+91 |
|\ \ \ \ \ | |_|/ / / |/| | | | | main: Add GPU Vendor name to running title bar | ||||
| * | | | | gl_device: Expand on Mesa driver names | lat9nq | 2021-06-21 | 1 | -3/+28 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Makes this list a bit more capable at identifying Mesa drivers. Tries to deal with two of the overloaded vendor strings in a more generic fashion. | ||||
| * | | | | video_core: Add GPU vendor name to window title bar | ameerj | 2021-06-21 | 7 | -4/+66 |
| | | | | | |||||
* | | | | | buffer_cache: Only flush downloaded size | ReinUsesLisp | 2021-06-26 | 1 | -2/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Fixes a regression unintentionally introduced by the garbage collector. This makes regular memory downloads only flush the requested sizes. This negatively affected Koei Tecmo games. | ||||
* | | | | | video_core: Enforce C4244 | ReinUsesLisp | 2021-06-26 | 1 | -0/+1 |
| | | | | | | | | | | | | | | | | | | | | Enforce implicit integer casts to a smaller type as errors. | ||||
* | | | | | codec,vic: Disable warnings in ffmpeg headers | ReinUsesLisp | 2021-06-26 | 2 | -4/+29 |
| | | | | | |||||
* | | | | | vk_buffer_cache: Silence implicit cast warnings | ReinUsesLisp | 2021-06-26 | 1 | -2/+3 |
| | | | | | |||||
* | | | | | buffer_cache/texture_cache: Make GC functions private | ReinUsesLisp | 2021-06-26 | 2 | -5/+5 |
| | | | | | |||||
* | | | | | buffer_cache: Silence implicit cast warning | ReinUsesLisp | 2021-06-26 | 1 | -1/+1 |
| |_|/ / |/| | | | |||||
* | | | | vulkan_device: Make device memory match the rest of the file | ReinUsesLisp | 2021-06-25 | 2 | -19/+18 |
| |_|/ |/| | | | | | | | | Match the style in the file. | ||||
* | | | Merge pull request #6496 from ameerj/astc-fixes | bunnei | 2021-06-25 | 5 | -155/+50 |
|\ \ \ | | | | | | | | | astc: Various robustness enhancements for the gpu decoder | ||||
| * | | | util_shaders: Specify ASTC decoder memory barrier bits | ameerj | 2021-06-19 | 1 | -1/+6 |
| | | | | |||||
| * | | | astc_decoder.comp: Remove unnecessary LUT SSBOs | ameerj | 2021-06-19 | 5 | -113/+34 |
| | | | | | | | | | | | | | | | | We can move them to instead be compile time constants within the shader. | ||||
| * | | | astc: Various robustness enhancements for the gpu decoder | ameerj | 2021-06-19 | 5 | -47/+16 |
| |/ / | | | | | | | | | | | | | | | | These changes should help in reducing crashes/drivers panics that may occur due to synchronization issues between the shader completion and later access of the decoded texture. | ||||
* | | | Merge pull request #6519 from Wunkolo/mem-size-literal | bunnei | 2021-06-25 | 5 | -14/+27 |
|\ \ \ | | | | | | | | | common: Replace common_sizes into user-literals | ||||
| * | | | common: Replace common_sizes into user-literals | Wunkolo | 2021-06-24 | 5 | -14/+27 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Removes common_sizes.h in favor of having `_KiB`, `_MiB`, `_GiB`, etc user-literals within literals.h. To keep the global namespace clean, users will have to use: ``` using namespace Common::Literals; ``` to access these literals. | ||||
* | | | | Merge pull request #6517 from lioncash/fmtlib | bunnei | 2021-06-24 | 2 | -2/+2 |
|\ \ \ \ | |/ / / |/| | | | externals: Update fmt to 8.0.0 | ||||
| * | | | General: Resolve fmt specifiers to adhere to 8.0.0 API where applicable | Lioncash | 2021-06-23 | 2 | -2/+2 |
| | | | | | | | | | | | | | | | | Also removes some deprecated API usages. | ||||
* | | | | Merge pull request #6518 from lioncash/func | bunnei | 2021-06-23 | 1 | -0/+1 |
|\ \ \ \ | | | | | | | | | | | maxwell3d: Add missing return in default SizeInBytes() case | ||||
| * | | | | maxwell3d: Add missing return in default SizeInBytes() case | Lioncash | 2021-06-23 | 1 | -0/+1 |
| |/ / / | | | | | | | | | | | | | | | | | We were returning '1' in ComponentCount()'s default case but were neglecting to do the same with SizeInBytes(). | ||||
* | | | | Merge pull request #6465 from FernandoS27/sex-on-the-beach | Mai M | 2021-06-23 | 17 | -63/+466 |
|\ \ \ \ | |/ / / |/| | | | GPU: Implement a garbage collector for GPU Caches (project Reaper+) | ||||
| * | | | Reaper: Set minimum cleaning limit on OGL. | Fernando Sahmkow | 2021-06-22 | 1 | -1/+4 |
| | | | | |||||
| * | | | Reaper: Guarantee correct deletion. | Fernando Sahmkow | 2021-06-20 | 5 | -2/+23 |
| | | | | |||||
| * | | | Reaper: Correct size calculation on Vulkan. | Fernando Sahmkow | 2021-06-17 | 1 | -5/+3 |
| | | | | |||||
| * | | | Reaper: Change memory restrictions on TC depending on host memory on VK. | Fernando Sahmkow | 2021-06-17 | 8 | -39/+88 |
| | | | | |||||
| * | | | Reaper: Address Feedback. | Fernando Sahmkow | 2021-06-16 | 4 | -19/+41 |
| | | | | |||||
| * | | | Reaper: Setup settings and final tuning. | Fernando Sahmkow | 2021-06-16 | 3 | -32/+38 |
| | | | | |||||
| * | | | Reaper: Tune it up to be an smart GC. | Fernando Sahmkow | 2021-06-16 | 5 | -13/+130 |
| | | | | |||||
| * | | | Initial Reaper Setup | ReinUsesLisp | 2021-06-16 | 6 | -56/+226 |
| | | | | | | | | | | | | | | | | WIP | ||||
| * | | | vulkan_memory_allocator: Release allocations with no commits | ReinUsesLisp | 2021-06-16 | 2 | -5/+22 |
| | | | | |||||
* | | | | Merge pull request #6508 from ReinUsesLisp/bootmanager-stop-token | Mai M | 2021-06-23 | 5 | -8/+8 |
|\ \ \ \ | | | | | | | | | | | bootmanager: Use std::stop_source for stopping emulation | ||||
| * | | | | bootmanager: Use std::stop_source for stopping emulation | ReinUsesLisp | 2021-06-22 | 5 | -8/+8 |
| | |/ / | |/| | | | | | | | | | | | | | | | | | | | | | | Use its std::stop_token to abort shader cache loading. Using std::stop_token instead of std::atomic_bool allows the usage of other utilities like std::stop_callback. | ||||
* | | | | common: fs: Remove [[nodiscard]] attribute on Remove* functions | Morph | 2021-06-22 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | There are a lot of scenarios where we don't particularly care whether or not the removal operation and just simply attempt a removal. As such, removing the [[nodiscard]] attribute is best for these functions. | ||||
* | | | | vk_master_semaphore: Use jthread for debug thread | ReinUsesLisp | 2021-06-22 | 2 | -19/+8 |
|/ / / | |||||
* | | | vulkan_debug_callback: Skip logging known false-positive validation errors | ameerj | 2021-06-18 | 1 | -0/+8 |
| | | | | | | | | | | | | Avoids overwhelming the log with validation errors that are not applicable | ||||
* | | | Merge pull request #6469 from ReinUsesLisp/blit-view-compat | Ameer J | 2021-06-17 | 1 | -1/+9 |
|\ \ \ | |/ / |/| | | texture_cache/util: Avoid relaxed image views on different bytes per block | ||||
| * | | texture_cache/util: Avoid relaxed image views on different bytes per pixel | ReinUsesLisp | 2021-06-15 | 1 | -1/+9 |
| |/ | | | | | | | Avoids API usage errors on UE4 titles leading to crashes. | ||||
* | | astc_decoder: Fix LDR CEM1 endpoint calculation | ameerj | 2021-06-16 | 2 | -2/+2 |
| | | | | | | | | | | | | | | | | Per the spec, L1 is clamped to the value 0xff if it is greater than 0xff. An oversight caused us to take the maximum of L1 and 0xff, rather than the minimum. Huge thanks to wwylele for finding this. Co-Authored-By: Weiyi Wang <wwylele@gmail.com> | ||||
* | | configure_graphics: Add Accelerate ASTC decoding setting | ameerj | 2021-06-16 | 2 | -2/+11 |
| | | |||||
* | | textures: Reintroduce CPU ASTC decoder | ameerj | 2021-06-16 | 4 | -2/+1592 |
|/ | | | | | | | Users may want to fall back to the CPU ASTC texture decoder due to hangs and crashes that may be caused by keeping the GPU under compute heavy loads for extended periods of time. This is especially the case in games such as Astral Chain which make extensive use of ASTC textures. | ||||
* | GPUTHread: Remove async reads from Normal Accuracy. | Fernando Sahmkow | 2021-06-11 | 1 | -18/+6 |
| | |||||
* | rasterizer: Update pages in batches | ReinUsesLisp | 2021-06-11 | 1 | -15/+41 |
| | |||||
* | Fix GCC undefined behavior sanitizer. | Markus Wick | 2021-06-10 | 2 | -0/+6 |
| | | | | | | | | | * Wrong alignment in u64 LOG_DEBUG -> memcpy. * Huge shift exponent in stride calculation for linear buffer, unused result -> skipped. * Large shift in buffer cache if word = 0, skip checking for set bits. Non of those were critical, so this should not change any behavior. At least with the assumption, that the last one used masking behavior, which always yield continuous_bits = 0. | ||||
* | Merge pull request #6410 from lat9nq/avoid-oob | bunnei | 2021-06-07 | 1 | -0/+8 |
|\ | | | | | decoders: Avoid out-of-bounds access | ||||
| * | decoders: Break instead of continue | lat9nq | 2021-06-04 | 1 | -2/+2 |
| | | | | | | | | continue causes a memory leak in A Hat in Time. | ||||
| * | decoders: Avoid out-of-bounds access | lat9nq | 2021-06-04 | 1 | -0/+8 |
| | | | | | | | | This is not a real fix, so assert here and continue before crashing. | ||||
* | | buffer_cache: Simplify uniform disabling logic | ameerj | 2021-06-01 | 8 | -6/+29 |
|/ | |||||
* | Merge pull request #6367 from ReinUsesLisp/vma-host | bunnei | 2021-06-01 | 2 | -31/+43 |
|\ | | | | | vulkan_memory_allocator: Allow textures to be allocated in host memory | ||||
| * | vulkan_memory_allocator: Allow textures to be allocated in host memory | ReinUsesLisp | 2021-05-27 | 2 | -31/+43 |
| | | | | | | | | | | Allow Vulkan's allocator to use host memory when there's no more device local memory. This delays OOM, but it will eventually still happen. | ||||
* | | video_core: gpu: WaitFence: Do not block threads during shutdown. | bunnei | 2021-05-29 | 2 | -1/+13 |
| | | | | | | | | | | - Fixes a hang on shutdown when NVFlinger thread is waiting on a syncpoint that will never occur. - Commonly observed when stopping emulation in Super Mario Odyssey. | ||||
* | | Fix two GCC 11 warnings: Unneeded copies. | Markus Wick | 2021-05-29 | 1 | -2/+2 |
| | | | | | | | | | | std::move created an unneeded copy. iterating without reference also created copies. | ||||
* | | video_core: rasterizer_cache: Use u16 for cached page count. | bunnei | 2021-05-27 | 2 | -9/+9 |
|/ | | | | - Greatly reduces the risk of overflow, at the cost of doubling the size of this array. | ||||
* | common: fs: Rework the Common Filesystem interface to make use of std::filesystem (#6270) | Morph | 2021-05-26 | 5 | -106/+116 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * common: fs: fs_types: Create filesystem types Contains various filesystem types used by the Common::FS library * common: fs: fs_util: Add std::string to std::u8string conversion utility * common: fs: path_util: Add utlity functions for paths Contains various utility functions for getting or manipulating filesystem paths used by the Common::FS library * common: fs: file: Rewrite the IOFile implementation * common: fs: Reimplement Common::FS library using std::filesystem * common: fs: fs_paths: Add fs_paths to replace common_paths * common: fs: path_util: Add the rest of the path functions * common: Remove the previous Common::FS implementation * general: Remove unused fs includes * string_util: Remove unused function and include * nvidia_flags: Migrate to the new Common::FS library * settings: Migrate to the new Common::FS library * logging: backend: Migrate to the new Common::FS library * core: Migrate to the new Common::FS library * perf_stats: Migrate to the new Common::FS library * reporter: Migrate to the new Common::FS library * telemetry_session: Migrate to the new Common::FS library * key_manager: Migrate to the new Common::FS library * bis_factory: Migrate to the new Common::FS library * registered_cache: Migrate to the new Common::FS library * xts_archive: Migrate to the new Common::FS library * service: acc: Migrate to the new Common::FS library * applets/profile: Migrate to the new Common::FS library * applets/web: Migrate to the new Common::FS library * service: filesystem: Migrate to the new Common::FS library * loader: Migrate to the new Common::FS library * gl_shader_disk_cache: Migrate to the new Common::FS library * nsight_aftermath_tracker: Migrate to the new Common::FS library * vulkan_library: Migrate to the new Common::FS library * configure_debug: Migrate to the new Common::FS library * game_list_worker: Migrate to the new Common::FS library * config: Migrate to the new Common::FS library * configure_filesystem: Migrate to the new Common::FS library * configure_per_game_addons: Migrate to the new Common::FS library * configure_profile_manager: Migrate to the new Common::FS library * configure_ui: Migrate to the new Common::FS library * input_profiles: Migrate to the new Common::FS library * yuzu_cmd: config: Migrate to the new Common::FS library * yuzu_cmd: Migrate to the new Common::FS library * vfs_real: Migrate to the new Common::FS library * vfs: Migrate to the new Common::FS library * vfs_libzip: Migrate to the new Common::FS library * service: bcat: Migrate to the new Common::FS library * yuzu: main: Migrate to the new Common::FS library * vfs_real: Delete the contents of an existing file in CreateFile Current usages of CreateFile expect to delete the contents of an existing file, retain this behavior for now. * input_profiles: Don't iterate the input profile dir if it does not exist Silences an error produced in the log if the directory does not exist. * game_list_worker: Skip parsing file if the returned VfsFile is nullptr Prevents crashes in GetLoader when the virtual file is nullptr * common: fs: Validate paths for path length * service: filesystem: Open the mod load directory as read only | ||||
* | Merge pull request #6248 from A-w-x/intelmesa | bunnei | 2021-05-21 | 1 | -1/+1 |
|\ | | | | | gl_device: Intel: Disable texture view formats workaround on mesa | ||||
| * | gl_device: Intel: Disable texture view formats workaround on mesa | A-w-x | 2021-04-26 | 1 | -1/+1 |
| | | |||||
* | | Merge pull request #6317 from ameerj/fps-fix | bunnei | 2021-05-19 | 4 | -0/+9 |
|\ \ | | | | | | | perf_stats: Rework FPS counter to be more accurate | ||||
| * | | perf_stats: Rework FPS counter to be more accurate | ameerj | 2021-05-16 | 4 | -0/+9 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | The FPS counter was based on metrics in the nvdisp swapbuffers call. This metric would be accurate if the gpu thread/renderer were synchronous with the nvdisp service, but that's no longer the case. This commit moves the frame counting responsibility onto the concrete renderers after their frame draw calls. Resulting in more meaningful metrics. The displayed FPS is now made up of the average framerate between the previous and most recent update, in order to avoid distracting FPS counter updates when framerate is oscillating between close values. The status bar update frequency was also changed from 2 seconds to 500ms. | ||||
* | | | Merge pull request #6322 from ameerj/fast-null-buffer | bunnei | 2021-05-18 | 1 | -1/+4 |
|\ \ \ | | | | | | | | | buffer_cache: Ensure null buffers cannot take the fast uniform bind path | ||||
| * | | | buffer_cache: Ensure null buffers cannot take the fast uniform bind path | ameerj | 2021-05-16 | 1 | -1/+4 |
| |/ / | | | | | | | | | | Fixes a crash in New Pokemon Snap | ||||
* | | | Merge pull request #6289 from ameerj/oob-blit | bunnei | 2021-05-16 | 8 | -61/+99 |
|\ \ \ | |/ / |/| | | texture_cache: Handle out of bound texture blits | ||||
| * | | texture_cache: Handle out of bound texture blits | ameerj | 2021-05-08 | 8 | -61/+99 |
| |/ | | | | | | | Some games interleave a texture blit using regions which are out-of-bounds. This addresses the interleaving to avoid oob reads from the src texture. | ||||
* / | hle: kernel: Rename Process to KProcess. | bunnei | 2021-05-06 | 3 | -3/+3 |
|/ | |||||
* | Merge pull request #6237 from ameerj/nvdec-end-fix | bunnei | 2021-04-26 | 2 | -5/+3 |
|\ | | | | | nvhost_vic: Fix device closure | ||||
| * | nvhost_vic: Fix device closure | ameerj | 2021-04-25 | 2 | -5/+3 |
| | | | | | | | | | | | | Implements the OnClose method of the nvhost_vic device, and removes the remnants of an older implementation. Also cleans up some of the surrounding code. | ||||
* | | vk_texture_cache: Swap R and B channels of color flipped format | ameerj | 2021-04-25 | 1 | -1/+24 |
|/ | | | | Swaps the Red and Blue channels of the A1B5G5R5_UNORM texture format, which was being incorrectly rendered. | ||||
* | texture_cache/util: Fix src being used instead of dst within DeduceBlitImages | Lioncash | 2021-04-19 | 1 | -1/+1 |
| | | | | | | | This line can only ever be reached if src is null, so dereferencing it here is a logic bug that slipped through. Instead, we dereference dst instead which is guaranteed to be valid. | ||||
* | Merge pull request #6125 from ogniK5377/nvdec-close-dev | bunnei | 2021-04-17 | 2 | -5/+10 |
|\ | | | | | nvdrv: Cleanup CDMA Processor on device closure | ||||
| * | Address issues | Chloe Marcec | 2021-04-16 | 1 | -3/+2 |
| | | |||||
| * | nvdrv: Cleanup CDMA Processor on device closure | Chloe Marcec | 2021-03-30 | 2 | -5/+11 |
| | | | | | | | | Brings us a step closer to unifying all channels to share a common interface. | ||||
* | | Merge pull request #6196 from bunnei/asserts-setting | bunnei | 2021-04-15 | 18 | -18/+18 |
|\ \ | | | | | | | core: settings: Add setting for debug assertions and disable by default. | ||||
| * | | common: Move settings to common from core. | bunnei | 2021-04-15 | 18 | -18/+18 |
| | | | | | | | | | | | | - Removes a dependency on core and input_common from common. | ||||
* | | | Merge pull request #6191 from lioncash/vdtor | bunnei | 2021-04-14 | 4 | -4/+5 |
|\ \ \ | |/ / |/| | | engine_interface: Add missing virtual destructor | ||||
| * | | engine_interface: Add missing virtual destructor | Lioncash | 2021-04-12 | 4 | -4/+5 |
| | | | | | | | | | | | | | | | | | | Eliminates a potential bug vector related to inheritance. Plus, we should generally be specifying the destructor as virtual within purely virtual interfaces to begin with. | ||||
* | | | Merge pull request #6190 from lioncash/constfn2 | bunnei | 2021-04-14 | 1 | -2/+2 |
|\ \ \ | | | | | | | | | vk_master_semaphore: Add missing const qualifier for IsFree() | ||||
| * | | | vk_master_semaphore: Deduplicate atomic access within IsFree() | Lioncash | 2021-04-12 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | We can just reuse the already existing KnownGpuTick() to deduplicate the access. | ||||
| * | | | vk_master_semaphore: Add missing const qualifier for IsFree() | Lioncash | 2021-04-12 | 1 | -1/+1 |
| |/ / | | | | | | | | | | This member function doesn't modify class state. | ||||
* | | | Merge pull request #6188 from lioncash/bits | bunnei | 2021-04-14 | 1 | -5/+6 |
|\ \ \ | | | | | | | | | vk_texture_cache: Make use of bit_cast where applicable | ||||
| * | | | vk_texture_cache: Make use of Common::BitCast where applicable | Lioncash | 2021-04-12 | 1 | -5/+6 |
| |/ / | | | | | | | | | | | | | Also clarify the TODO comment a little more on the lacking implementations for std::bit_cast. | ||||
* | | | Merge pull request #6187 from lioncash/sign-conv | bunnei | 2021-04-13 | 2 | -11/+15 |
|\ \ \ | | | | | | | | | texure_cache/util: Resolve implicit sign conversions with std::reduce | ||||
| * | | | texure_cache/util: Resolve implicit sign conversions with std::reduce | Lioncash | 2021-04-12 | 2 | -11/+15 |
| |/ / | | | | | | | | | | | | | | | | Amends implicit sign conversions occurring with usages of std::reduce and also relocates it to its own utility function to reduce verbosity a little bit. | ||||
* / / | query_cache: Make use of std::erase_if | Lioncash | 2021-04-12 | 1 | -5/+4 |
|/ / | | | | | | | Same behavior, but much more straightforward to read. | ||||
* | | Merge pull request #6181 from Joshua-Ashton/robustness_features | Rodrigo Locatti | 2021-04-12 | 1 | -0/+9 |
|\ \ | | | | | | | vulkan_device: Enable EXT_robustness2 features | ||||
| * | | vulkan_device: Enable EXT_robustness2 features | Joshua Ashton | 2021-04-11 | 1 | -0/+9 |
| | | | | | | | | | | | | | | | | | | When this was being made mandatory, these enablement of these features was removed, but this is still needed. Fixes: 757fd1e91716 ("vulkan_device: Require VK_EXT_robustness2") | ||||
* | | | vk_buffer_cache: Fix offset for NULL vertex buffers | Joshua Ashton | 2021-04-11 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | The Vulkan spec states: If an element of pBuffers is VK_NULL_HANDLE, then the corresponding element of pOffsets must be zero. https://www.khronos.org/registry/vulkan/specs/1.2-extensions/man/html/vkCmdBindVertexBuffers2EXT.html#VUID-vkCmdBindVertexBuffers2EXT-pBuffers-04112 | ||||
* | | | renderer_vulkan: Check return value of AcquireNextImage | Joshua Ashton | 2021-04-11 | 3 | -5/+10 |
|/ / | | | | | | | | | We can get into a really bad state by ignoring this leading to device loss and using incorrect resources. | ||||
* | | video_core: Use a CV for blocking commands. | Markus Wick | 2021-04-07 | 2 | -23/+31 |
| | | | | | | | | There is no need for a busy loop here. Let's just use a condition variable to save some power. | ||||
* | | video_core/gpu_thread: Keep the write lock for allocating the fence. | Markus Wick | 2021-04-07 | 2 | -1/+4 |
| | | | | | | | | | | Else the fence might get submited out-of-order into the queue, which makes testing them pointless. Overhead should be tiny as the mutex is just moved from the queue to the writing code. | ||||
* | | video_core/gpu_thread: Implement a ShutDown method. | Markus Wick | 2021-04-07 | 4 | -14/+27 |
| | | | | | | | | | | | | This was implicitly done by `is_powered_on = false`, however the explicit method allows us to block until the GPU is actually gone. This should fix a race condition while removing the other subsystems while the GPU is still active. | ||||
* | | common/threadsafe_queue: Provide Wait() method. | Markus Wick | 2021-04-07 | 1 | -2/+1 |
| | | | | | | | | | | | | | | It shall block until there is something to consume in the queue. And use it for the GPU emulation instead of the spin loop. This is only in booting the emulator, however in BOTW this is the case for about 1 second. | ||||
* | | vp9: Avoid memcpy with null pointers | lat9nq | 2021-04-05 | 1 | -7/+9 |
| | | | | | | | | | | | | | | | | Avoid sending null pointer to memcpy as reported by Undefined Behaviour Sanitizer. Replaces the std::memcpy calls in SpliceVectors with std::copy calls. Opting to replace all the memcpy's with copy's. Co-authored-by: LC <mathew1800@gmail.com> | ||||
* | | Merge pull request #5927 from ameerj/astc-compute | Rodrigo Locatti | 2021-03-31 | 22 | -1770/+2027 |
|\ \ | | | | | | | video_core: Accelerate ASTC texture decoding using compute shaders | ||||
| * | | astc_decoder: Refactor for style and more efficient memory use | ameerj | 2021-03-25 | 9 | -2256/+502 |
| | | | |||||
| * | | astc_decoder: Reimplement Layers | Rodrigo Locatti | 2021-03-13 | 5 | -142/+161 |
| | | | | | | | | | | | | Reimplements the approach to decoding layers in the compute shader. Fixes multilayer astc decoding when using Vulkan. | ||||
| * | | astc_decoder: Fix out of bounds memory access | ameerj | 2021-03-13 | 1 | -2/+10 |
| | | | | | | | | | | | | resolves a crash with some anamolous textures found in Astral Chain. | ||||
| * | | renderer_vulkan: Accelerate ASTC decoding | ameerj | 2021-03-13 | 11 | -57/+426 |
| | | | | | | | | | | | | Co-Authored-By: Rodrigo Locatti <reinuseslisp@airmail.cc> | ||||
| * | | host_shaders: Modify shader cmake integration to allow for larger shaders | ameerj | 2021-03-13 | 4 | -8/+27 |
| | | | | | | | | | | | | using a raw string to encapsulate the entire shader code limits us to shaders of size less than 2KB. This change overcomes this limitation. | ||||
| * | | renderer_opengl: Accelerate ASTC texture decoding with a compute shader | ameerj | 2021-03-13 | 6 | -2/+1598 |
| | | | | | | | | | | | | | | | | | | ASTC texture decoding is currently handled by a CPU decoder for GPU's without native ASTC decoding support (most desktop GPUs). This is the cause for noticeable performance degradation in titles which use the format extensively. This commit adds support to accelerate ASTC decoding using a compute shader on OpenGL for GPUs without native support. | ||||
* | | | vulkan_common: enable OpenGL interop on other Unices | Jan Beich | 2021-03-30 | 2 | -5/+5 |
| |/ |/| | |||||
* | | gl_device: unblock async shaders on other Unix systems | Jan Beich | 2021-03-24 | 1 | -1/+1 |
| | | | | | | | | | | Mesa is the primary OpenGL provider on all FreeDesktop systems. For example, iris is used on Intel GPU + FreeBSD by default. | ||||
* | | gl_device: Block async shaders on AMD and Intel | lat9nq | 2021-03-21 | 1 | -1/+13 |
|/ | | | | | | | | Currently, the Windows versions of the Intel OpenGL driver and the AMD proprietary OpenGL driver do not properly support (or in fact degrade) when asynchronous shader compilation is enabled. This blocks specifically those drivers from using this feature. This affects AMDGPU-PRO on Linux, and AMD's and Intel's OpenGL drivers on Windows. | ||||
* | Merge pull request #6028 from bunnei/raster-cache | bunnei | 2021-03-13 | 2 | -47/+40 |
|\ | | | | | video_core: rasterizer_accelerated: Use a flat array instead of interval_map for cached pages. | ||||
| * | video_core: rasterizer_accelerated: Fix un/signed mismatch. | bunnei | 2021-03-13 | 1 | -1/+2 |
| | | |||||
| * | video_core: rasterizer_accelerated: Fix delta check ordering. | bunnei | 2021-03-03 | 1 | -3/+3 |
| | | |||||
| * | video_core: rasterizer_accelerated: Improve error handling & fix implicit conversion. | bunnei | 2021-03-03 | 1 | -4/+8 |
| | | |||||
| * | video_core: rasterizer_accelerated: Use a flat array instead of interval_map for cached pages. | bunnei | 2021-03-03 | 2 | -44/+32 |
| | | | | | | | | | | | | - Uses a fixed 64MB for the cache instead of an ever growing map. - Slightly faster by using atomics instead of a single mutex for access. - Thanks for Rodrigo for the idea. | ||||
* | | Merge pull request #5891 from ameerj/bgra-ogl | Rodrigo Locatti | 2021-03-09 | 14 | -30/+212 |
|\ \ | | | | | | | renderer_opengl: Use compute shaders to swizzle BGR textures on copy | ||||
| * | | texture_cache: Blacklist BGRA8 copies and views on OpenGL | ameerj | 2021-03-04 | 9 | -28/+80 |
| | | | | | | | | | | | | | | | | | | In order to force the BGRA8 conversion on Nvidia using OpenGL, we need to forbid texture copies and views with other formats. This commit also adds a boolean relating to this, as this needs to be done only for the OpenGL api, Vulkan must remain unchanged. | ||||
| * | | renderer_opengl: Swizzle BGR textures on copy | ameerj | 2021-03-04 | 5 | -2/+132 |
| |/ | | | | | | | | | | | OpenGL does not natively support BGR internal formats, which causes many BGR textures to render incorrectly, with Red and Blue channels swapped. This commit aims to address this by swizzling the blue and red channels on texture copies when a BGR format is encountered. | ||||
* | | Merge pull request #6021 from ReinUsesLisp/skip-cache-heuristic | bunnei | 2021-03-09 | 2 | -11/+37 |
|\ \ | | | | | | | buffer_cache: Heuristically decide to skip cache on uniform buffers | ||||
| * | | buffer_cache: Heuristically decide to skip cache on uniform buffers | ReinUsesLisp | 2021-03-02 | 2 | -11/+37 |
| |/ | | | | | | | | | | | | | | | | | Some games benefit from skipping caches (Pokémon Sword), and others don't (Animal Crossing: New Horizons). Add an heuristic to decide this at runtime. The cache hit ratio has to be ~98% or better to not skip the cache. There are 16 frames of buffer. | ||||
* | | Merge pull request #5989 from ReinUsesLisp/cmdpool | bunnei | 2021-03-04 | 1 | -1/+1 |
|\ \ | |/ |/| | vk_command_pool: Reduce the command pool size from 4096 to 4 | ||||
| * | vk_command_pool: Reduce the command pool size from 4096 to 4 | ReinUsesLisp | 2021-02-23 | 1 | -1/+1 |
| | | | | | | | | | | This allows drivers to reuse memory more easily and preallocate less. The optimal number has been measured booting Pokémon Sword. | ||||
* | | gpu_thread: Remove Async NVDEC placeholders | ameerj | 2021-03-01 | 3 | -26/+8 |
| | | | | | | | | This commit removes early placeholders for an implementation of async nvdec. With recent changes to the source code, the placeholders are no longer accurate, and can cause a nullptr dereference due to the nature of the cdma_pusher lifetime. | ||||
* | | Merge pull request #5984 from jbeich/gcc-freebsd | bunnei | 2021-02-27 | 1 | -0/+1 |
|\ \ | | | | | | | common,video-core: unbreak GCC 11 build on FreeBSD 13 | ||||
| * | | video_core: add missing header after 468bd9c1b0f9 | Jan Beich | 2021-02-23 | 1 | -0/+1 |
| |/ | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | src/video_core/shader_notify.cpp: In member function 'void VideoCore::ShaderNotify::MarkShaderComplete()': src/video_core/shader_notify.cpp:33:10: error: 'unique_lock' is not a member of 'std' 33 | std::unique_lock lock{mutex}; | ^~~~~~~~~~~ src/video_core/shader_notify.cpp:6:1: note: 'std::unique_lock' is defined in header '<mutex>'; did you forget to '#include <mutex>'? 5 | #include "video_core/shader_notify.h" +++ |+#include <mutex> 6 | src/video_core/shader_notify.cpp: In member function 'void VideoCore::ShaderNotify::MarkSharderBuilding()': src/video_core/shader_notify.cpp:38:10: error: 'unique_lock' is not a member of 'std' 38 | std::unique_lock lock{mutex}; | ^~~~~~~~~~~ src/video_core/shader_notify.cpp:38:10: note: 'std::unique_lock' is defined in header '<mutex>'; did you forget to '#include <mutex>'? | ||||
* | | Merge pull request #5953 from bunnei/memory-refactor-1 | bunnei | 2021-02-27 | 1 | -1/+1 |
|\ \ | | | | | | | Kernel Rework: Memory updates and refactoring (Part 1) | ||||
| * | | hle: kernel: Migrate PageHeap/PageTable to KPageHeap/KPageTable. | bunnei | 2021-02-19 | 1 | -1/+1 |
| | | | |||||
* | | | Implement glDepthRangeIndexeddNV | Kelebek1 | 2021-02-24 | 3 | -1/+12 |
| |/ |/| | |||||
* | | Merge pull request #5936 from Kelebek1/Offsets | bunnei | 2021-02-22 | 3 | -9/+34 |
|\ \ | | | | | | | Offsets for TexelFetch and TextureGather in Vulkan | ||||
| * | | Review 1 | Kelebek1 | 2021-02-15 | 2 | -3/+3 |
| | | | |||||
| * | | Implement texture offset support for TexelFetch and TextureGather and add offsets for Tlds | Kelebek1 | 2021-02-15 | 3 | -9/+34 |
| | | | | | | | | | | | | Formatting | ||||
* | | | gl_disk_shader_cache: Log total shader entries count on game load | Morph | 2021-02-20 | 1 | -0/+4 |
| | | | |||||
* | | | Merge pull request #5924 from ReinUsesLisp/inline-bindings | bunnei | 2021-02-19 | 4 | -24/+24 |
|\ \ \ | |_|/ |/| | | vk_update_descriptor: Inline and improve code for binding buffers | ||||
| * | | vk_update_descriptor: Inline and improve code for binding buffers | ReinUsesLisp | 2021-02-13 | 4 | -24/+24 |
| | | | | | | | | | | | | Allow compilers with our settings inline hot code. | ||||
* | | | Merge pull request #4973 from ameerj/nvdec-opt | bunnei | 2021-02-19 | 9 | -146/+72 |
|\ \ \ | | | | | | | | | nvdec: Reuse allocated buffers and general cleanup | ||||
| * | | | rebase, fix name shadowing, more const | ameerj | 2021-02-13 | 4 | -11/+10 |
| | | | | |||||
| * | | | Address PR feedback | ameerj | 2021-02-13 | 2 | -8/+5 |
| | | | | | | | | | | | | | | | | Co-Authored-By: LC <712067+lioncash@users.noreply.github.com> | ||||
| * | | | streamline cdma_pusher/command_classes | ameerj | 2021-02-13 | 1 | -13/+5 |
| | | | | |||||
| * | | | streamline cdma_pusher/command_classes | ameerj | 2021-02-13 | 5 | -85/+34 |
| | | | | |||||
| * | | | nvdec cleanup | ameerj | 2021-02-13 | 7 | -42/+31 |
| | | | | |||||
* | | | | vk_rasterizer: Fix loading shader addresses twice | ReinUsesLisp | 2021-02-16 | 1 | -1/+0 |
| | | | | | | | | | | | | | | | | This was recently introduced on a wrongly rebased commit. | ||||
* | | | | Merge pull request #5923 from ReinUsesLisp/vk-dirty-pipeline | bunnei | 2021-02-15 | 7 | -56/+103 |
|\ \ \ \ | |_|_|/ |/| | | | fixed_pipeline_cache: Use dirty flags to lazily update key | ||||
| * | | | fixed_pipeline_cache: Use dirty flags to lazily update key | ReinUsesLisp | 2021-02-13 | 7 | -56/+103 |
| | |/ | |/| | | | | | | | | | | Use dirty flags to avoid building pipeline key from scratch on each draw call. This saves a bit of unnecesary work on each draw call. | ||||
* | | | yuzu: Various frontend improvements to avoid crashes and improve experience on Linux. | bunnei | 2021-02-14 | 1 | -0/+1 |
| | | | |||||
* | | | vk_resource_pool: Load GPU tick once and compare with it | ReinUsesLisp | 2021-02-13 | 2 | -8/+8 |
|/ / | | | | | | | | | Other minor style improvements. Rename free_iterator to hint_iterator, to describe better what it does. | ||||
* / | gl_texture_cache: Lazily create non-sRGB texture views for sRGB formats | ameerj | 2021-02-13 | 3 | -7/+41 |
|/ | | | | | | This creates non-sRGB texture views for sRGB texture formats to allow for interfacing with these views in compute shaders using imageLoad and imageStore. Co-Authored-By: Rodrigo Locatti <reinuseslisp@airmail.cc> | ||||
* | Merge pull request #5919 from ReinUsesLisp/stream-buffer-tragic | Morph | 2021-02-13 | 3 | -6/+12 |
|\ | | | | | gl_stream_buffer/vk_staging_buffer_pool: Fix size check | ||||
| * | vk_master_semaphore: Mark gpu_tick atomic operations with relaxed order | ReinUsesLisp | 2021-02-13 | 1 | -4/+4 |
| | | |||||
| * | vk_staging_buffer_pool: Inline tick tests | ReinUsesLisp | 2021-02-13 | 2 | -1/+7 |
| | | | | | | | | | | | | Load the current tick to a local variable, moving it out of an atomic and allowing us to compare the value without going through a pointer each time. This should make the loop more optimizable. | ||||
| * | gl_stream_buffer/vk_staging_buffer_pool: Fix size check | ReinUsesLisp | 2021-02-13 | 2 | -2/+2 |
| | | | | | | | | | | | | Fix a tragic off-by-one condition that causes Vulkan's stream buffer to think it's always full, using fallback memory. The OpenGL was also affected by this bug to a lesser extent. | ||||
* | | Merge pull request #5916 from ameerj/maxwell-gl-unused | LC | 2021-02-13 | 2 | -36/+3 |
|\ \ | | | | | | | maxwell_to_gl: Remove unused code | ||||
| * | | maxwell_to_gl: Remove unused code | ameerj | 2021-02-13 | 2 | -36/+3 |
| | | | | | | | | | | | | Removes unused declarations in maxwell_to_gl.h | ||||
* | | | vulkan_device: Require VK_EXT_robustness2 | ReinUsesLisp | 2021-02-13 | 2 | -37/+14 |
| |/ |/| | | | | | | | | | | | We are already using robustness2 features without requiring it explicitly, causing potential crashes on drivers without the extension. Requiring this at boot allows better diagnostics for it and formalizes our usage on the extension. | ||||
* | | video_core: Fix clang build issues | ReinUsesLisp | 2021-02-13 | 2 | -8/+5 |
| | | |||||
* | | vk_staging_buffer_pool: Fix softlock when stream buffer overflows | ReinUsesLisp | 2021-02-13 | 2 | -19/+20 |
| | | | | | | | | | | | | | | There was still a code path that could wait on a timeline semaphore tick that would never be signalled. While we are at it, make use of more STL algorithms. | ||||
* | | vk_buffer_cache: Add support for null index buffers | ReinUsesLisp | 2021-02-13 | 2 | -4/+40 |
| | | | | | | | | | | | | | | | | | | | | | | Games can bind a null index buffer (size=0) where all indices are evaluated as zero. VK_EXT_robustness2 doesn't support this and all drivers segfault when a null index buffer is passed to vkCmdBindIndexBuffer. Workaround this by creating a 4 byte buffer and filling it with zeroes. If it's read out of bounds, robustness takes care of returning zeroes as indices. | ||||
* | | buffer_cache: Add extra bytes to guest SSBOs | ReinUsesLisp | 2021-02-13 | 1 | -1/+7 |
| | | | | | | | | | | | | | | Bind extra bytes beyond the guest API's bound range. This is due to some games like Astral Chain operating out of bounds. Binding the whole map range would be technically correct, but games have large maps that make this approach unaffordable for now. | ||||
* | | Merge branch 'bytes-to-map-end' into new-bufcache-wip | ReinUsesLisp | 2021-02-13 | 1 | -0/+2 |
| | | |||||
* | | vk_staging_buffer_pool: Get a staging buffer instead of waiting | ReinUsesLisp | 2021-02-13 | 2 | -9/+18 |
| | | | | | | | | | | | | Avoids waiting idle while the GPU finishes to do work, and fixes an issue where we'd wait forever if a single command buffer (logic tick) all the data. | ||||
* | | renderer_opengl: Remove interop | ReinUsesLisp | 2021-02-13 | 8 | -122/+10 |
| | | | | | | | | Remove unused interop code from the OpenGL backend. | ||||
* | | gl_buffer_cache: Drop interop based parameter buffer workarounds | ReinUsesLisp | 2021-02-13 | 3 | -65/+45 |
| | | | | | | | | | | Sacrify runtime performance to avoid generating kernel exceptions on Windows due to our abusive aliasing of interop buffer objects. | ||||
* | | buffer_cache: Heuristically detect stream buffers | ReinUsesLisp | 2021-02-13 | 2 | -6/+33 |
| | | | | | | | | | | | | | | Detect when a memory region has been joined several times and increase the size of the created buffer on those instances. The buffer is assumed to be a "stream buffer", increasing its size should stop us from constantly recreating it and fragmenting memory. | ||||
* | | buffer_cache: Split CreateBuffer in separate functions | ReinUsesLisp | 2021-02-13 | 1 | -29/+52 |
| | | | | | | | | | | Allow adding functionality to each function without making CreateBuffer more complex. | ||||
* | | buffer_cache: Skip cache on small uploads on Vulkan | ReinUsesLisp | 2021-02-13 | 3 | -9/+18 |
| | | | | | | | | | | | | | | | | Ports from OpenGL the optimization to skip small 3D uniform buffer uploads. This will take advantage of the previously introduced stream buffer. Fixes instances where the staging buffer offset was being ignored. | ||||
* | | vk_staging_buffer_pool: Add stream buffer for small uploads | ReinUsesLisp | 2021-02-13 | 15 | -127/+298 |
| | | | | | | | | | | | | | | | | This uses a ring buffer similar to OpenGL's stream buffer for small uploads. This stops us from allocating several small buffers, reducing memory fragmentation and cache locality. It uses dedicated allocations when possible. | ||||
* | | vulkan_device: Enable robustBufferAccess | ReinUsesLisp | 2021-02-13 | 1 | -1/+2 |
| | | | | | | | | | | Fix regression on Pascal on Animal Crossing: New Horizons, fixing a validation error. | ||||
* | | video_core: Reimplement the buffer cache | ReinUsesLisp | 2021-02-13 | 67 | -2607/+2514 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Reimplement the buffer cache using cached bindings and page level granularity for modification tracking. This also drops the usage of shared pointers and virtual functions from the cache. - Bindings are cached, allowing to skip work when the game changes few bits between draws. - OpenGL Assembly shaders no longer copy when a region has been modified from the GPU to emulate constant buffers, instead GL_EXT_memory_object is used to alias sub-buffers within the same allocation. - OpenGL Assembly shaders stream constant buffer data using glProgramBufferParametersIuivNV, from NV_parameter_buffer_object. In theory this should save one hash table resolve inside the driver compared to glBufferSubData. - A new OpenGL stream buffer is implemented based on fences for drivers that are not Nvidia's proprietary, due to their low performance on partial glBufferSubData calls synchronized with 3D rendering (that some games use a lot). - Most optimizations are shared between APIs now, allowing Vulkan to cache more bindings than before, skipping unnecesarry work. This commit adds the necessary infrastructure to use Vulkan object from OpenGL. Overall, it improves performance and fixes some bugs present on the old cache. There are still some edge cases hit by some games that harm performance on some vendors, this are planned to be fixed in later commits. | ||||
* | | vulkan_common: Expose interop and headless devices | ReinUsesLisp | 2021-02-13 | 4 | -21/+100 |
| | | |||||
* | | vulkan_common: Make interop extensions mandatory | ReinUsesLisp | 2021-02-13 | 1 | -0/+6 |
| | | |||||
* | | vulkan_device: Enable robust buffers | ReinUsesLisp | 2021-02-13 | 1 | -2/+4 |
| | | |||||
* | | vulkan_device: Use designated initializers for features | ReinUsesLisp | 2021-02-13 | 1 | -60/+59 |
| | | |||||
* | | vulkan_wrapper: Add memory barrier pipeline barrier helper | ReinUsesLisp | 2021-02-13 | 1 | -0/+6 |
| | | |||||
* | | vulkan_device: Fix formatting of constants | ReinUsesLisp | 2021-02-13 | 1 | -10/+6 |
| | | |||||
* | | vulkan_wrapper: Add interop functions | ReinUsesLisp | 2021-02-13 | 2 | -1/+41 |
| | | |||||
* | | vulkan_instance: Initialize Vulkan instance in a separate thread | ReinUsesLisp | 2021-02-13 | 1 | -1/+5 |
| | | | | | | | | | | | | | | Workaround an issue on Nvidia where creating a Vulkan instance from an active OpenGL thread disables threaded optimization on the driver. This optimization is important to have good performance on Nvidia OpenGL. | ||||
* | | vulkan_wrapper: Pull Windows symbols | ReinUsesLisp | 2021-02-13 | 1 | -0/+11 |
| | | |||||
* | | gpu: Report renderer errors with exceptions | ReinUsesLisp | 2021-02-13 | 24 | -227/+158 |
| | | | | | | | | | | | | Instead of using a two step initialization to report errors, initialize the GPU renderer and rasterizer on the constructor and report errors through std::runtime_error. | ||||
* | | buffer_base: Add support for cached CPU writes | ReinUsesLisp | 2021-02-13 | 1 | -61/+145 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Some games usually write memory pages currently used by the GPU, causing rendering issues (e.g. flashing geometry and shadows on Link's Awakening). To workaround this issue, Guest CPU writes are delayed until the command buffer finishes processing, but the pages are updated immediately. The overall behavior is: - CPU writes are cached until they are flushed, they update the page state, but don't change the modification state. Cached writes stop pages from being flushed, in case games have meaningful data in it. - Command processing writes (e.g. push constants) update the page state and are marked to the command processor as dirty. They don't remove the state of cached writes. | ||||
* | | Merge pull request #5900 from lioncash/unused-func | bunnei | 2021-02-10 | 2 | -37/+0 |
|\ \ | |/ |/| | video_core: Remove unused functions and variables | ||||
| * | gl_rasterizer: Remove unused variables | Lioncash | 2021-02-09 | 1 | -3/+0 |
| | | | | | | | | Resolves warnings on clang 12 | ||||
| * | texture_cache/util: Remove unused functions | Lioncash | 2021-02-09 | 1 | -34/+0 |
| | | | | | | | | Silences a few warnings on clang 12. | ||||
* | | Merge pull request #5880 from lat9nq/ffmpeg-external | Ameer J | 2021-02-09 | 1 | -6/+5 |
|\ \ | |/ |/| | cmake: FFmpeg linking rework | ||||
| * | Address reviewer comments | lat9nq | 2021-02-05 | 1 | -1/+1 |
| | | |||||
| * | CMake: Port citra-emu/citra FindFFmpeg.cmake | lat9nq | 2021-02-05 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | Also renames related CMake variables to match both the Find*FFmpeg* and variables defined within the file. Fixes odd errors produced by the old FindFFmpeg. Citra's FindFFmpeg is slightly modified here: adds Citra's copyright at the beginning, renames FFmpeg_INCLUDES to FFmpeg_INCLUDE_DIR, disables a few components in _FFmpeg_ALL_COMPONENTS, and adds the missing avutil component to the comment above. | ||||
| * | CMake: Implement YUZU_USE_BUNDLED_FFMPEG | lat9nq | 2021-02-05 | 2 | -7/+6 |
| | | | | | | | | | | | | | | | | For Linux, instructs CMake to use the FFmpeg submodule in externals. This is HEAVILY based on our usage of the late Unicorn. Minimal change to MSVC as it uses the yuzu-emu/ext-windows-bin. MinGW now targets the same ext-windows-bin libraries as MSVC for FFmpeg. Adds FFMPEG_LIBRARIES to WIN32 and simplifies video_core/CMakeLists.txt a bit. | ||||
* | | Merge pull request #5888 from Morph1984/ogl-4.6 | Rodrigo Locatti | 2021-02-08 | 1 | -1/+1 |
|\ \ | | | | | | | renderer_opengl: Update OpenGL backend version requirement to 4.6 | ||||
| * | | renderer_opengl: Update OpenGL backend version requirement to 4.6 | Morph | 2021-02-07 | 1 | -1/+1 |
| |/ | |||||
* / | video_core: Delete morton | Chloe Marcec | 2021-02-08 | 3 | -2/+0 |
|/ | | | | moron.h & morton.cpp are not used anywhere and are just empty files | ||||
* | video_core: host_shaders: Don't pass --quiet to glslangValidator if unavailable | lat9nq | 2021-02-02 | 1 | -1/+19 |
| | | | | | Prevents CMake from calling `glslangValidator` with `--quiet` when it is not available, i.e. on older downstream versions from Ubuntu. | ||||
* | Merge pull request #5795 from ReinUsesLisp/bytes-to-map-end | bunnei | 2021-01-30 | 2 | -2/+27 |
|\ | | | | | video_core/memory_manager: Add BytesToMapEnd | ||||
| * | video_core/memory_manager: Add BytesToMapEnd | ReinUsesLisp | 2021-01-22 | 2 | -2/+27 |
| | | | | | | | | | | Track map address sizes in a flat ordered map and add a method to query the number of bytes until the end of a map in a given address. | ||||
* | | Merge pull request #5836 from ReinUsesLisp/unaligned-constr-sched | LC | 2021-01-28 | 1 | -6/+6 |
|\ \ | | | | | | | vk_scheduler: Fix unaligned placement new expressions | ||||
| * | | vk_scheduler: Fix unaligned placement new expressions | ReinUsesLisp | 2021-01-28 | 1 | -6/+6 |
| | | | | | | | | | | | | | | | We were accidentaly creating an object in an unaligned memory address. Fix this by manually aligning the offset. | ||||
* | | | Merge pull request #5786 from ReinUsesLisp/glsl-cbuf | bunnei | 2021-01-28 | 1 | -1/+2 |
|\ \ \ | | | | | | | | | gl_shader_decompiler: Fix constant buffer size calculation | ||||
| * | | | gl_shader_decompiler: Fix constant buffer size calculation | ReinUsesLisp | 2021-01-21 | 1 | -1/+2 |
| | | | | | | | | | | | | | | | | | | | | The divide logic was wrong and can cause an uniform buffer size overflow. | ||||
* | | | | vulkan_device: Blacklist Intel from float16 math (#5798) | Rodrigo Locatti | 2021-01-27 | 1 | -0/+5 |
| |/ / |/| | | | | | | | | Astral Chain crashes Intel's SPIR-V compiler when using fp16. Disable this while the vendor works on a fix. | ||||
* | | | Merge pull request #5778 from ReinUsesLisp/shader-dir | bunnei | 2021-01-27 | 3 | -5/+14 |
|\ \ \ | | | | | | | | | renderer_opengl: Avoid precompiled cache and force NV GL cache directory | ||||
| * | | | renderer_opengl: Avoid precompiled cache and force NV GL cache directory | ReinUsesLisp | 2021-01-21 | 3 | -5/+14 |
| |/ / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Setting __GL_SHADER_DISK_CACHE_PATH we can force the cache directory to be in yuzu's user directory to stop commonly distributed malware from deleting our driver shader cache. And by setting __GL_SHADER_DISK_CACHE_SKIP_CLEANUP we can have an unbounded shader cache size. This has only been implemented on Windows, mostly because previous tests didn't seem to work on Linux. Disable the precompiled cache on Nvidia's driver. There's no need to hide information the driver already has in its own cache. | ||||
* | | | Merge pull request #5785 from ReinUsesLisp/buffer-dma | bunnei | 2021-01-25 | 2 | -8/+21 |
|\ \ \ | | | | | | | | | video_core/memory_manager: Flush destination buffer on CopyBlock | ||||
| * | | | video_core/memory_manager: Remove unused CopyBlockUnsafe | ReinUsesLisp | 2021-01-21 | 2 | -8/+0 |
| | | | | | | | | | | | | | | | | This function was not being used. | ||||
| * | | | video_core/memory_manager: Flush destination buffer on CopyBlock | ReinUsesLisp | 2021-01-21 | 1 | -0/+4 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | When we copy into a buffer, it might contain data modified from the GPU on the same pages. Because of this, we have to flush the contents before writing new data. An alternative approach would be to write the data in place, but games can also write data in other ways, invalidating our contents. Fixes geometry in Zombie Panic in Wonderland DX. | ||||
| * | | | video_core/memory_manager: Add GPU address based flush method | ReinUsesLisp | 2021-01-21 | 2 | -0/+17 |
| |/ / | | | | | | | | | | Allow flushing rasterizer contents based on a GPU address. | ||||
* | | | Revert "Start of Integer flags implementation" | ReinUsesLisp | 2021-01-25 | 3 | -59/+3 |
| | | | | | | | | | | | | | | | This reverts #4713. The implementation in that PR is not accurate. It does not reflect the behavior seen in hardware. | ||||
* | | | vk_graphics_pipeline: Fix narrowing conversion on MSVC | ReinUsesLisp | 2021-01-25 | 1 | -2/+2 |
| | | | |||||
* | | | Merge pull request #5807 from ReinUsesLisp/vc-warnings | LC | 2021-01-24 | 9 | -62/+93 |
|\ \ \ | | | | | | | | | video_core: Silence the remaining gcc warnings and enforce them | ||||
| * | | | cmake: Enforce -Warray-bounds and -Wmissing-field-initializers globally | ReinUsesLisp | 2021-01-24 | 1 | -2/+0 |
| | | | | |||||
| * | | | video_core/cmake: Enforce -Warray-bounds and -Wmissing-field-initializers | ReinUsesLisp | 2021-01-24 | 1 | -0/+2 |
| | | | | |||||
| * | | | video_core: Silence -Wmissing-field-initializers warnings | ReinUsesLisp | 2021-01-24 | 5 | -25/+56 |
| | | | | |||||
| * | | | maxwell_3d: Silence array bounds warnings | ReinUsesLisp | 2021-01-24 | 2 | -35/+35 |
| | | | | |||||
| * | | | maxwell_to_vk: Silence -Wextra warnings about using different enum types | ReinUsesLisp | 2021-01-24 | 2 | -2/+2 |
| | | | | |||||
* | | | | Merge pull request #5363 from ReinUsesLisp/vk-image-usage | Rodrigo Locatti | 2021-01-24 | 3 | -38/+72 |
|\ \ \ \ | | | | | | | | | | | vk_texture_cache: Support image store on sRGB images with VkImageViewUsageCreateInfo | ||||
| * | | | | vk_texture_cache: Support image store on sRGB images with VkImageViewUsageCreateInfo | ReinUsesLisp | 2021-01-24 | 3 | -38/+72 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Vulkan 1.0 didn't support creating sRGB image views on an ABGR8 VkImage with storage usage bits. VK_KHR_maintenance2 addressed this allowing to reduce the usage bits on a VkImageView. To allow image store on non-sRGB image views when the VkImage is created with sRGB, always create VkImages without sRGB and add the sRGB format on the view. | ||||
* | | | | | vulkan_device: Lift VK_EXT_extended_dynamic_state blacklist on RDNA | ReinUsesLisp | 2021-01-25 | 1 | -23/+0 |
| | | | | | | | | | | | | | | | | | | | | It seems to be safe to use this on new drivers. | ||||
* | | | | | Merge pull request #5796 from ReinUsesLisp/vertex-a-bypass-vk | bunnei | 2021-01-24 | 1 | -9/+3 |
|\ \ \ \ \ | | | | | | | | | | | | | vk_pipeline_cache: Properly bypass VertexA shaders | ||||
| * | | | | | vk_pipeline_cache: Properly bypass VertexA shaders | ReinUsesLisp | 2021-01-23 | 1 | -9/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The VertexA stage is not yet implemented, but Vulkan is adding its descriptors, causing a discrepancy in the pushed descriptors and the template. This generally ends up in a driver side crash. Bypass the VertexA stage for now. | ||||
* | | | | | | host_shaders/cmake: Pass --quiet to glslang to keep it quiet | ReinUsesLisp | 2021-01-24 | 1 | -1/+1 |
| |_|/ / / |/| | | | | | | | | | | | | | | Silences noisy builds on toolchains. | ||||
* | | | | | shader_ir: Fix comment typo | Levi Behunin | 2021-01-23 | 1 | -1/+1 |
| | | | | | |||||
* | | | | | video_core/cmake: Properly generate fatal errors on Aftermath | ReinUsesLisp | 2021-01-23 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | Fix "message(ERROR ..." to "message(FATAL_ERROR ..." to properly stop cmake when Nsight Aftermath can't be configured. | ||||
* | | | | | nsight_aftermath_tracker: Fix build issues when enabled | ReinUsesLisp | 2021-01-23 | 2 | -16/+5 |
|/ / / / | | | | | | | | | | | | | Fixes a bunch of build errors when Nsight Aftermath is properly enabled. | ||||
* | | | | Merge pull request #4713 from behunin/int-flags | bunnei | 2021-01-23 | 3 | -3/+59 |
|\ \ \ \ | |_|_|/ |/| | | | Start of Integer flags implementation | ||||
| * | | | Merge remote-tracking branch 'upstream/master' into int-flags | Levi | 2021-01-11 | 262 | -10235/+16506 |
| |\ \ \ | |||||
| * | | | | More forgetting... duh | Levi Behunin | 2020-09-25 | 1 | -2/+2 |
| | | | | | |||||
| * | | | | Forgot to apply suggestion here as well | Levi Behunin | 2020-09-25 | 1 | -1/+1 |
| | | | | | |||||
| * | | | | Address Comments | Levi Behunin | 2020-09-25 | 3 | -25/+34 |
| | | | | | |||||
| * | | | | Start of Integer flags implementation | Levi Behunin | 2020-09-25 | 3 | -3/+50 |
| | | | | | |||||
* | | | | | Merge pull request #5361 from ReinUsesLisp/vk-shader-comment | bunnei | 2021-01-21 | 1 | -1/+4 |
|\ \ \ \ \ | |_|_|_|/ |/| | | | | vk_shader_decompiler: Show comments as OpUndef with a type | ||||
| * | | | | vk_shader_decompiler: Show comments as OpUndef with a type | ReinUsesLisp | 2021-01-16 | 1 | -1/+4 |
| | |_|/ | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Silence the new validation layer error about SPIR-V not allowing OpUndef on a OpTypeVoid, even when the SPIR-V spec doesn't say anything against it. They will be inserted as an undefined int to avoid SPIRV-Cross and validation errors, but only when a debugging tool is attached. | ||||
* | | | | Merge pull request #5746 from lioncash/sign-compare | Rodrigo Locatti | 2021-01-18 | 1 | -1/+1 |
|\ \ \ \ | | | | | | | | | | | texture_cache/util: Resolve -Wsign-compare warning | ||||
| * | | | | texture_cache/util: Resolve -Wsign-compare warning | Lioncash | 2021-01-17 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | Resolves a -Wsign-compare warning on Clang. | ||||
* | | | | | Merge pull request #5745 from lioncash/documentation | Rodrigo Locatti | 2021-01-17 | 2 | -4/+3 |
|\ \ \ \ \ | | | | | | | | | | | | | video_core: Resolve -Wdocumentation warnings | ||||
| * | | | | | video_core: Resolve -Wdocumentation warnings | Lioncash | 2021-01-17 | 2 | -4/+3 |
| |/ / / / | | | | | | | | | | | | | | | | Silences some -Wdocumentation warnings on Clang. | ||||
* / / / / | vulkan_debug_callback: Add missing header guard | Lioncash | 2021-01-17 | 1 | -0/+2 |
|/ / / / | | | | | | | | | | | | | Prevents inclusion issues from occurring. | ||||
* | | | | Merge pull request #5262 from ReinUsesLisp/buffer-base | Rodrigo Locatti | 2021-01-16 | 2 | -0/+496 |
|\ \ \ \ | | | | | | | | | | | buffer_cache/buffer_base: Add a range tracking buffer container and tests | ||||
| * | | | | buffer_cache/buffer_base: Add a range tracking buffer container | ReinUsesLisp | 2021-01-13 | 2 | -0/+496 |
| | |_|/ | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | It keeps track of the modified CPU and GPU ranges on a CPU page granularity, notifying the given rasterizer about state changes in the tracking behavior of the buffer. Use a small vector optimization to store buffers smaller than 256 KiB locally instead of using free store memory allocations. | ||||
* | | | | Merge pull request #5297 from ReinUsesLisp/vulkan-allocator-common | Rodrigo Locatti | 2021-01-16 | 19 | -554/+609 |
|\ \ \ \ | |_|/ / |/| | | | vulkan_memory_allocator: Improvements to the memory allocator | ||||
| * | | | vulkan_memory_allocator: Remove unnecesary 'device' memory from commits | ReinUsesLisp | 2021-01-15 | 2 | -15/+15 |
| | | | | |||||
| * | | | vk_texture_cache: Use Download memory types for texture flushes | ReinUsesLisp | 2021-01-15 | 2 | -5/+10 |
| | | | | | | | | | | | | | | | | Use the Download memory type where it matters. | ||||
| * | | | vulkan_memory_allocator: Add allocation support for download types | ReinUsesLisp | 2021-01-15 | 2 | -55/+91 |
| | | | | | | | | | | | | | | | | | | | | Implements the allocator logic to handle download memory types. This will try to use HOST_CACHED_BIT when available. | ||||
| * | | | vulkan_memory_allocator: Add "download" memory usage hint | ReinUsesLisp | 2021-01-15 | 9 | -45/+86 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | Allow users of the allocator to hint memory usage for downloads. This removes the non-descriptive boolean passed for "host visible" or not host visible memory commits, and uses an enum to hint device local, upload and download usages. | ||||
| * | | | vulkan_common: Move allocator to the common directory | ReinUsesLisp | 2021-01-15 | 11 | -11/+11 |
| | | | | | | | | | | | | | | | | Allow using the abstraction from the OpenGL backend. | ||||
| * | | | renderer_vulkan: Rename Vulkan memory manager to memory allocator | ReinUsesLisp | 2021-01-15 | 15 | -54/+52 |
| | | | | | | | | | | | | | | | | | | | | "Memory manager" collides with the guest GPU memory manager, and a memory allocator sounds closer to what the abstraction aims to be. | ||||
| * | | | vk_memory_manager: Improve memory manager and its API | ReinUsesLisp | 2021-01-15 | 13 | -343/+318 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Fix a bug where the memory allocator could leave gaps between commits. To fix this the allocation algorithm was reworked, although it's still short in number of lines of code. Rework the allocation API to self-contained movable objects instead of naively using an unique_ptr to do the job for us. Remove the VK prefix. | ||||
* | | | | Merge pull request #5358 from ReinUsesLisp/rename-insert-padding | LC | 2021-01-15 | 6 | -123/+123 |
|\ \ \ \ | |/ / / |/| | | | common/common_funcs: Rename INSERT_UNION_PADDING_{BYTES,WORDS} to _NOINIT | ||||
| * | | | common/common_funcs: Rename INSERT_UNION_PADDING_{BYTES,WORDS} to _NOINIT | ReinUsesLisp | 2021-01-15 | 6 | -123/+123 |
| | | | | | | | | | | | | | | | | INSERT_PADDING_BYTES_NOINIT is more descriptive of the underlying behavior. | ||||
* | | | | Merge pull request #5357 from ReinUsesLisp/alignment-log2 | LC | 2021-01-15 | 3 | -11/+11 |
|\ \ \ \ | | | | | | | | | | | common/alignment: Rename AlignBits to AlignUpLog2 and use constraints | ||||
| * | | | | common/alignment: Rename AlignBits to AlignUpLog2 | ReinUsesLisp | 2021-01-15 | 3 | -11/+11 |
| |/ / / | | | | | | | | | | | | | AlignUpLog2 describes what the function does better than AlignBits. | ||||
* / / / | common/bit_util: Replace CLZ/CTZ operations with standardized ones | Lioncash | 2021-01-15 | 3 | -5/+5 |
|/ / / | | | | | | | | | | Makes for less code that we need to maintain. | ||||
* | | | video_core/cmake: Remove Werror flags already defined code-base wide | ReinUsesLisp | 2021-01-15 | 1 | -2/+0 |
| | | | | | | | | | | | | These flags are already defined in src/cmake. | ||||
* | | | Merge pull request #5351 from ReinUsesLisp/vc-unused-functions | LC | 2021-01-15 | 1 | -4/+0 |
|\ \ \ | | | | | | | | | cmake: Enforce -Wunused-function code-base wise | ||||
| * | | | cmake: Enforce -Wunused-function code-base wide | ReinUsesLisp | 2021-01-15 | 1 | -1/+0 |
| | | | | |||||
| * | | | video_core: Enforce -Wunused-function | ReinUsesLisp | 2021-01-15 | 1 | -0/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Stops us from merging code with unused functions in the future. If something is invoked behind conditionally evaluated code in a way that the language can't see it (e.g. preprocessor macros), the potentially unused function should use [[maybe_unused]]. | ||||
| * | | | vk_buffer_cache: Remove unused function | ReinUsesLisp | 2021-01-15 | 1 | -4/+0 |
| | | | | |||||
* | | | | Merge pull request #5350 from ReinUsesLisp/vk-init-warns | Rodrigo Locatti | 2021-01-15 | 2 | -145/+146 |
|\ \ \ \ | | | | | | | | | | | vulkan_common: Silence missing initializer warnings | ||||
| * | | | | vulkan_common: Silence missing initializer warnings | ReinUsesLisp | 2021-01-15 | 2 | -145/+146 |
| |/ / / | | | | | | | | | | | | | Silence warnings explicitly initializing all members on construction. | ||||
* | | | | Merge pull request #5349 from ReinUsesLisp/anv-fix | LC | 2021-01-15 | 2 | -18/+20 |
|\ \ \ \ | | | | | | | | | | | vulkan_device: Enable shaderStorageImageMultisample conditionally | ||||
| * | | | | vulkan_device: Enable shaderStorageImageMultisample conditionally | ReinUsesLisp | 2021-01-15 | 2 | -18/+20 |
| |/ / / | | | | | | | | | | | | | Fix Vulkan initialization on ANV. | ||||
* | | | | astc: Increase integer encoded vector size | ReinUsesLisp | 2021-01-15 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | Invalid ASTC textures seem to write more bytes here, increase the size to something that can't make us push out of bounds. | ||||
* | | | | astc: Return zero on out of bound bits | ReinUsesLisp | 2021-01-15 | 1 | -17/+22 |
|/ / / | | | | | | | | | | | | | Avoid out of bound reads on invalid ASTC textures. Games can bind invalid textures that make us read or write out of bounds. | ||||
* | | | vulkan_device: Remove requirement on shaderStorageImageMultisample | ReinUsesLisp | 2021-01-13 | 1 | -1/+0 |
| | | | | | | | | | | | | | | | yuzu doesn't currently emulate MS image stores. Requiring this makes no sense for now. Fixes ANV not booting any games on Vulkan. | ||||
* | | | Merge pull request #5311 from ReinUsesLisp/fence-wait | bunnei | 2021-01-13 | 3 | -54/+18 |
|\ \ \ | |/ / |/| | | vk_fence_manager: Use timeline semaphores instead of spin waits | ||||
| * | | vk_fence_manager: Use timeline semaphores instead of spin waits | ReinUsesLisp | 2021-01-08 | 3 | -54/+18 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | With timeline semaphores we can avoid creating objects. Instead of creating an event, grab the current tick from the scheduler and flush the current command buffer. When the fence has to be queried/waited, we can do so against the master semaphore instead of spinning on an event. If Vulkan supported NVN like events or fences, we could signal from the command buffer and wait for that without splitting things in two separate command buffers. | ||||
* | | | Merge pull request #5231 from ReinUsesLisp/dyn-bindings | bunnei | 2021-01-08 | 3 | -26/+12 |
|\ \ \ | |/ / |/| | | renderer_vulkan/fixed_pipeline_state: Move enabled bindings to static state | ||||
| * | | renderer_vulkan/fixed_pipeline_state: Move enabled bindings to static state | ReinUsesLisp | 2020-12-26 | 3 | -26/+12 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Without using VK_EXT_robustness2, we can't consider the 'enabled' (not null) vertex buffers as dynamic state, as this leads to invalid Vulkan state. Move this to static state that is always hashed and compared in the pipeline key. The bits for enabled vertex buffers are moved into the attribute state bitfield. This is not 'correct' as it's not an attribute state, but that struct has bits to spare, and it's used in an array of 32 elements (the exact same number of vertex buffer bindings). | ||||
* | | | remove inaccurate reference | Ameer J | 2021-01-07 | 1 | -1/+1 |
| | | | | | | | | | Co-authored-by: LC <mathew1800@gmail.com> | ||||
* | | | fix for nvdec disabled, cleanup host1x | ameerj | 2021-01-07 | 2 | -61/+9 |
| | | | |||||
* | | | nvdec syncpt incorporation | ameerj | 2021-01-07 | 4 | -17/+16 |
| | | | | | | | | | | | | laying the groundwork for async gpu, although this does not fully implement async nvdec operations | ||||
* | | | vulkan_library: Common::DynamicLibrary::Open is [[nodiscard]] | MerryMage | 2021-01-07 | 1 | -1/+1 |
| | | | | | | | | | | | | Ignore the return value on __APPLE__ systems as well | ||||
* | | | texture_cache: Replace PAGE_SHIFT with PAGE_BITS | MerryMage | 2021-01-07 | 1 | -6/+6 |
| | | | | | | | | | | | | PAGE_SHIFT is a #define in system headers that leaks into user code on some systems | ||||
* | | | Merge pull request #5288 from ReinUsesLisp/workaround-garbage | Morph | 2021-01-06 | 12 | -120/+148 |
|\ \ \ | | | | | | | | | gl_texture_cache: Avoid format views on Intel and AMD | ||||
| * | | | gl_texture_cache: Avoid format views on Intel and AMD | ReinUsesLisp | 2021-01-04 | 11 | -21/+48 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | Intel and AMD proprietary drivers are incapable of rendering to texture views of different formats than the original texture. Avoid creating these at a cache level. This will consume more memory, emulating them with copies. | ||||
| * | | | gl_texture_cache: Create base images with sRGB | ReinUsesLisp | 2021-01-04 | 2 | -99/+100 |
| | | | | | | | | | | | | | | | | | | | | | | | | This breaks accelerated decoders trying to imageStore into images with sRGB. The decoders are currently disabled so this won't cause issues at runtime. | ||||
* | | | | Merge pull request #5289 from ReinUsesLisp/vulkan-device | bunnei | 2021-01-06 | 31 | -62/+55 |
|\ \ \ \ | | | | | | | | | | | vulkan_common: Move device abstraction to the common directory and allow surfaceless devices | ||||
| * | | | | vulkan_device: Allow creating a device without surface | ReinUsesLisp | 2021-01-04 | 1 | -3/+3 |
| | | | | | |||||
| * | | | | renderer_vulkan/nsight_aftermath_tracker: Move to vulkan_common | ReinUsesLisp | 2021-01-04 | 5 | -30/+21 |
| | | | | | |||||
| * | | | | renderer_vulkan: Move device abstraction to vulkan_common | ReinUsesLisp | 2021-01-04 | 29 | -29/+31 |
| | | | | | |||||
* | | | | | Merge pull request #5292 from ReinUsesLisp/empty-set | LC | 2021-01-05 | 1 | -2/+4 |
|\ \ \ \ \ | |/ / / / |/| | | | | vk_rasterizer: Skip binding empty descriptor sets on compute | ||||
| * | | | | vk_rasterizer: Skip binding empty descriptor sets on compute | ReinUsesLisp | 2021-01-04 | 1 | -2/+4 |
| |/ / / | | | | | | | | | | | | | | | | | Fixes unit tests where compute shaders had no descriptors in the set, making Vulkan drivers crash when binding an empty set. | ||||
* / / / | renderer_vulkan: Rename VKDevice to Device | ReinUsesLisp | 2021-01-03 | 52 | -169/+166 |
|/ / / | | | | | | | | | | | | | | | | | | | The "VK" prefix predates the "Vulkan" namespace. It was carried around the codebase for consistency. "VKDevice" currently is a bad alias with "VkDevice" (only an upcase character of difference) that can cause confusion. Rename all instances of it. | ||||
* | | | Merge pull request #5230 from ReinUsesLisp/vulkan-common | Rodrigo Locatti | 2021-01-03 | 60 | -486/+574 |
|\ \ \ | | | | | | | | | vulkan_common: Move reusable Vulkan abstractions to a separate directory | ||||
| * | | | vulkan_instance: Allow different Vulkan versions and enforce 1.1 | ReinUsesLisp | 2020-12-31 | 7 | -41/+39 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | For listing the available physical devices we can use Vulkan 1.0. Now that MoltenVK supports 1.1 we can require it for running games. Add missing documentation. | ||||
| * | | | vk_device: Use an array to report lacking device limits | ReinUsesLisp | 2020-12-31 | 1 | -13/+17 |
| | | | | | | | | | | | | | | | | This makes easier to add and tune the required device limits. | ||||
| * | | | vk_device: Stop initialization when device is not suitable | ReinUsesLisp | 2020-12-31 | 2 | -61/+39 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | VKDevice::IsSuitable was not being called. To address this issue, check suitability before initialization and throw an exception if it fails. By doing this, we can deduplicate some code on queue searches. Previosuly we would first search if a present and graphics queue existed, then on initialization we would search again to find the index. | ||||
| * | | | renderer_vulkan: Remove two step initialization on VKDevice | ReinUsesLisp | 2020-12-31 | 6 | -31/+10 |
| | | | | | | | | | | | | | | | | | | | | The Vulkan device abstraction either initializes successfully on the constructor or throws a Vulkan exception. | ||||
| * | | | renderer_vulkan: Throw when enumerating devices fails | ReinUsesLisp | 2020-12-31 | 5 | -33/+21 |
| | | | | | | | | | | | | | | | | | | | | | | | | Report device enumeration errors with exceptions to be consistent with other initialization related function calls. Reduces the amount of code to maintain. | ||||
| * | | | renderer_vulkan: Initialize surface in separate file | ReinUsesLisp | 2020-12-31 | 6 | -73/+109 |
| | | | | | | | | | | | | | | | | | | | | | | | | Move surface initialization code to a separate file. It's unlikely to use this code outside of Vulkan, but keeping platform-specific code (Win32, Xlib, Wayland) in its own translation unit keeps things cleaner. | ||||
| * | | | renderer_vulkan: Catch and report exceptions | ReinUsesLisp | 2020-12-31 | 1 | -2/+5 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | Move more Vulkan code to report errors with exceptions and report them through a log before notifying it with an error boolean for backwards compatibility. In the future we can replace the rasterizer two-step initialization to always use exceptions. | ||||
| * | | | renderer_vulkan: Create debug callback on separate file and throw | ReinUsesLisp | 2020-12-31 | 8 | -79/+88 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Initialize debug callbacks (messenger) from a separate file. This allows sharing code with different backends. Change our Vulkan error handling to use exceptions instead of error codes, simplifying the initialization process. | ||||
| * | | | renderer_vulkan: Move instance initialization to a separate file | ReinUsesLisp | 2020-12-31 | 4 | -111/+176 |
| | | | | | | | | | | | | | | | | | | | | | | | | Simplify Vulkan's backend initialization code by moving it to a separate file, allowing us to initialize a Vulkan instance from different backends. | ||||
| * | | | vulkan_common: Rename renderer_vulkan/wrapper.h to vulkan_common/vulkan_wrapper.h | ReinUsesLisp | 2020-12-31 | 51 | -51/+51 |
| | | | | | | | | | | | | | | | | Allows sharing Vulkan wrapper code between different rendering backends. | ||||
| * | | | vulkan_common: Move dynamic library load to a separate file | ReinUsesLisp | 2020-12-31 | 4 | -31/+59 |
| | | | | | | | | | | | | | | | | | | | | Allows us to initialize a Vulkan dynamic library from different backends without duplicating code. | ||||
* | | | | general: Fix various spelling errors | Morph | 2021-01-02 | 2 | -2/+2 |
| | | | | |||||
* | | | | Merge pull request #5208 from bunnei/service-threads | bunnei | 2020-12-31 | 11 | -297/+164 |
|\ \ \ \ | |/ / / |/| | | | Service threads | ||||
| * | | | gpu: gpu_thread: Ensure MicroProfile is shutdown on exit. | bunnei | 2020-12-29 | 1 | -0/+3 |
| | | | | |||||
| * | | | video_core: gpu_thread: Do not wait when system is powered down. | bunnei | 2020-12-29 | 1 | -1/+2 |
| | | | | |||||
| * | | | video_core: gpu: Implement synchronous mode using threaded GPU. | bunnei | 2020-12-29 | 4 | -12/+34 |
| | | | | |||||
| * | | | video_core: gpu: Refactor out synchronous/asynchronous GPU implementations. | bunnei | 2020-12-29 | 10 | -289/+130 |
| |/ / | | | | | | | | | | - We must always use a GPU thread now, even with synchronous GPU. | ||||
* | | | half_set: Resolve -Wmaybe-uninitialized warnings | Lioncash | 2020-12-30 | 1 | -7/+7 |
| | | | |||||
* | | | maxwell_to_vk: Initialize usage variable in SurfaceFormat() | Lioncash | 2020-12-30 | 1 | -1/+1 |
| | | | | | | | | | | | | Silences a -Wmaybe-uninitialized warning | ||||
* | | | video_core: Rewrite the texture cache | ReinUsesLisp | 2020-12-30 | 152 | -8101/+10359 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The current texture cache has several points that hurt maintainability and performance. It's easy to break unrelated parts of the cache when doing minor changes. The cache can easily forget valuable information about the cached textures by CPU writes or simply by its normal usage.The current texture cache has several points that hurt maintainability and performance. It's easy to break unrelated parts of the cache when doing minor changes. The cache can easily forget valuable information about the cached textures by CPU writes or simply by its normal usage. This commit aims to address those issues. | ||||
* | | | video_core: Add a delayed destruction ring abstraction | ReinUsesLisp | 2020-12-30 | 2 | -0/+33 |
| | | | |||||
* | | | host_shaders: Add Vulkan assembler compute shaders | ReinUsesLisp | 2020-12-30 | 4 | -0/+96 |
| | | | |||||
* | | | host_shaders: Add helper to blit depth stencil fragment shader | ReinUsesLisp | 2020-12-30 | 2 | -0/+17 |
| | | | |||||
* | | | host_shaders: Add texture color blit fragment shader | ReinUsesLisp | 2020-12-30 | 2 | -0/+15 |
| | | | |||||
* | | | host_shaders: Add shaders to present to the swapchain | ReinUsesLisp | 2020-12-30 | 3 | -0/+36 |
| | | | |||||
* | | | host_shaders: Add shaders to convert between depth and color images | ReinUsesLisp | 2020-12-30 | 3 | -0/+28 |
| | | | |||||
* | | | host_shaders: Add compute shader to copy BC4 as RG32UI to RGBA8 | ReinUsesLisp | 2020-12-30 | 2 | -0/+71 |
| | | | |||||
* | | | host_shaders: Add shader to render a full screen triangle | ReinUsesLisp | 2020-12-30 | 2 | -0/+30 |
| | | | |||||
* | | | host_shaders: Add pitch linear upload compute shader | ReinUsesLisp | 2020-12-30 | 2 | -0/+87 |
| | | | |||||
* | | | host_shaders: Add block linear upload compute shaders | ReinUsesLisp | 2020-12-30 | 3 | -0/+249 |
| | | | |||||
* | | | host_shaders: Add copyright headers to OpenGL present shaders | ReinUsesLisp | 2020-12-30 | 2 | -0/+8 |
| | | | |||||
* | | | video_core/host_shaders: Add support for prebuilt SPIR-V shaders | ReinUsesLisp | 2020-12-30 | 1 | -16/+37 |
|/ / | | | | | | | | | Add support for building SPIR-V shaders from GLSL and generating headers to include the text of those same GLSL shaders to consume from OpenGL. | ||||
* | | Merge pull request #5226 from ReinUsesLisp/c4715-vc | Rodrigo Locatti | 2020-12-25 | 2 | -0/+2 |
|\ \ | | | | | | | video_core: Enforce C4715 (not all control paths return a value) | ||||
| * | | video_core: Enforce C4715 (not all control paths return a value) | ReinUsesLisp | 2020-12-25 | 1 | -0/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Most of the time people write code that always returns a value, terminates execution, throws an exception, or uses an unconventional jump primitive. This is not always true when we build without asserts on mainline builds. To avoid introducing undefined behavior on our most used builds, enforce this warning signalling an error and stopping the build from shipping. | ||||
| * | | vk_shader_decompiler: Silence warning when compiling without asserts | ReinUsesLisp | 2020-12-25 | 1 | -0/+1 |
| | | | |||||
* | | | cmake: Always enable Vulkan | ReinUsesLisp | 2020-12-25 | 2 | -79/+66 |
|/ / | | | | | | | | | Removes the unnecesary burden of maintaining separate #ifdef paths and allows us sharing generic Vulkan code across APIs. | ||||
* | | Merge pull request #5157 from lioncash/array-dirty | bunnei | 2020-12-15 | 1 | -34/+33 |
|\ \ | | | | | | | maxwell_3d: Remove unused dirty_pointer array | ||||
| * | | maxwell_3d: Move member variables to end of class | Lioncash | 2020-12-07 | 1 | -31/+32 |
| | | | | | | | | | | | | Follows our established coding style. | ||||
| * | | maxwell_3d: Resolve -Wdocumentation warning | Lioncash | 2020-12-07 | 1 | -1/+1 |
| | | | | | | | | | | | | Removes a documentation comment for a non-existent member. | ||||
| * | | maxwell_3d: Remove unused dirty_pointer array | Lioncash | 2020-12-07 | 1 | -2/+0 |
| | | | | | | | | | | | | This is unused and removing it shrinks the structure by 3584 bytes. | ||||
* | | | Merge pull request #5162 from lioncash/copy-shader | bunnei | 2020-12-10 | 1 | -1/+1 |
|\ \ \ | | | | | | | | | gl_shader_decompiler: Elide unnecessary copies within DeclareConstantBuffers() | ||||
| * | | | gl_shader_decompiler: Elide unnecessary copies within DeclareConstantBuffers() | Lioncash | 2020-12-07 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | Resolves a -Wrange-loop-analysis warning. | ||||
* | | | | Merge pull request #5164 from lioncash/contains | Rodrigo Locatti | 2020-12-08 | 8 | -16/+13 |
|\ \ \ \ | | | | | | | | | | | video_core: Make use of ordered container contains() where applicable | ||||
| * | | | | video_core: Make use of ordered container contains() where applicable | Lioncash | 2020-12-07 | 8 | -16/+13 |
| |/ / / | | | | | | | | | | | | | | | | | | | | | With C++20, we can use the more concise contains() member function instead of comparing the result of the find() call with the end iterator. | ||||
* / / / | ast: Improve string concat readability in operator() | Lioncash | 2020-12-07 | 1 | -5/+4 |
|/ / / | | | | | | | | | | Provides an in-place format string to make it more pleasant to read. | ||||
* | | | Merge pull request #5149 from comex/xx-map-interval | bunnei | 2020-12-07 | 1 | -1/+2 |
|\ \ \ | | | | | | | | | map_interval: Change field order to address uninitialized field warning | ||||
| * | | | map_interval: Change field order to address uninitialized field warning | comex | 2020-12-07 | 1 | -1/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | Clang complains about `new_chunk`'s constructor using the then-uninitialized `first_chunk` (even though it's just to get a pointer into it). | ||||
* | | | | Merge pull request #5159 from lioncash/move-amend | Rodrigo Locatti | 2020-12-07 | 1 | -2/+2 |
|\ \ \ \ | | | | | | | | | | | shader_ir: std::move node within DeclareAmend() | ||||
| * | | | | shader_ir: std::move node within DeclareAmend() | Lioncash | 2020-12-07 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | Same behavior, but elides an unnecessary atomic reference count increment and decrement. | ||||
* | | | | | buffer_block: Mark interface as nodiscard where applicable | Lioncash | 2020-12-07 | 1 | -7/+7 |
| | | | | | | | | | | | | | | | | | | | | Prevents logic errors from occurring from unused values. | ||||
* | | | | | buffer_block: Remove unnecessary includes | Lioncash | 2020-12-07 | 1 | -5/+0 |
| | | | | | | | | | | | | | | | | | | | | Reduces the amount of dependencies the header pulls in. | ||||
* | | | | | video_core: Remove unnecessary enum class casting in logging messages | Lioncash | 2020-12-07 | 33 | -148/+125 |
|/ / / / | | | | | | | | | | | | | | | | | | | | | | | | | fmt now automatically prints the numeric value of an enum class member by default, so we don't need to use casts any more. Reduces the line noise a bit. | ||||
* | | | | Merge pull request #5152 from comex/xx-override | LC | 2020-12-07 | 1 | -1/+1 |
|\ \ \ \ | | | | | | | | | | | renderer_vulkan: Add missing `override` specifier | ||||
| * | | | | renderer_vulkan: Add missing `override` specifier | comex | 2020-12-07 | 1 | -1/+1 |
| |/ / / | |||||
* | | | | Merge pull request #5136 from lioncash/video-shadow3 | LC | 2020-12-07 | 45 | -280/+293 |
|\ \ \ \ | | | | | | | | | | | video_core: Resolve more variable shadowing scenarios pt.3 | ||||
| * | | | | video_core: Resolve more variable shadowing scenarios pt.3 | Lioncash | 2020-12-05 | 45 | -280/+293 |
| | |/ / | |/| | | | | | | | | | | | | | | Cleans out the rest of the occurrences of variable shadowing and makes any further occurrences of shadowing compiler errors. | ||||
* | | | | video_core: Adjust `NUM` macro to avoid Clang warning | comex | 2020-12-07 | 3 | -3/+3 |
| |/ / |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The previous definition was: #define NUM(field_name) (sizeof(Maxwell3D::Regs::field_name) / sizeof(u32)) In cases where `field_name` happens to refer to an array, Clang thinks `sizeof(an array value) / sizeof(a type)` is an instance of the idiom where `sizeof` is used to compute an array length. So it thinks the type in the denominator ought to be the array element type, and warns if it isn't, assuming this is a mistake. In reality, `NUM` is not used to get array lengths at all, so there is no mistake. Silence the warning by applying Clang's suggested workaround of parenthesizing the denominator. | ||||
* | | | maxwell_dma: Rename RenderEnable::Mode::FALSE and TRUE to avoid name conflict | comex | 2020-12-05 | 1 | -5/+7 |
|/ / | | | | | | | | | | | | | On Apple platforms, FALSE and TRUE are defined as macros by <mach/boolean.h>, which is included by various system headers. Note that there appear to be no actual users of the names to fix up. | ||||
* | | video_core: Resolve more variable shadowing scenarios pt.2 | Lioncash | 2020-12-05 | 39 | -296/+305 |
| | | | | | | | | | | | | | | Migrates the video core code closer to enabling variable shadowing warnings as errors. This primarily sorts out shadowing occurrences within the Vulkan code. | ||||
* | | Merge pull request #5124 from lioncash/video-shadow | bunnei | 2020-12-05 | 42 | -206/+219 |
|\ \ | | | | | | | video_core: Resolve more variable shadowing scenarios | ||||
| * | | video_core: Resolve more variable shadowing scenarios | Lioncash | 2020-12-04 | 42 | -206/+219 |
| | | | | | | | | | | | | | | | | | | Resolves variable shadowing scenarios up to the end of the OpenGL code to make it nicer to review. The rest will be resolved in a following commit. | ||||
* | | | Merge pull request #5127 from FearlessTobi/port-5617 | bunnei | 2020-12-05 | 1 | -3/+3 |
|\ \ \ | | | | | | | | | Port citra-emu/citra#5617: "Fix telemetry-related exit crash from use-after-free" | ||||
| * | | | Fix telemetry-related exit crash from use-after-free | FearlessTobi | 2020-12-05 | 1 | -3/+3 |
| |/ / | | | | | | | | | | Co-Authored-By: xperia64 <xperia64@users.noreply.github.com> | ||||
* / / | codec: Remove deprecated usage of AVCodecContext::refcounted_frames | Lioncash | 2020-12-04 | 1 | -1/+0 |
|/ / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This was only necessary for use with the avcodec_decode_video2/avcoded_decode_audio4 APIs which are also deprecated. Given we use avcodec_send_packet/avcodec_receive_frame, this isn't necessary, this is even indicated directly within the FFmpeg API changes document here on 2017-09-26: https://github.com/FFmpeg/FFmpeg/blob/master/doc/APIchanges#L410 This prevents our code from breaking whenever we update to a newer version of FFmpeg in the future if they ever decide to fully remove this API member. | ||||
* | | Merge pull request #5064 from lioncash/node-shadow | bunnei | 2020-12-04 | 1 | -75/+77 |
|\ \ | | | | | | | node: Eliminate variable shadowing | ||||
| * | | node: Mark member functions as [[nodiscard]] where applicable | Lioncash | 2020-12-03 | 1 | -29/+29 |
| | | | | | | | | | | | | Prevents logic bugs from accidentally ignoring the return value. | ||||
| * | | node: Eliminate variable shadowing | Lioncash | 2020-12-03 | 1 | -47/+49 |
| | | | |||||
* | | | vp9/vic: Resolve pessimizing moves | Lioncash | 2020-12-03 | 2 | -11/+11 |
|/ / | | | | | | | | | Removes the usage of moves that don't result in behavior different from a copy, or otherwise would prevent copy elision from occurring. | ||||
* | | Merge pull request #5002 from ameerj/nvdec-frameskip | bunnei | 2020-12-03 | 10 | -340/+234 |
|\ \ | | | | | | | nvdec: Queue and display all decoded frames, cleanup decoders | ||||
| * | | Limit queue size to 10 frames | ameerj | 2020-11-26 | 1 | -0/+4 |
| | | | | | | | | | | | | Workaround for ZLA, which seems to decode and queue twice as many frames as it displays. | ||||
| * | | Address PR feedback | ameerj | 2020-11-26 | 4 | -32/+33 |
| | | | | | | | | | | | | | | | | | | remove some redundant moves, make deleter match naming guidelines. Co-Authored-By: LC <712067+lioncash@users.noreply.github.com> | ||||
| * | | Queue decoded frames, cleanup decoders | ameerj | 2020-11-25 | 10 | -338/+227 |
| | | | |||||
* | | | Merge pull request #5013 from ReinUsesLisp/vk-early-z | bunnei | 2020-11-30 | 6 | -11/+19 |
|\ \ \ | | | | | | | | | vk_shader_decompiler: Implement force early fragment tests | ||||
| * | | | vk_shader_decompiler: Implement force early fragment tests | ReinUsesLisp | 2020-11-26 | 6 | -11/+19 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Force early fragment tests when the 3D method is enabled. The established pipeline cache takes care of recompiling if needed. This is implemented only on Vulkan to avoid invalidating the shader cache on OpenGL. | ||||
* | | | | codec: Fix `pragma GCC diagnostic pop` missing corresponding push | comex | 2020-11-26 | 1 | -0/+1 |
|/ / / | |||||
* | | | Merge pull request #4975 from comex/invalid-syncpoint-id | bunnei | 2020-11-26 | 1 | -11/+18 |
|\ \ \ | | | | | | | | | nvdrv, video_core: Don't index out of bounds when given invalid syncpoint ID | ||||
| * | | | nvdrv, video_core: Don't index out of bounds when given invalid syncpoint ID | comex | 2020-11-24 | 1 | -11/+18 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Use .at() instead of raw indexing when dealing with untrusted indices. - For the special case of WaitFence with syncpoint id UINT32_MAX, instead of crashing, log an error and ignore. This is what I get when running Super Mario Maker 2. | ||||
* | | | | Merge pull request #4976 from comex/poll-events | Rodrigo Locatti | 2020-11-26 | 2 | -4/+2 |
|\ \ \ \ | |_|/ / |/| | | | Overhaul EmuWindow::PollEvents to fix yuzu-cmd calling SDL_PollEvents off main thread | ||||
| * | | | Overhaul EmuWindow::PollEvents to fix yuzu-cmd calling SDL_PollEvents off main thread | comex | 2020-11-23 | 2 | -4/+2 |
| |/ / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | EmuWindow::PollEvents was called from the GPU thread (or the CPU thread in sync-GPU mode) when swapping buffers. It had three implementations: - In GRenderWindow, it didn't actually poll events, just set a flag and emit a signal to indicate that a frame was displayed. - In EmuWindow_SDL2_Hide, it did nothing. - In EmuWindow_SDL2, it did call SDL_PollEvents, but this is wrong because SDL_PollEvents is supposed to be called on the thread that set up video - in this case, the main thread, which was sleeping in a busyloop (regardless of whether sync-GPU was enabled). On macOS this causes a crash. To fix this: - Rename EmuWindow::PollEvents to OnFrameDisplayed, and give it a default implementation that does nothing. - In EmuWindow_SDL2, do not override OnFrameDisplayed, but instead have the main thread call SDL_WaitEvent in a loop. | ||||
* | | | cleanup unneeded comments and newlines | ameerj | 2020-11-25 | 1 | -6/+0 |
| | | | |||||
* | | | Refactor MaxwellToSpirvComparison. Use Common::BitCast | ameerj | 2020-11-25 | 3 | -31/+34 |
| | | | | | | | | | | | | Co-Authored-By: Rodrigo Locatti <reinuseslisp@airmail.cc> | ||||
* | | | Address PR feedback from Rein | ameerj | 2020-11-25 | 5 | -40/+31 |
| | | | |||||
* | | | vulkan_renderer: Alpha Test Culling Implementation | ameerj | 2020-11-25 | 5 | -2/+76 |
| | | | | | | | | | | | | Used by various textures in many titles, e.g. SSBU menu. | ||||
* | | | Merge pull request #3681 from lioncash/component | Rodrigo Locatti | 2020-11-24 | 1 | -2/+2 |
|\ \ \ | |/ / |/| | | decoder/image: Fix incorrect G24R8 component sizes in GetComponentSize() | ||||
| * | | decode/image: Fix typo in assert in GetComponentSize() | Lioncash | 2020-04-16 | 1 | -3/+3 |
| | | | |||||
| * | | decoder/image: Fix incorrect G24R8 component sizes in GetComponentSize() | Lioncash | 2020-04-16 | 1 | -2/+2 |
| | | | | | | | | | | | | The components' sizes were mismatched. This corrects that. | ||||
* | | | Merge pull request #4954 from lioncash/compare | Morph | 2020-11-22 | 1 | -1/+1 |
|\ \ \ | | | | | | | | | gl_rasterizer: Make floating-point literal a float | ||||
| * | | | gl_rasterizer: Make floating-point literal a float | Lioncash | 2020-11-20 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | Gets rid of an unnecessary expansion from float to double. | ||||
* | | | | Merge pull request #4955 from lioncash/move3 | bunnei | 2020-11-21 | 1 | -19/+13 |
|\ \ \ \ | | | | | | | | | | | async_shaders: std::move data within QueueVulkanShader() | ||||
| * | | | | async_shaders: emplace threads into the worker thread vector | Lioncash | 2020-11-20 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | Same behavior, but constructs the threads in place instead of moving them. | ||||
| * | | | | async_shaders: Simplify implementation of GetCompletedWork() | Lioncash | 2020-11-20 | 1 | -2/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | This is equivalent to moving all the contents and then clearing the vector. This avoids a redundant allocation. | ||||
| * | | | | async_shaders: Simplify moving data into the pending queue | Lioncash | 2020-11-20 | 1 | -13/+8 |
| | | | | | |||||
| * | | | | async_shaders: std::move data within QueueVulkanShader() | Lioncash | 2020-11-20 | 1 | -2/+2 |
| |/ / / | | | | | | | | | | | | | | | | | | | | | | | | | Same behavior, but avoids redundant copies. While we're at it, we can simplify the pushing of the parameters into the pending queue. | ||||
* | | | | Merge pull request #4957 from ReinUsesLisp/alpha-test-rt | LC | 2020-11-21 | 1 | -4/+0 |
|\ \ \ \ | | | | | | | | | | | gl_rasterizer: Remove warning of untested alpha test | ||||
| * | | | | gl_rasterizer: Remove warning of untested alpha test | ReinUsesLisp | 2020-11-21 | 1 | -4/+0 |
| | | | | | | | | | | | | | | | | | | | | Alpha test has been proven to only affect the first render target. | ||||
* | | | | | Merge pull request #4953 from lioncash/shader-shadow | bunnei | 2020-11-21 | 1 | -88/+96 |
|\ \ \ \ \ | |/ / / / |/| | | | | shader_bytecode: Eliminate variable shadowing | ||||
| * | | | | shader_bytecode: Make use of [[nodiscard]] where applicable | Lioncash | 2020-11-20 | 1 | -73/+79 |
| | | | | | | | | | | | | | | | | | | | | Ensures that all queried values are made use of. | ||||
| * | | | | shader_bytecode: Eliminate variable shadowing | Lioncash | 2020-11-20 | 1 | -15/+17 |
| |/ / / | |||||
* | | | | Merge pull request #4308 from ReinUsesLisp/maxwell-3d-funcs | Rodrigo Locatti | 2020-11-20 | 2 | -151/+122 |
|\ \ \ \ | |/ / / |/| | | | maxwell_3d: Move code to separate functions and insert instead of push_back | ||||
| * | | | maxwell_3d: Use insert instead of loop push_back | ReinUsesLisp | 2020-11-11 | 1 | -3/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | This reduces the overhead of bounds checking on each element. It won't reduce the cost of allocation because usually this vector's capacity is usually large enough to hold whatever we push to it. | ||||
| * | | | maxwell_3d: Move code to separate functions | ReinUsesLisp | 2020-11-11 | 2 | -151/+124 |
| | | | | | | | | | | | | | | | | | | | | Deduplicate some code and put it in separate functions so it's easier to understand and profile. | ||||
* | | | | rasterizer_interface: Make use of [[nodiscard]] where applicable | Lioncash | 2020-11-17 | 1 | -8/+9 |
| | | | | |||||
* | | | | render_base: Make use of [[nodiscard]] where applicable | Lioncash | 2020-11-17 | 1 | -11/+11 |
| | | | | |||||
* | | | | gpu: Make use of [[nodiscard]] where applicable | Lioncash | 2020-11-17 | 1 | -31/+35 |
|/ / / | |||||
* | | | video_core: dma_pusher: Remove integrity check on command lists. | bunnei | 2020-11-07 | 2 | -28/+1 |
| | | | | | | | | | | | | - This seems to cause softlocks in Breath of the Wild. | ||||
* | | | Merge pull request #4891 from lioncash/clang2 | bunnei | 2020-11-06 | 2 | -5/+4 |
|\ \ \ | | | | | | | | | General: Fix clang build | ||||
| * | | | General: Fix clang build | Lioncash | 2020-11-05 | 2 | -5/+4 |
| | | | | | | | | | | | | | | | | Allows building on clang to work again | ||||
* | | | | Merge pull request #4854 from ReinUsesLisp/cube-array-shadow | bunnei | 2020-11-06 | 3 | -25/+37 |
|\ \ \ \ | |/ / / |/| | | | shader: Partially implement texture cube array shadow | ||||
| * | | | shader: Partially implement texture cube array shadow | ReinUsesLisp | 2020-10-28 | 3 | -25/+37 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This implements texture cube arrays with shadow comparisons but doesn't fix the asserts related to it. Fixes out of bounds reads on swizzle constructors and makes them use bounds checked ::at instead of the unsafe operator[]. | ||||
* | | | | Merge pull request #4858 from lioncash/initializer | bunnei | 2020-11-04 | 3 | -2/+10 |
|\ \ \ \ | | | | | | | | | | | General: Resolve a few missing initializer warnings | ||||
| * | | | | General: Resolve a few missing initializer warnings | Lioncash | 2020-10-30 | 3 | -2/+10 |
| | | | | | | | | | | | | | | | | | | | | Resolves a few -Wmissing-initializer warnings. | ||||
* | | | | | Merge pull request #4869 from bunnei/improve-gpu-sync | Chloe | 2020-11-04 | 4 | -53/+149 |
|\ \ \ \ \ | | | | | | | | | | | | | Improvements to GPU synchronization & various refactoring | ||||
| * | | | | | video_core: dma_pusher: Add support for integrity checks. | bunnei | 2020-11-01 | 2 | -0/+27 |
| | | | | | | | | | | | | | | | | | | | | | | | | - Log corrupted command lists, rather than crash. | ||||
| * | | | | | video_core: dma_pusher: Add support for prefetched command lists. | bunnei | 2020-11-01 | 2 | -25/+52 |
| | | | | | | |||||
| * | | | | | video_core: gpu: Implement WaitFence and IncrementSyncPoint. | bunnei | 2020-11-01 | 3 | -28/+70 |
| | | | | | | |||||
* | | | | | | Merge pull request #4874 from lioncash/nodiscard2 | bunnei | 2020-11-04 | 7 | -16/+16 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | nvdec: Make use of [[nodiscard]] where applicable | ||||
| * | | | | | | nvdec: Make use of [[nodiscard]] where applicable | Lioncash | 2020-11-02 | 7 | -16/+16 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Prevents bugs from occurring where the results of a function are accidentally discarded | ||||
* | | | | | | | Merge pull request #4865 from ameerj/async-threadcount | bunnei | 2020-11-01 | 1 | -8/+9 |
|\ \ \ \ \ \ \ | |_|/ / / / / |/| | | | | | | async_shaders: Increase Async worker thread count for >8 thread cpus | ||||
| * | | | | | | async_shaders: Increase Async worker thread count for 8+ thread cpus | ameerj | 2020-10-29 | 1 | -8/+9 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | Adds 1 async worker thread for every 2 available threads above 8 | ||||
* | | | | | | | Merge pull request #4853 from ReinUsesLisp/fcmp-imm | bunnei | 2020-10-31 | 2 | -1/+4 |
|\ \ \ \ \ \ \ | |_|/ / / / / |/| | | | | | | shader/arithmetic: Implement FCMP immediate + register variant | ||||
| * | | | | | | shader/arithmetic: Implement FCMP immediate + register variant | ReinUsesLisp | 2020-10-28 | 2 | -1/+4 |
| | |_|/ / / | |/| | | | | | | | | | | | | | | | | Trivially add the encoding for this. | ||||
* | | | | | | vp9: Be explicit with copy and move operators | Lioncash | 2020-10-30 | 1 | -0/+18 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | It's deprecated in the language to autogenerate these if the destructor for a type is specified, so we can explicitly specify how we want these to be generated. | ||||
* | | | | | | vp9: Mark functions with [[nodiscard]] where applicable | Lioncash | 2020-10-30 | 2 | -13/+13 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Prevents values from mistakenly being discarded in cases where it's a bug to do so. | ||||
* | | | | | | vp9: Provide a default initializer for "hidden" member | Lioncash | 2020-10-30 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The API of VP9 exposes a WasFrameHidden() function which accesses this member. Given the constructor previously didn't initialize this member, it's a potential vector for an uninitialized read. Instead, we can initialize this to a deterministic value to prevent that from occurring. | ||||
* | | | | | | vp9: Make some member functions internally linked | Lioncash | 2020-10-30 | 2 | -58/+54 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | These helper functions don't directly modify any member state and can be hidden from view. | ||||
* | | | | | | Merge pull request #4837 from lioncash/nvdec-2 | bunnei | 2020-10-29 | 13 | -88/+81 |
|\ \ \ \ \ \ | |_|/ / / / |/| | | | | | nvdec: Minor tidying up | ||||
| * | | | | | h264: Make WriteUe take a u32 | Lioncash | 2020-10-27 | 2 | -7/+8 |
| | | | | | | | | | | | | | | | | | | | | | | | | Enforces the type of the desired value in calling code. | ||||
| * | | | | | vp9: std::move buffer within ComposeFrameHeader() | Lioncash | 2020-10-27 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | We can move the buffer here to avoid a heap reallocation | ||||
| * | | | | | vp9: Remove dead code | Lioncash | 2020-10-27 | 1 | -6/+0 |
| | | | | | | |||||
| * | | | | | vp9: Join declarations with assignments | Lioncash | 2020-10-27 | 1 | -7/+8 |
| | | | | | | |||||
| * | | | | | vp9: Remove pessimizing moves | Lioncash | 2020-10-27 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | The move will already occur without std::move. | ||||
| * | | | | | vp9: Resolve variable shadowing | Lioncash | 2020-10-27 | 1 | -4/+4 |
| | | | | | | |||||
| * | | | | | nvdec: Tidy up header includes | Lioncash | 2020-10-27 | 13 | -62/+59 |
| | | | | | | | | | | | | | | | | | | | | | | | | Prevents a few unnecessary inclusions. | ||||
* | | | | | | Merge pull request #4838 from lioncash/syncmgr | bunnei | 2020-10-29 | 2 | -9/+9 |
|\ \ \ \ \ \ | |_|_|/ / / |/| | | | | | sync_manager: Amend parameter order of calls to SyncptIncr constructor | ||||
| * | | | | | sync_manager: Amend parameter order of calls to SyncptIncr constructor | Lioncash | 2020-10-27 | 2 | -9/+9 |
| |/ / / / | | | | | | | | | | | | | | | | Corrects some cases where the arguments would be incorrectly swapped. | ||||
* | / / / | video_core: cdma_pusher: Add missing LOG_DEBUG field in ExecuteCommand. | bunnei | 2020-10-29 | 1 | -1/+1 |
| |/ / / |/| | | | |||||
* | | | | Merge pull request #4848 from ReinUsesLisp/type-limits | LC | 2020-10-28 | 2 | -1/+2 |
|\ \ \ \ | | | | | | | | | | | video_core: Enforce -Werror=type-limits | ||||
| * | | | | video_core: Enforce -Werror=type-limits | ReinUsesLisp | 2020-10-28 | 2 | -1/+2 |
| |/ / / | | | | | | | | | | | | | Silences one warning and avoids introducing more in the future. | ||||
* / / / | video_core: Enforce -Wredundant-move and -Wpessimizing-move | ReinUsesLisp | 2020-10-28 | 4 | -4/+5 |
|/ / / | | | | | | | | | | Silence three warnings and make them errors to avoid introducing more in the future. | ||||
* | | | video_core: NVDEC Implementation | ameerj | 2020-10-27 | 30 | -22/+3311 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This commit aims to implement the NVDEC (Nvidia Decoder) functionality, with video frame decoding being handled by the FFmpeg library. The process begins with Ioctl commands being sent to the NVDEC and VIC (Video Image Composer) emulated devices. These allocate the necessary GPU buffers for the frame data, along with providing information on the incoming video data. A Submit command then signals the GPU to process and decode the frame data. To decode the frame, the respective codec's header must be manually composed from the information provided by NVDEC, then sent with the raw frame data to the ffmpeg library. Currently, H264 and VP9 are supported, with VP9 having some minor artifacting issues related mainly to the reference frame composition in its uncompressed header. Async GPU is not properly implemented at the moment. Co-Authored-By: David <25727384+ogniK5377@users.noreply.github.com> | ||||
* | | | Merge pull request #4706 from ReinUsesLisp/cmake-host-shaders | bunnei | 2020-10-23 | 2 | -12/+7 |
|\ \ \ | | | | | | | | | video_core: Fix instances where msbuild always regenerated host shaders | ||||
| * | | | video_core: Fix instances where msbuild always regenerated host shaders | ReinUsesLisp | 2020-09-24 | 2 | -12/+7 |
| | | | | | | | | | | | | | | | | | | | | | | | | When HEADER_GENERATOR was included in the DEPENDS section of custom commands, msbuild assumed this was always modified. Changing this file is not common so we can remove it from there. | ||||
* | | | | video_core: Conditially activate relevant compiler warnings | Lioncash | 2020-10-21 | 1 | -2/+4 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | These compiler flags aren't shared with clang, so specifying these flags unconditionally can lead to a bit of warning spam. While we're in the area, we can also enable -Wunused-but-set-parameter given this is almost always a bug. | ||||
* | | | | gl_arb_decompiler: Implement robust buffer operations | ReinUsesLisp | 2020-10-20 | 3 | -33/+54 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This emulates the behavior we get on GLSL with regular SSBOs with a pointer + length pair. It aims to be consistent with the crashes we might get. Out of bounds stores are ignored. Atomics are ignored and return zero. Reads return zero. | ||||
* | | | | Merge pull request #4204 from ReinUsesLisp/vulkan-1.0 | bunnei | 2020-10-19 | 7 | -58/+92 |
|\ \ \ \ | | | | | | | | | | | renderer_vulkan: Create and properly use Vulkan 1.0 instances when 1.1 is not available | ||||
| * | | | | vk_device: Use Vulkan 1.0 properly | ReinUsesLisp | 2020-08-20 | 5 | -52/+66 |
| | | | | | | | | | | | | | | | | | | | | | | | | | Enable the required capabilities to use Vulkan 1.0 without validation errors and disable those that are not compatible with it. | ||||
| * | | | | renderer_vulkan: Create a Vulkan 1.0 instance when 1.1 is not available | ReinUsesLisp | 2020-08-20 | 3 | -6/+26 |
| | | | | | | | | | | | | | | | | | | | | | | | | | This commit doesn't make yuzu compatible with Vulkan 1.0 yet, it only creates an 1.0 instance. | ||||
* | | | | | Merge pull request #4782 from ReinUsesLisp/remove-dyn-primitive | bunnei | 2020-10-18 | 6 | -26/+7 |
|\ \ \ \ \ | | | | | | | | | | | | | vk_graphics_pipeline: Manage primitive topology as fixed state | ||||
| * | | | | | vk_graphics_pipeline: Manage primitive topology as fixed state | ReinUsesLisp | 2020-10-13 | 6 | -26/+7 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Vulkan has requirements for primitive topologies that don't play nicely with yuzu's. Since it's only 4 bits, we can move it to fixed state without changing the size of the pipeline key. - Fixes a regression on recent Nvidia drivers on Fire Emblem: Three Houses. | ||||
* | | | | | | Merge pull request #4772 from goldenx86/block-rdna | bunnei | 2020-10-15 | 1 | -0/+24 |
|\ \ \ \ \ \ | |/ / / / / |/| | | | | | vk_device: Block VK_EXT_extended_dynamic_state for RDNA devices | ||||
| * | | | | | vk_device: Block VK_EXT_extended_dynamic_state for RDNA devices | goldenx86 | 2020-10-09 | 1 | -0/+24 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | RDNA devices seem to crash when using VK_EXT_extended_dynamic_state in the latest 20.9.2 proprietary Windows drivers. As a workaround, for now we block device names corresponding to current RDNA released products. | ||||
* | | | | | | Merge pull request #4766 from ReinUsesLisp/tmml-cube | bunnei | 2020-10-12 | 1 | -19/+22 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | shader/texture: Implement CUBE texture type for TMML and fix arrays | ||||
| * | | | | | | shader/texture: Implement CUBE texture type for TMML and fix arrays | ReinUsesLisp | 2020-10-08 | 1 | -19/+22 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | TMML takes an array argument that has no known meaning, this one appears as the first component in gpr8 followed by s, t and r. Skip this component when arrays are being used. Also implement CUBE texture types. - Used by Pikmin 3: Deluxe Demo. | ||||
* | | | | | | | video_core: Enforce -Wclass-memaccess | ReinUsesLisp | 2020-10-09 | 2 | -7/+7 |
| | | | | | | | |||||
* | | | | | | | Merge pull request #4771 from ReinUsesLisp/warn-unused-var | LC | 2020-10-09 | 3 | -4/+7 |
|\ \ \ \ \ \ \ | |_|/ / / / / |/| | | | | | | video_core: Enforce -Wunused-variable and -Wunused-but-set-variable | ||||
| * | | | | | | video_core: Enforce -Wunused-variable and -Wunused-but-set-variable | ReinUsesLisp | 2020-10-03 | 3 | -4/+7 |
| |/ / / / / | |||||
* / / / / / | renderer_vulkan/wrapper: Fix physical device sorting | ReinUsesLisp | 2020-10-07 | 1 | -13/+35 |
|/ / / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The old code had a sort function that was invalid and it didn't work as expected when the base vector had a different order (e.g. renderdoc was attached). This sorts devices as expected and fixes a debug assert on MSVC. | ||||
* | | | | | Remove ext_extended_dynamic_state blacklist | Matías Locatti | 2020-09-30 | 1 | -8/+0 |
| | | | | | | | | | | | | | | | Latest AMD 20.9.2 driver fixed this, there's no reason to keep it blocked, as the previous stable signed driver release doesn't include the extension. | ||||
* | | | | | Merge pull request #4724 from lat9nq/fix-vulkan-nvidia-allocate-2 | Rodrigo Locatti | 2020-09-27 | 1 | -1/+2 |
|\ \ \ \ \ | | | | | | | | | | | | | vk_stream_buffer: Fix initializing Vulkan with NVIDIA on Linux | ||||
| * | | | | | vk_stream_buffer: Fix initializing Vulkan with NVIDIA on Linux | lat9nq | 2020-09-25 | 1 | -1/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | The previous fix only partially solved the issue, as only certain GPUs that needed 9 or less MiB subtracted would work (i.e. GTX 980 Ti, GT 730). This takes from DXVK's example to divide `heap_size` by 2 to determine `allocable_size`. Additionally tested on my Quadro K4200, which previously required setting it to 12 to boot. | ||||
* | | | | | | Merge pull request #4703 from lioncash/desig7 | bunnei | 2020-09-27 | 2 | -26/+26 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | shader/registry: Make use of designated initializers where applicable | ||||
| * | | | | | | shader/registry: Silence a -Wshadow warning | Lioncash | 2020-09-23 | 2 | -5/+5 |
| | | | | | | | |||||
| * | | | | | | shader/registry: Remove unnecessary namespace qualifiers | Lioncash | 2020-09-23 | 1 | -5/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | Using statements already make these unnecessary. | ||||
| * | | | | | | shader/registry: Make use of designated initializers where applicable | Lioncash | 2020-09-23 | 1 | -17/+19 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | Same behavior, less repetition. | ||||
* | | | | | | | vk_command_pool: Move definition of Pool into the cpp file | Lioncash | 2020-09-25 | 2 | -4/+6 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Allows the implementation details to be changed without recompiling any files that include this header. | ||||
* | | | | | | | vk_command_pool: Make use of override on destructor | Lioncash | 2020-09-25 | 1 | -1/+1 |
| | | | | | | | |||||
* | | | | | | | vk_command_pool: Add missing header guard | Lioncash | 2020-09-25 | 1 | -0/+2 |
| |/ / / / / |/| | | | | | |||||
* | | | | | | Merge pull request #4711 from lioncash/move5 | bunnei | 2020-09-25 | 1 | -16/+19 |
|\ \ \ \ \ \ | |_|_|_|_|/ |/| | | | | | arithmetic_integer_immediate: Make use of std::move where applicable | ||||
| * | | | | | arithmetic_integer_immediate: Make use of std::move where applicable | Lioncash | 2020-09-24 | 1 | -16/+19 |
| |/ / / / | | | | | | | | | | | | | | | | | | | | | Same behavior, minus any redundant atomic reference count increments and decrements. | ||||
* | | | | | Merge pull request #4674 from ReinUsesLisp/timeline-semaphores | bunnei | 2020-09-24 | 42 | -814/+647 |
|\ \ \ \ \ | |_|_|/ / |/| | | | | renderer_vulkan: Make unconditional use of VK_KHR_timeline_semaphore | ||||
| * | | | | vk_query_cache: Hack counter destructor to avoid reserving queries | ReinUsesLisp | 2020-09-19 | 1 | -1/+10 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This is a hack to destroy all HostCounter instances before the base class destructor is called. The query cache should be redesigned to have a proper ownership model instead of using shared pointers. For now, destroy the host counter hierarchy from the derived class destructor. | ||||
| * | | | | renderer_vulkan: Make unconditional use of VK_KHR_timeline_semaphore | ReinUsesLisp | 2020-09-19 | 42 | -814/+638 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This reworks how host<->device synchronization works on the Vulkan backend. Instead of "protecting" resources with a fence and signalling these as free when the fence is known to be signalled by the host GPU, use timeline semaphores. Vulkan timeline semaphores allow use to work on a subset of D3D12 fences. As far as we are concerned, timeline semaphores are a value set by the host or the device that can be waited by either of them. Taking advantange of this, we can have a monolithically increasing atomic value for each submission to the graphics queue. Instead of protecting resources with a fence, we simply store the current logical tick (the atomic value stored in CPU memory). When we want to know if a resource is free, it can be compared to the current GPU tick. This greatly simplifies resource management code and the free status of resources should have less false negatives. To workaround bugs in validation layers, when these are attached there's a thread waiting for timeline semaphores. | ||||
* | | | | | control_flow: emplace elements in place within TryQuery() | Lioncash | 2020-09-23 | 1 | -6/+6 |
| | | | | | | | | | | | | | | | | | | | | | | | | | Places data structures where they'll eventually be moved to to avoid needing to even move them in the first place. | ||||
* | | | | | control_flow: Make use of std::move in InsertBranch() | Lioncash | 2020-09-23 | 1 | -7/+8 |
| |/ / / |/| | | | | | | | | | | | Avoids unnecessary atomic increments and decrements. | ||||
* | | | | General: Make use of std::nullopt where applicable | Lioncash | 2020-09-22 | 7 | -32/+29 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Allows some implementations to avoid completely zeroing out the internal buffer of the optional, and instead only set the validity byte within the structure. This also makes it consistent how we return empty optionals. | ||||
* | | | | renderer_opengl: Remove emulated mailbox presentation | ReinUsesLisp | 2020-09-20 | 5 | -293/+22 |
| | | | | | | | | | | | | | | | | | | | | Emulated mailbox presentation was causing performance issues on Nvidia's OpenGL driver. Remove it. | ||||
* | | | | fermi_2d: Make use of designated initializers | Lioncash | 2020-09-18 | 2 | -8/+8 |
| | | | | | | | | | | | | | | | | | | | | Same behavior, less repetition. We can also ensure all members of Config are initialized. | ||||
* | | | | Merge pull request #4672 from lioncash/narrowing | Rodrigo Locatti | 2020-09-17 | 1 | -1/+1 |
|\ \ \ \ | | | | | | | | | | | decoder/texture: Eliminate narrowing conversion in GetTldCode() | ||||
| * | | | | decoder/texture: Eliminate narrowing conversion in GetTldCode() | Lioncash | 2020-09-17 | 1 | -1/+1 |
| |/ / / | | | | | | | | | | | | | The assignment was previously truncating a u64 value to a bool. | ||||
* / / / | decode/image: Eliminate switch fallthrough in DecodeImage() | Lioncash | 2020-09-17 | 1 | -0/+1 |
|/ / / | | | | | | | | | | | | | Fortunately this didn't result in any issues, given the block that code was falling through to would immediately break. | ||||
* | | | video_core: Enforce -Werror=switch | ReinUsesLisp | 2020-09-16 | 7 | -10/+59 |
| | | | | | | | | | | | | This forces us to fix all -Wswitch warnings in video_core. | ||||
* | | | video_core: Remove all Core::System references in renderer | ReinUsesLisp | 2020-09-06 | 49 | -629/+566 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | Now that the GPU is initialized when video backends are initialized, it's no longer needed to query components once the game is running: it can be done when yuzu is booting. This allows us to pass components between constructors and in the process remove all Core::System references in the video backend. | ||||
* | | | Merge pull request #4611 from lioncash/xbyak2 | bunnei | 2020-09-04 | 1 | -5/+5 |
|\ \ \ | | | | | | | | | externals: Update Xbyak to 5.96 | ||||
| * | | | externals: Update Xbyak to 5.96 | Lioncash | 2020-08-30 | 1 | -5/+5 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | I made a request on the Xbyak issue tracker to allow some constructors to be constexpr in order to avoid static constructors from needing to execute for some of our register constants. This request was implemented, so this updates Xbyak so that we can make use of it. | ||||
* | | | | Merge pull request #4575 from lioncash/async | bunnei | 2020-09-03 | 2 | -17/+15 |
|\ \ \ \ | | | | | | | | | | | async_shaders: Mark getters as const member functions | ||||
| * | | | | async_shaders: Mark getters as const member functions | Lioncash | 2020-08-24 | 2 | -17/+15 |
| | | | | | | | | | | | | | | | | | | | | While we're at it, we can also mark them as nodiscard. | ||||
* | | | | | vk_device: Fix driver id check on AMD for VK_EXT_extended_dynamic_state | ReinUsesLisp | 2020-08-31 | 1 | -6/+9 |
| |/ / / |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 'driver_id' can only be known on Vulkan 1.1 after creating a logical device. Move the driver id check to disable VK_EXT_extended_dynamic_state after the logical device is successfully initialized. The Vulkan device will have the extension enabled but it will not be used. | ||||
* | | | | vk_device: Blacklist AMD proprietary from VK_EXT_extended_dynamic_state | ReinUsesLisp | 2020-08-29 | 1 | -1/+6 |
| | | | | | | | | | | | | | | | | | | | | | | | | Vertex binding's <stride> is bugged on AMD's proprietary drivers when using VK_EXT_extended_dynamic_state. Blacklist it for now while we investigate how to report this issue to AMD. | ||||
* | | | | Merge pull request #4524 from lioncash/memory-log | bunnei | 2020-08-27 | 1 | -1/+2 |
|\ \ \ \ | | | | | | | | | | | shader/memory: Amend UNIMPLEMENTED_IF_MSG without a message | ||||
| * | | | | shader/memory: Amend UNIMPLEMENTED_IF_MSG without a message | Lioncash | 2020-08-14 | 1 | -1/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | We need to provide a message for this variant of the macro, so we can simply log out the type being used. | ||||
* | | | | | Merge pull request #4569 from ReinUsesLisp/glsl-cmake | bunnei | 2020-08-27 | 12 | -51/+127 |
|\ \ \ \ \ | | | | | | | | | | | | | video_core/host_shaders: Add CMake integration for string shaders | ||||
| * | | | | | video_core/host_shaders: Add CMake integration for string shaders | ReinUsesLisp | 2020-08-24 | 7 | -42/+106 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Add the necessary CMake code to copy the contents in a string source shader (GLSL or GLASM) to a header file then consumed by video_core files. This allows editting GLSL in its own files without having to maintain them in source files. For now, only OpenGL presentation shaders are moved, but we can add GLASM presentation shaders and static SPIR-V generation through glslangValidator in the future. | ||||
| * | | | | | gl_shader_util: Use std::string_view instead of star pointer | ReinUsesLisp | 2020-08-24 | 5 | -9/+21 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This allows us passing any type of string and hinting the length of the string to the OpenGL driver. | ||||
* | | | | | | Merge pull request #4555 from ReinUsesLisp/fix-primitive-topology | bunnei | 2020-08-27 | 3 | -13/+14 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | vk_state_tracker: Fix primitive topology | ||||
| * | | | | | | vk_state_tracker: Fix primitive topology | ReinUsesLisp | 2020-08-21 | 3 | -13/+14 |
| | |_|_|/ / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | State track the current primitive topology with a regular comparison instead of using dirty flags. This fixes a bug in dirty flags for this particular state and it also avoids unnecessary state changes as this property is stored in a frequently changed bit field. | ||||
* | | | | | | memory_manager: Make use of [[nodiscard]] in the interface | Lioncash | 2020-08-27 | 1 | -17/+17 |
| | | | | | | |||||
* | | | | | | memory_manager: Make operator+ const qualified | Lioncash | 2020-08-27 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | This doesn't modify member state, so it can be marked as const. | ||||
* | | | | | | Merge pull request #4574 from lioncash/const-fn | bunnei | 2020-08-25 | 2 | -3/+3 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | memory_manager: Mark IsGranularRange() as a const member function | ||||
| * | | | | | | memory_manager: Mark IsGranularRange() as a const member function | Lioncash | 2020-08-24 | 2 | -3/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | This doesn't modify internal member state, so it can be marked as const. | ||||
* | | | | | | | Merge pull request #4542 from ReinUsesLisp/gpu-init-base | bunnei | 2020-08-25 | 22 | -119/+172 |
|\ \ \ \ \ \ \ | |_|_|_|_|/ / |/| | | | | | | video_core: Initialize renderer with a GPU | ||||
| * | | | | | | video_core: Initialize renderer with a GPU | ReinUsesLisp | 2020-08-22 | 22 | -119/+172 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Add an extra step in GPU initialization to be able to initialize render backends with a valid GPU instance. | ||||
* | | | | | | | gl_texture_cache: Take std::string by reference in DecorateViewName() | Lioncash | 2020-08-24 | 2 | -2/+2 |
| |/ / / / / |/| | | | | | | | | | | | | | | | | | | | | | | | LabelGLObject takes a string_view, so we don't need to make copies of the std::string. | ||||
* | | | | | | video_core/fence_manager: Remove unnecessary includes | Lioncash | 2020-08-24 | 3 | -9/+4 |
| |_|/ / / |/| | | | | | | | | | | | | | | | | | | | Avoids pulling in unnecessary things that can cause rebuilds when they aren't required. | ||||
* | | | | | Merge pull request #4521 from lioncash/optionalcache | bunnei | 2020-08-22 | 1 | -11/+12 |
|\ \ \ \ \ | | | | | | | | | | | | | gl_shader_disk_cache: Make use of std::nullopt where applicable | ||||
| * | | | | | gl_shader_disk_cache: Make use of std::nullopt where applicable | Lioncash | 2020-08-14 | 1 | -11/+12 |
| | |_|/ / | |/| | | | | | | | | | | | | | | | | | | Allows the compiler to avoid unnecessarily zeroing out the internal buffer of std::optional on some implementations. | ||||
* | | | | | Merge pull request #4523 from lioncash/self-assign | bunnei | 2020-08-22 | 1 | -1/+0 |
|\ \ \ \ \ | |_|_|/ / |/| | | | | macro-interpreter: Resolve -Wself-assign-field warning | ||||
| * | | | | macro-interpreter: Resolve -Wself-assign-field warning | Lioncash | 2020-08-14 | 1 | -1/+0 |
| |/ / / | | | | | | | | | | | | | | | | | | | | | This was assigning the field to itself, which is a no-op. The size doesn't change between its initial assignment and this one, so this is a safe change to make. | ||||
* | | | | Merge pull request #4546 from lioncash/telemetry | bunnei | 2020-08-20 | 2 | -4/+5 |
|\ \ \ \ | | | | | | | | | | | common/telemetry: Migrate namespace into the Common namespace | ||||
| * | | | | common/telemetry: Migrate namespace into the Common namespace | Lioncash | 2020-08-18 | 2 | -4/+5 |
| | | | | | | | | | | | | | | | | | | | | | | | | | Migrates the Telemetry namespace into the Common namespace to make the code consistent with the rest of our common code. | ||||
* | | | | | Merge pull request #4522 from lioncash/vulk-copy | bunnei | 2020-08-19 | 1 | -1/+1 |
|\ \ \ \ \ | |/ / / / |/| | | | | vulkan/wrapper: Avoid unnecessary copy in EnumerateInstanceExtensionProperties() | ||||
| * | | | | vulkan/wrapper: Avoid unnecessary copy in EnumerateInstanceExtensionProperties() | Lioncash | 2020-08-14 | 1 | -1/+1 |
| |/ / / | | | | | | | | | | | | | | | | | Given this is implicitly creating a std::optional, we can move the vector into it. | ||||
* | | | | Merge pull request #4535 from lioncash/fileutil | bunnei | 2020-08-18 | 4 | -30/+30 |
|\ \ \ \ | | | | | | | | | | | common/fileutil: Convert namespace to Common::FS | ||||
| * | | | | common/fileutil: Convert namespace to Common::FS | Lioncash | 2020-08-16 | 4 | -30/+30 |
| | |/ / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Migrates a remaining common file over to the Common namespace, making it consistent with the rest of common files. This also allows for high-traffic FS related code to alias the filesystem function namespace as namespace FS = Common::FS; for more concise typing. | ||||
* | | | | Merge pull request #4443 from ameerj/vk-async-shaders | David | 2020-08-17 | 14 | -87/+209 |
|\ \ \ \ | | | | | | | | | | | vulkan_renderer: Async shader/graphics pipeline compilation | ||||
| * | | | | Remove unneeded newlines, optional Registry in shader params | ameerj | 2020-08-16 | 5 | -14/+9 |
| | | | | | | | | | | | | | | | | | | | | Addressing feedback from Rodrigo | ||||
| * | | | | Morph: Update worker allocation comment | Ameer J | 2020-08-16 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | Co-authored-by: Morph <39850852+Morph1984@users.noreply.github.com> | ||||
| * | | | | move thread 1/4 count computation into allocate workers method | ameerj | 2020-08-16 | 4 | -23/+14 |
| | | | | | |||||
| * | | | | Address feedback, add shader compile notifier, update setting text | ameerj | 2020-08-16 | 8 | -161/+116 |
| | | | | | |||||
| * | | | | Vk Async Worker directly emplace in cache | ameerj | 2020-08-16 | 3 | -58/+41 |
| | | | | | |||||
| * | | | | Address feedback. Bruteforce delete duplicates | ameerj | 2020-08-16 | 7 | -80/+116 |
| | | | | | |||||
| * | | | | Vk Async pipeline compilation | ameerj | 2020-08-16 | 13 | -20/+182 |
| |/ / / | |||||
* | | | | Merge pull request #4520 from lioncash/pessimize | David | 2020-08-17 | 1 | -2/+2 |
|\ \ \ \ | |/ / / |/| | | | async_shaders: Resolve -Wpessimizing-move warning | ||||
| * | | | async_shaders: Resolve -Wpessimizing-move warning | Lioncash | 2020-08-14 | 1 | -2/+2 |
| |/ / | | | | | | | | | | | | | Prevents pessimization of the move constructor (which thankfully didn't actually happen in practice here, given std::thread isn't copyable). | ||||
* | | | Merge pull request #4528 from lioncash/discard | bunnei | 2020-08-16 | 1 | -1/+2 |
|\ \ \ | | | | | | | | | common: Make use of [[nodiscard]] where applicable | ||||
| * | | | common/compression: Roll back std::span changes | Lioncash | 2020-08-15 | 1 | -1/+2 |
| | | | | | | | | | | | | | | | | Seems like all compilers don't support std::span yet. | ||||
* | | | | Merge pull request #4519 from lioncash/semi | bunnei | 2020-08-16 | 1 | -1/+1 |
|\ \ \ \ | |/ / / |/| | | | maxwell_3d: Resolve -Wextra-semi warning | ||||
| * | | | maxwell_3d: Resolve -Wextra-semi warning | Lioncash | 2020-08-14 | 1 | -1/+1 |
| |/ / | | | | | | | | | | Semicolons after a function definition aren't necessary. | ||||
* | | | Merge pull request #4416 from lioncash/span | bunnei | 2020-08-15 | 1 | -2/+1 |
|\ \ \ | | | | | | | | | lz4_compression/zstd_compression: Make use of std::span in interfaces | ||||
| * | | | zstd_compression: Make use of std::span in interfaces | Lioncash | 2020-07-25 | 1 | -2/+1 |
| | | | | | | | | | | | | | | | | Allows condensing the data and size parameters into a single argument. | ||||
* | | | | Merge pull request #4453 from ReinUsesLisp/block-to-linear | bunnei | 2020-08-15 | 3 | -34/+34 |
|\ \ \ \ | |_|/ / |/| | | | textures/decoders: Fix block linear to pitch copies | ||||
| * | | | textures/decoders: Fix block linear to pitch copies | ReinUsesLisp | 2020-08-11 | 3 | -34/+34 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | There were two issues with block linear copies. First the swizzling was wrong and this commit reimplements them. The other issue was that these copies are generally used to download render targets from the GPU and yuzu was not downloading them from host GPU memory unless the extreme GPU accuracy setting was selected. This commit enables cached memory reads for all accuracy levels. - Fixes level thumbnails in Super Mario Maker 2. | ||||
* | | | | Merge pull request #4514 from Morph1984/worker-alloc | bunnei | 2020-08-13 | 1 | -1/+1 |
|\ \ \ \ | | | | | | | | | | | gl_shader_cache: Use std::max() for determining num_workers | ||||
| * | | | | gl_shader_cache: Use std::max() for determining num_workers | Morph | 2020-08-12 | 1 | -1/+1 |
| |/ / / | | | | | | | | | | | | | Does not allocate more threads than available in the host system for boot-time shader compilation and always allocates at least 1 thread if hardware_concurrency() returns 0. | ||||
* / / / | General: Tidy up clang-format warnings part 2 | Lioncash | 2020-08-13 | 3 | -31/+33 |
|/ / / | |||||
* | | | Merge pull request #4389 from ogniK5377/redundant-format-type | bunnei | 2020-08-07 | 1 | -1/+0 |
|\ \ \ | | | | | | | | | video_core: Remove redundant pixel format type | ||||
| * | | | video_core: Remove redundant pixel format type | David Marcec | 2020-07-21 | 1 | -1/+0 |
| | | | | | | | | | | | | | | | | We already get the format type before converting shadow formats and during shadow formats. | ||||
* | | | | Merge pull request #4430 from bunnei/new-gpu-vmm | bunnei | 2020-08-05 | 2 | -500/+204 |
|\ \ \ \ | | | | | | | | | | | hle: nvdrv: Rewrite of GPU memory management. | ||||
| * | | | | hle: nvdrv: Rewrite of GPU memory management. | bunnei | 2020-07-26 | 2 | -500/+204 |
| | |/ / | |/| | | |||||
* | | | | Merge pull request #4445 from Morph1984/async-threads | bunnei | 2020-08-05 | 1 | -9/+4 |
|\ \ \ \ | | | | | | | | | | | renderer_opengl: Use 1/4 of all threads for async shader compilation | ||||
| * | | | | renderer_opengl: Use 1/4 of all threads for async shader compilation | Morph | 2020-07-28 | 1 | -9/+4 |
| | | | | | |||||
* | | | | | Merge pull request #4469 from lioncash/missing | bunnei | 2020-08-04 | 6 | -3/+18 |
|\ \ \ \ \ | | | | | | | | | | | | | vk_texture_cache: Silence -Wmissing-field-initializer warnings | ||||
| * | | | | | vulkan: Silence more -Wmissing-field-initializer warnings | Lioncash | 2020-08-03 | 6 | -3/+18 |
| |/ / / / | |||||
* | | | | | yuzu: Resolve C++20 deprecation warnings related to lambda captures | Lioncash | 2020-08-03 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | C++20 deprecates capturing the this pointer via the '=' capture. Instead, we replace it or extend the capture specification. | ||||
* | | | | | Merge pull request #4392 from lioncash/guard | David | 2020-07-30 | 1 | -0/+2 |
|\ \ \ \ \ | | | | | | | | | | | | | compatible_formats: Add missing header guard | ||||
| * | | | | | compatible_formats: Add missing header guard | Lioncash | 2020-07-21 | 1 | -0/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | Prevents potential inclusion issues from occurring. | ||||
* | | | | | | Merge pull request #4396 from lioncash/comma | bunnei | 2020-07-30 | 1 | -45/+52 |
|\ \ \ \ \ \ | |_|/ / / / |/| | | | | | surface_params: Replace questionable usages of the comma operator with semicolons | ||||
| * | | | | | surface_params: Make use of designated initializers where applicable | Lioncash | 2020-07-21 | 1 | -38/+46 |
| | | | | | | | | | | | | | | | | | | | | | | | | Provides a convenient way to avoid unnecessary zero initializing. | ||||
| * | | | | | surface_params: Remove redundant assignment | Lioncash | 2020-07-21 | 1 | -1/+0 |
| | | | | | | | | | | | | | | | | | | | | | | | | This is a redundant assignment that can be removed. | ||||
| * | | | | | surface_params: Replace questionable usages of the comma operator with semicolons | Lioncash | 2020-07-21 | 1 | -9/+9 |
| | | | | | | | | | | | | | | | | | | | | | | | | These are bugs waiting to happen. | ||||
* | | | | | | Merge pull request #4419 from lioncash/initializer | bunnei | 2020-07-28 | 2 | -0/+4 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | vulkan: Resolve -Wmissing-field-initializer warnings | ||||
| * | | | | | | vulkan: Resolve -Wmissing-field-initializer warnings | Lioncash | 2020-07-25 | 2 | -0/+4 |
| | |_|/ / / | |/| | | | | |||||
* / | | | | | video_core/gpu: Correct the size of the puller registers | Billy Laws | 2020-07-26 | 1 | -2/+2 |
|/ / / / / | | | | | | | | | | | The puller register array is made up of u32s however the `NUM_REGS` value is the size in bytes, so switch it to avoid making the struct unnecessary large. Also fix a small typo in a comment. | ||||
* | | | | | Merge pull request #4393 from lioncash/unused5 | bunnei | 2020-07-25 | 1 | -4/+0 |
|\ \ \ \ \ | | | | | | | | | | | | | vk_rasterizer: Remove unused variable in Clear() | ||||
| * | | | | | vk_rasterizer: Remove unused variable in Clear() | Lioncash | 2020-07-21 | 1 | -4/+0 |
| | |/ / / | |/| | | | | | | | | | | | | | | | | | | The relevant values are already assigned further down in the lambda, so this can be removed entirely. | ||||
* | | | | | Merge pull request #4388 from lioncash/written | bunnei | 2020-07-24 | 1 | -6/+3 |
|\ \ \ \ \ | | | | | | | | | | | | | buffer_cache: Eliminate redundant map lookup in MarkRegionAsWritten() | ||||
| * | | | | | buffer_cache: Eliminate redundant map lookup in MarkRegionAsWritten() | Lioncash | 2020-07-20 | 1 | -6/+3 |
| |/ / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | We can make use of emplace()'s return value to determine whether or not we need to perform an increment. emplace() performs no insertion if an element already exist, so this can eliminate a find() call. | ||||
* | | | | | Merge pull request #4391 from lioncash/nrvo | bunnei | 2020-07-24 | 7 | -26/+26 |
|\ \ \ \ \ | | | | | | | | | | | | | video_core: Allow copy elision to take place where applicable | ||||
| * | | | | | video_core: Allow copy elision to take place where applicable | Lioncash | 2020-07-21 | 7 | -26/+26 |
| |/ / / / | | | | | | | | | | | | | | | | | | | | | Removes const from some variables that are returned from functions, as this allows the move assignment/constructors to execute for them. | ||||
* | | | | | Merge pull request #4394 from lioncash/unused6 | bunnei | 2020-07-24 | 8 | -33/+5 |
|\ \ \ \ \ | | | | | | | | | | | | | video_core: Remove unused variables | ||||
| * | | | | | video_core: Remove unused variables | Lioncash | 2020-07-21 | 8 | -33/+5 |
| |/ / / / | | | | | | | | | | | | | | | | Silences several compiler warnings about unused variables. | ||||
* | | | | | Merge pull request #4359 from ReinUsesLisp/clamp-shared | Rodrigo Locatti | 2020-07-21 | 6 | -9/+42 |
|\ \ \ \ \ | | | | | | | | | | | | | renderer_{opengl,vulkan}: Clamp shared memory to host's limit | ||||
| * | | | | | renderer_{opengl,vulkan}: Clamp shared memory to host's limit | ReinUsesLisp | 2020-07-16 | 6 | -9/+42 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This stops shaders from failing to build when the exceed host's shared memory size limit. An error is logged. | ||||
* | | | | | | Merge pull request #4360 from ReinUsesLisp/glasm-bar | Rodrigo Locatti | 2020-07-21 | 1 | -4/+0 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | gl_arb_decompiler: Execute BAR even when inside control flow | ||||
| * | | | | | | gl_arb_decompiler: Execute BAR even when inside control flow | ReinUsesLisp | 2020-07-16 | 1 | -4/+0 |
| |/ / / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Unlike GLSL, GLASM allows us to call BAR inside control flow. - Fixes graphical artifacts in Paper Mario. | ||||
* | | | | | | Merge pull request #4361 from ReinUsesLisp/lane-id | Rodrigo Locatti | 2020-07-21 | 1 | -2/+1 |
|\ \ \ \ \ \ | |_|_|/ / / |/| | | | | | decode/other: Implement S2R.LaneId | ||||
| * | | | | | decode/other: Implement S2R.LaneId | ReinUsesLisp | 2020-07-16 | 1 | -2/+1 |
| |/ / / / | | | | | | | | | | | | | | | | | | | | | | | | | | This maps to host's thread id. - Fixes graphical issues on Paper Mario. | ||||
* | | | | | Merge pull request #4324 from ReinUsesLisp/formats | bunnei | 2020-07-21 | 20 | -1100/+1111 |
|\ \ \ \ \ | |_|_|/ / |/| | | | | video_core: Fix, add and rename pixel formats | ||||
| * | | | | video_core: Rearrange pixel format names | ReinUsesLisp | 2020-07-13 | 19 | -1179/+1077 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Normalizes pixel format names to match Vulkan names. Previous to this commit pixel formats had no convention, leading to confusion and potential bugs. | ||||
| * | | | | video_core: Fix DXT4 and RGB565 | ReinUsesLisp | 2020-07-13 | 7 | -37/+31 |
| | | | | | |||||
| * | | | | video_core/format_lookup_table: Add formats with existing PixelFormat | ReinUsesLisp | 2020-07-13 | 1 | -1/+9 |
| | | | | | |||||
| * | | | | video_core: Fix B5G6R5_UNORM render target format | ReinUsesLisp | 2020-07-13 | 5 | -1/+10 |
| | | | | | |||||
| * | | | | video_core: Fix B5G6R5U | ReinUsesLisp | 2020-07-13 | 2 | -2/+2 |
| | | | | | |||||
| * | | | | video_core: Implement RGBA32_SINT render target | ReinUsesLisp | 2020-07-13 | 7 | -58/+71 |
| | | | | | |||||
| * | | | | video_core: Implement RGBA32_SINT render target | ReinUsesLisp | 2020-07-13 | 7 | -0/+13 |
| | | | | | |||||
| * | | | | video_core: Implement RGBA16_SINT render target | ReinUsesLisp | 2020-07-13 | 7 | -0/+13 |
| | | | | | |||||
| * | | | | video_core: Implement RGBA8_SINT render target | ReinUsesLisp | 2020-07-13 | 7 | -0/+13 |
| | | | | | |||||
| * | | | | video_core: Implement RG32_SINT render target | ReinUsesLisp | 2020-07-13 | 7 | -0/+13 |
| | | | | | |||||
| * | | | | video_core: Implement RG8_SINT render target and fix RG8_UINT | ReinUsesLisp | 2020-07-13 | 7 | -1/+14 |
| | | | | | |||||
| * | | | | video_core: Implement R8_SINT render target | ReinUsesLisp | 2020-07-13 | 7 | -0/+13 |
| | | | | | |||||
| * | | | | video_core: Implement R8_SNORM render target | ReinUsesLisp | 2020-07-13 | 7 | -0/+13 |
| | | | | | |||||
| * | | | | video_core/surface: Remove explicit values on PixelFormat's definition | ReinUsesLisp | 2020-07-13 | 1 | -80/+80 |
| | | | | | |||||
| * | | | | video_core/surface: Reorder render target to pixel format switch | ReinUsesLisp | 2020-07-13 | 1 | -53/+51 |
| | | | | | |||||
* | | | | | gl_arb_decompiler: Use NV_shader_buffer_{load,store} on assembly shaders | ReinUsesLisp | 2020-07-18 | 7 | -110/+173 |
| |_|/ / |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | NV_shader_buffer_{load,store} is a 2010 extension that allows GL applications to use what in Vulkan is known as physical pointers, this is basically C pointers. On GLASM these is exposed through the LOAD/STORE/ATOM instructions. Up until now, assembly shaders were using NV_shader_storage_buffer_object. These work fine, but have a (probably unintended) limitation that forces us to have the limit of a single stage for all shader stages. In contrast, with NV_shader_buffer_{load,store} we can pass GPU addresses to the shader through local parameters (GLASM equivalent uniform constants, or push constants on Vulkan). Local parameters have the advantage of being per stage, allowing us to generate code without worrying about binding overlaps. | ||||
* | | | | Merge pull request #4273 from ogniK5377/async-shaders-prod | bunnei | 2020-07-18 | 14 | -58/+584 |
|\ \ \ \ | | | | | | | | | | | video_core: Add asynchronous shader decompilation and compilation | ||||
| * | | | | Fix style issues | David Marcec | 2020-07-18 | 2 | -7/+13 |
| | | | | | |||||
| * | | | | Remove duplicate config | David Marcec | 2020-07-17 | 1 | -0/+1 |
| | | | | | |||||
| * | | | | Use conditional var | David Marcec | 2020-07-17 | 2 | -9/+15 |
| | | | | | |||||
| * | | | | Drop max workers from 8->2 for testing | David Marcec | 2020-07-17 | 1 | -1/+1 |
| | | | | | |||||
| * | | | | Rebase for per game settings | David Marcec | 2020-07-17 | 1 | -1/+1 |
| | | | | | |||||
| * | | | | async shaders | David Marcec | 2020-07-17 | 14 | -58/+571 |
| | | | | | |||||
* | | | | | Merge pull request #4364 from lioncash/desig5 | bunnei | 2020-07-18 | 19 | -664/+763 |
|\ \ \ \ \ | | | | | | | | | | | | | vulkan: Make use of designated initializers where applicable | ||||
| * | | | | | wrapper: Make use of designated initializers where applicable | Lioncash | 2020-07-17 | 1 | -56/+64 |
| | | | | | | |||||
| * | | | | | vk_texture_cache: Make use of designated initializers where applicable | Lioncash | 2020-07-17 | 1 | -96/+135 |
| | | | | | | |||||
| * | | | | | vk_swapchain: Make use of designated initializers where applicable | Lioncash | 2020-07-17 | 1 | -43/+51 |
| | | | | | | |||||
| * | | | | | vk_stream_buffer: Make use of designated initializers where applicable | Lioncash | 2020-07-17 | 1 | -19/+16 |
| | | | | | | |||||
| * | | | | | vk_staging_buffer_pool: Make use of designated initializers where applicable | Lioncash | 2020-07-17 | 1 | -13/+12 |
| | | | | | | |||||
| * | | | | | vk_shader_util: Make use of designated initializers where applicable | Lioncash | 2020-07-17 | 1 | -7/+7 |
| | | | | | | |||||
| * | | | | | vk_scheduler: Make use of designated initializers where applicable | Lioncash | 2020-07-17 | 1 | -27/+30 |
| | | | | | | |||||
| * | | | | | vk_sampler_cache: Make use of designated initializers where applicable | Lioncash | 2020-07-17 | 1 | -24/+27 |
| | | | | | | |||||
| * | | | | | vk_resource_manager: Make use of designated initializers where applicable | Lioncash | 2020-07-17 | 1 | -15/+14 |
| | | | | | | |||||
| * | | | | | vk_renderpass_cache: Make use of designated initializers where applicable | Lioncash | 2020-07-17 | 1 | -59/+70 |
| | | | | | | |||||
| * | | | | | vk_rasterizer: Make use of designated initializers where applicable | Lioncash | 2020-07-17 | 1 | -41/+47 |
| | | | | | | |||||
| * | | | | | vk_query_cache: Make use of designated initializers where applicable | Lioncash | 2020-07-17 | 1 | -8/+8 |
| | | | | | | |||||
| * | | | | | vk_pipeline_cache: Make use of designated initializers where applicable | Lioncash | 2020-07-17 | 1 | -31/+35 |
| | | | | | | |||||
| * | | | | | vk_memory_manager: Make use of designated initializers where applicable | Lioncash | 2020-07-17 | 1 | -7/+6 |
| | | | | | | |||||
| * | | | | | vk_image: Make use of designated initializers where applicable | Lioncash | 2020-07-17 | 1 | -15/+23 |
| | | | | | | |||||
| * | | | | | vk_descriptor_pool: Make use of designated initializers where applicable | Lioncash | 2020-07-17 | 1 | -15/+18 |
| | | | | | | |||||
| * | | | | | vk_compute_pipeline: Make use of designated initializers where applicable | Lioncash | 2020-07-16 | 1 | -63/+68 |
| | | | | | | |||||
| * | | | | | vk_compute_pass: Make use of designated initializers where applicable | Lioncash | 2020-07-16 | 1 | -95/+99 |
| | | | | | | | | | | | | | | | | | | | | | | | | Note: Some barriers can't be converted over yet, as they ICE MSVC. | ||||
| * | | | | | vk_buffer_cache: Make use of designated initializers where applicable | Lioncash | 2020-07-16 | 1 | -30/+33 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Note: An array within CopyFrom() cannot be converted over yet, as it ICEs MSVC when converted over. | ||||
* | | | | | | vk_device: Fix build error on old MSVC versions | ReinUsesLisp | 2020-07-18 | 1 | -3/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Designated initializers on old MSVC versions fail to build when they take the address of a constant. | ||||
* | | | | | | Merge pull request #4322 from ReinUsesLisp/fix-dynstate | bunnei | 2020-07-17 | 1 | -0/+1 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | vk_state_tracker: Fix dirty flags for stencil_enable on VK_EXT_extended_dynamic_state | ||||
| * | | | | | | vk_state_tracker: Fix dirty flags for stencil_enable on VK_EXT_extended_dynamic_state | ReinUsesLisp | 2020-07-13 | 1 | -0/+1 |
| | |_|/ / / | |/| | | | | | | | | | | | | | | | | | | | | | | Fixes a regression on any game using stencil on devices with VK_EXT_extended_dynamic_state. | ||||
* | | | | | | Merge pull request #4369 from lioncash/hle-macro | LC | 2020-07-17 | 1 | -10/+7 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | macro_hle: Remove unnecessary std::make_pair calls | ||||
| * | | | | | | macro_hle: Remove unnecessary static keywords | Lioncash | 2020-07-17 | 1 | -7/+4 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | These functions are already in an anonymous namespace which makes the functions internally linked. | ||||
| * | | | | | | macro_hle: Remove unnecessary std::make_pair calls | Lioncash | 2020-07-17 | 1 | -3/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The purpose of make_pair is generally to deduce the types within the pair without explicitly specifying the types, so these usages were generally unnecessary, particularly when the type is enforced by the array declaration. | ||||
* | | | | | | | Merge pull request #4340 from lioncash/remove | LC | 2020-07-17 | 1 | -2/+2 |
|\ \ \ \ \ \ \ | |_|_|_|/ / / |/| | | | | | | shader_cache: Make use of std::erase_if | ||||
| * | | | | | | shader_cache: Make use of std::erase_if | Lioncash | 2020-07-14 | 1 | -2/+2 |
| | |/ / / / | |/| | | | | | | | | | | | | | | | | | | | | | | Now that we use C++20, we can also make use of std::erase_if instead of needing to do the erase-remove idiom. | ||||
* | | | | | | Merge pull request #4368 from lioncash/macro | David | 2020-07-17 | 1 | -1/+2 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | macro: Resolve missing parameter in doxygen comment | ||||
| * | | | | | | macro: Resolve missing parameter in doxygen comment | Lioncash | 2020-07-17 | 1 | -1/+2 |
| | |/ / / / | |/| | | | | | | | | | | | | | | | | Resolves a -Wdocumentation warning. | ||||
* | | | | | | Merge pull request #4370 from lioncash/simplify | David | 2020-07-17 | 1 | -2/+1 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | macro_hle: Simplify shift expression in HLE_771BB18C62444DA0() | ||||
| * | | | | | | macro_hle: Simplify shift expression in HLE_771BB18C62444DA0() | Lioncash | 2020-07-17 | 1 | -2/+1 |
| |/ / / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Given the expression involves a 32-bit value, this simplifies down to just: 0x3ffffff. This is likely a remnant from testing that was never cleaned up. Resolves a -Wshift-overflow warning. | ||||
* / / / / / | vk_texture_cache: Amend mismatched access masks and indices in UploadBuffer | Lioncash | 2020-07-17 | 1 | -6/+4 |
|/ / / / / | | | | | | | | | | | | | | | | | | | | | Discovered while converting relevant parts of the codebase over to designated initializers. | ||||
* | / / / | vk_graphics_pipeline: Resolve narrowing warnings | Lioncash | 2020-07-17 | 1 | -2/+4 |
| |/ / / |/| | | | | | | | | | | | | | | | For whatever reason, VK_TRUE and VK_FALSE aren't defined as having a VkBool32 type, so we need to cast to it explicitly. | ||||
* | | | | Merge pull request #4333 from lioncash/desig3 | Rodrigo Locatti | 2020-07-16 | 1 | -198/+223 |
|\ \ \ \ | | | | | | | | | | | vk_graphics_pipeline: Make use of designated initializers where applicable | ||||
| * | | | | vk_graphics_pipeline: Make use of designated initializers where applicable | Lioncash | 2020-07-14 | 1 | -198/+223 |
| |/ / / | | | | | | | | | | | | | Avoids redundant variable name repetitions. | ||||
* | | | | Merge pull request #4332 from lioncash/vkdev | Rodrigo Locatti | 2020-07-16 | 1 | -124/+152 |
|\ \ \ \ | | | | | | | | | | | vk_device: Make use of designated initializers where applicable | ||||
| * | | | | vk_device: Make use of designated initializers where applicable | Lioncash | 2020-07-14 | 1 | -124/+152 |
| |/ / / | | | | | | | | | | | | | | | | | Avoids redundant repetitions of variable names, and allows assignment all in one statement. | ||||
* | | | | Merge pull request #4321 from lioncash/desig | bunnei | 2020-07-16 | 1 | -334/+384 |
|\ \ \ \ | |_|/ / |/| | | | vk_blit_screen: Make use of designated initializers where applicable | ||||
| * | | | vk_blit_screen: Make use of designated initializers where applicable | Lioncash | 2020-07-13 | 1 | -334/+384 |
| |/ / | | | | | | | | | | | | | Now that we make use of C++20, we can use designated initializers to make things a little nicer to read. | ||||
* | | | Merge pull request #4242 from ReinUsesLisp/maxwell-dma | bunnei | 2020-07-14 | 6 | -359/+468 |
|\ \ \ | |/ / |/| | | maxwell_dma: Match official doc and support pitch->voxel copies | ||||
| * | | video_core/textures: Add and use SwizzleSliceToVoxel, and minor style changes | ReinUsesLisp | 2020-07-10 | 5 | -84/+125 |
| | | | | | | | | | | | | | | | | | | | | | Change GOB sizes from free-functions to constexpr constants. Add SwizzleSliceToVoxel, a function that swizzles a 2D array of pixels into a 3D texture and use it for 3D copies. | ||||
| * | | maxwell_dma: Rename registers to match official docs and reorder | ReinUsesLisp | 2020-07-08 | 2 | -287/+355 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Rename registers in the MaxwellDMA class to match Nvidia's official documentation. This one can be found here: https://github.com/NVIDIA/open-gpu-doc/blob/master/classes/dma-copy/clb0b5.h While we are at it, reorganize the code in MaxwellDMA to be separated in different functions. | ||||
* | | | vk_rasterizer: Pass <pSizes> to CmdBindVertexBuffers2EXT | ReinUsesLisp | 2020-07-10 | 1 | -6/+6 |
| | | | | | | | | | | | | | | | This has been fixed in Nvidia's public beta driver 451.74. The previous beta driver will be broken, people using these will have to update. | ||||
* | | | Merge pull request #4283 from lat9nq/fix-linux-nvidia-vulkan | Rodrigo Locatti | 2020-07-10 | 1 | -1/+1 |
|\ \ \ | | | | | | | | | vk_stream_buffer: Prevent Vulkan crash in Linux on recent NVIDIA driver | ||||
| * | | | vk_stream_buffer: set allocable_size to 9 MiB | lat9nq | 2020-07-10 | 1 | -1/+1 |
| |/ / | | | | | | | | | | This solves the crash on Linux systems running the current Linux Long Lived branch nVidia driver. | ||||
* / / | configuration: implement per-game configurations (#4098) | lat9nq | 2020-07-10 | 11 | -21/+23 |
|/ / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * Switch game settings to use a pointer In order to add full per-game settings, we need to be able to tell yuzu to switch to using either the global or game configuration. Using a pointer makes it easier to switch. * configuration: add new UI without changing existing funcitonality The new UI also adds General, System, Graphics, Advanced Graphics, and Audio tabs, but as yet they do nothing. This commit keeps yuzu to the same functionality as originally branched. * configuration: Rename files These weren't included in the last commit. Now they are. * configuration: setup global configuration checkbox Global config checkbox now enables/disables the appropriate tabs in the game properties dialog. The use global configuration setting is now saved to the config, defaulting to true. This also addresses some changes requested in the PR. * configuration: swap to per-game config memory for properties dialog Does not set memory going in-game. Swaps to game values when opening the properties dialog, then swaps back when closing it. Uses a `memcpy` to swap. Also implements saving config files, limited to certain groups of configurations so as to not risk setting unsafe configurations. * configuration: change config interfaces to use config-specific pointers When a game is booted, we need to be able to open the configuration dialogs without changing the settings pointer in the game's emualtion. A new pointer specific to just the configuration dialogs can be used to separate changes to just those config dialogs without affecting the emulation. * configuration: boot a game using per-game settings Swaps values where needed to boot a game. * configuration: user correct config during emulation Creates a new pointer specifically for modifying the configuration while emulation is in progress. Both the regular configuration dialog and the game properties dialog now use the pointer Settings::config_values to focus edits to the correct struct. * settings: split Settings::values into two different structs By splitting the settings into two mutually exclusive structs, it becomes easier, as a developer, to determine how to use the Settings structs after per-game configurations is merged. Other benefits include only duplicating the required settings in memory. * settings: move use_docked_mode to Controls group `use_docked_mode` is set in the input settings and cannot be accessed from the system settings. Grouping it with system settings causes it to be saved with per-game settings, which may make transferring configs more difficult later on, especially since docked mode cannot be set from within the game properties dialog. * configuration: Fix the other yuzu executables and a regression In main.cpp, we have to get the title ID before the ROM is loaded, else the renderer will reflect only the global settings and now the user's game specific settings. * settings: use a template to duplicate memory for each setting Replaces the type of each variable in the Settings::Values struct with a new class that allows basic data reading and writing. The new struct Settings::Setting duplicates the data in memory and can manage global overrides per each setting. * configuration: correct add-ons config and swap settings when apropriate Any add-ons interaction happens directly through the global values struct. Swapping bewteen structs now also includes copying the necessary global configs that cannot be changed nor saved in per-game settings. General and System config menus now update based on whether it is viewing the global or per-game settings. * settings: restore old values struct No longer needed with the Settings::Setting class template. * configuration: implement hierarchical game properties dialog This sets the apropriate global or local data in each setting. * clang format * clang format take 2 can the docker container save this? * address comments and style issues * config: read and write settings with global awareness Adds new functions to read and write settings while keeping the global state in focus. Files now generated per-game are much smaller since often they only need address the global state. * settings: restore global state when necessary Upon closing a game or the game properties dialog, we need to restore all global settings to the original global state so that we can properly open the configuration dialog or boot a different game. * configuration: guard setting values incorrectly This disables setting values while a game is running if the setting is overwritten by a per game setting. * config: don't write local settings in the global config Simple guards to prevent writing the wrong settings in the wrong files. * configuration: add comments, assume less, and clang format No longer assumes that a disabled UI element means the global state is turned off, instead opting to directly answer that question. Still however assumes a game is running if it is in that state. * configuration: fix a logic error Should not be negated * restore settings' global state regardless of accept/cancel Fixes loading a properties dialog and causing the global config dialog to show local settings. * fix more logic errors Fixed the frame limit would set the global setting from the game properties dialog. Also strengthened the Settings::Setting member variables and simplified the logic in config reading (ReadSettingGlobal). * fix another logic error In my efforts to guard RestoreGlobalState, I accidentally negated the IsPowered condition. * configure_audio: set toggle_stretched_audio to tristate * fixed custom rtc and rng seed overwriting the global value * clang format * rebased * clang format take 4 * address my own review Basically revert unintended changes * settings: literal instead of casting "No need to cast, use 1U instead" Thanks, Morph! Co-authored-by: Morph <39850852+Morph1984@users.noreply.github.com> * Revert "settings: literal instead of casting " This reverts commit 95e992a87c898f3e882ffdb415bb0ef9f80f613f. * main: fix status buttons reporting wrong settings after stop emulation * settings: Log UseDockedMode in the Controls group This should have happened when use_docked_mode was moved over to the controls group internally. This just reflects this in the log. * main: load settings if the file has a title id In other words, don't exit if the loader has trouble getting a title id. * use a zero * settings: initalize resolution factor with constructor instead of casting * Revert "settings: initalize resolution factor with constructor instead of casting" This reverts commit 54c35ecb46a29953842614620f9b7de1aa9d5dc8. * configure_graphics: guard device selector when Vulkan is global Prevents the user from editing the device selector if Vulkan is the global renderer backend. Also resets the vulkan_device variable when the users switches back-and-forth between global and Vulkan. * address reviewer concerns Changes function variables to const wherever they don't need to be changed. Sets Settings::Setting to final as it should not be inherited from. Sets ConfigurationShared::use_global_text to static. Co-Authored-By: VolcaEM <volcaem@users.noreply.github.com> * main: load per-game settings after LoadROM This prevents `Restart Emulation` from restoring the global settings *after* the per-game settings were applied. Thanks to BSoDGamingYT for finding this bug. * Revert "main: load per-game settings after LoadROM" This reverts commit 9d0d48c52d2dcf3bfb1806cc8fa7d5a271a8a804. * main: only restore global settings when necessary Loading the per-game settings cannot happen after the ROM is loaded, so we have to specify when to restore the global state. Again thanks to BSoD for finding the bug. * configuration_shared: address reviewer concerns except operator overrides Dropping operator override usage in next commit. Co-Authored-By: LC <lioncash@users.noreply.github.com> * settings: Drop operator overrides from Setting template Requires using GetValue and SetValue explicitly. Also reverts a change that broke title ID formatting in the game properties dialog. * complete rebase * configuration_shared: translate "Use global configuration" Uses ConfigurePerGame to do so, since its usage, at least as of now, corresponds with ConfigurationShared. * configure_per_game: address reviewer concern As far as I understand, it prevents the program from unnecessarily copying strings. Co-Authored-By: LC <lioncash@users.noreply.github.com> Co-authored-by: Morph <39850852+Morph1984@users.noreply.github.com> Co-authored-by: VolcaEM <volcaem@users.noreply.github.com> Co-authored-by: LC <lioncash@users.noreply.github.com> | ||||
* | | Merge pull request #4150 from ReinUsesLisp/dynamic-state-impl | bunnei | 2020-07-07 | 13 | -275/+660 |
|\ \ | | | | | | | vulkan: Use VK_EXT_extended_dynamic_state when available | ||||
| * | | vk_rasterizer: Use nullptr for <pSizes> in CmdBindVertexBuffers2EXT | ReinUsesLisp | 2020-06-27 | 1 | -6/+6 |
| | | | | | | | | | | | | Disable this temporarily. | ||||
| * | | vk_pipeline_cache: Avoid hashing and comparing dynamic state when possible | ReinUsesLisp | 2020-06-27 | 6 | -23/+51 |
| | | | | | | | | | | | | | | | | | | With extended dynamic states, some bytes don't have to be collected from the pipeline key, hence we can avoid hashing and comparing them on lookups. | ||||
| * | | vulkan/fixed_pipeline_state: Move state out of individual structures | ReinUsesLisp | 2020-06-27 | 4 | -121/+84 |
| | | | |||||
| * | | vk_rasterizer: Use VK_EXT_extended_dynamic_state | ReinUsesLisp | 2020-06-27 | 5 | -46/+356 |
| | | | |||||
| * | | renderer_vulkan/wrapper: Add VK_EXT_extended_dynamic_state functions | ReinUsesLisp | 2020-06-27 | 2 | -0/+64 |
| | | | |||||
| * | | fixed_pipeline_state: Add requirements for VK_EXT_extended_dynamic_state | ReinUsesLisp | 2020-06-27 | 7 | -155/+143 |
| | | | | | | | | | | | | | | | | | | | | | This moves dynamic state present in VK_EXT_extended_dynamic_state to a separate structure in FixedPipelineState. This is structure is at the bottom allowing us to hash and memcmp only when the extension is not supported. | ||||
| * | | vk_device: Enable VK_EXT_extended_dynamic_state when available | ReinUsesLisp | 2020-06-27 | 2 | -0/+32 |
| | | | |||||
* | | | Merge pull request #4194 from ReinUsesLisp/fix-shader-cache | Fernando Sahmkow | 2020-07-05 | 1 | -29/+41 |
|\ \ \ | | | | | | | | | shader_cache: Fix use-after-free and orphan invalidation cache entries | ||||
| * | | | shader_cache: Fix use-after-free and orphan invalidation cache entries | ReinUsesLisp | 2020-07-01 | 1 | -29/+41 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This fixes some cases where entries could have been removed multiple times reading freed memory. To address this issue this commit removes duplicates from entries marked for removal and sorts out the removal process to fix another use-after-free situation. Another issue fixed in this commit is orphan invalidation cache entries. Previously only the entries that were invalidated in the current operations had its entries removed. This led to more use-after-free situations when these entries were actually invalidated but referenced an object that didn't exist. | ||||
* | | | | Merge pull request #4175 from ReinUsesLisp/read-buffer | bunnei | 2020-07-03 | 5 | -18/+24 |
|\ \ \ \ | | | | | | | | | | | gl_buffer_cache: Copy to buffers created as STREAM_READ before downloading | ||||
| * | | | | gl_buffer_cache: Copy to buffers created as STREAM_READ before downloading | ReinUsesLisp | 2020-06-26 | 5 | -18/+24 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | After marking buffers as resident, Nvidia's driver seems to take a slow path. To workaround this issue, copy to a STREAM_READ buffer and then call GetNamedBufferSubData on it. This is a temporary solution until we have asynchronous flushing. | ||||
* | | | | | Merge pull request #4082 from Morph1984/mirror-once-clamp | Rodrigo Locatti | 2020-07-02 | 1 | -0/+6 |
|\ \ \ \ \ | |_|/ / / |/| | | | | maxwell_to_gl: Implement MirrorOnceClampOGL wrap mode using GL_MIRROR_CLAMP_EXT | ||||
| * | | | | maxwell_to_gl: Implement MirrorOnceClampOGL using GL_MIRROR_CLAMP_EXT | Morph | 2020-06-30 | 1 | -0/+6 |
| | | | | | | | | | | | | | | | | | | | | Like MirrorOnceBorder, this requires the GL_EXT_texture_mirror_clamp extension. This extension is unfortunately not available on Intel's drivers (both Windows proprietary and Linux Mesa). Use GL_MIRROR_CLAMP_TO_EDGE as a fallback if the extension is unavailable. | ||||
* | | | | | Merge pull request #4176 from ReinUsesLisp/compatible-formats | Fernando Sahmkow | 2020-06-30 | 4 | -5/+216 |
|\ \ \ \ \ | | | | | | | | | | | | | texture_cache: Check format compatibility before copying | ||||
| * | | | | | texture_cache: Test format compatibility before copying | ReinUsesLisp | 2020-06-27 | 2 | -6/+21 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Avoid illegal copies. This intercepts the last step of a copy to avoid generating validation errors or corrupting the driver on some instances. We can create views and emit copies accordingly in future commits and remove this last-step validation. | ||||
| * | | | | | video_core/compatible_formats: Table to test if two formats are legal to view or copy | ReinUsesLisp | 2020-06-27 | 3 | -0/+196 |
| | |/ / / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Add a flat table to test if it's legal to create a texture view between two formats or copy betweem them. This table is based on ARB_copy_image and ARB_texture_view. Copies are more permissive than views. | ||||
* | | | | | Merge pull request #4157 from ReinUsesLisp/unified-turing | Fernando Sahmkow | 2020-06-30 | 1 | -19/+1 |
|\ \ \ \ \ | |_|/ / / |/| | | | | gl_device: Enable NV_vertex_buffer_unified_memory on Turing devices | ||||
| * | | | | gl_device: Enable NV_vertex_buffer_unified_memory on Turing devices | ReinUsesLisp | 2020-06-25 | 1 | -19/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Once we make sure not to corrupt Nvidia's driver, we can safely use resident buffers on Turing devices. See GitHub pull request #4156 | ||||
* | | | | | Merge pull request #4191 from Morph1984/vertex-formats | Rodrigo Locatti | 2020-06-30 | 3 | -109/+78 |
|\ \ \ \ \ | | | | | | | | | | | | | maxwell_to_gl/vk: Reorder vertex formats | ||||
| * | | | | | maxwell_to_gl: Rename VertexType() to VertexFormat() | Morph | 2020-06-29 | 2 | -4/+5 |
| | | | | | | |||||
| * | | | | | maxwell_to_vk: Reorder vertex formats and add A2B10G10R10 for all types except float | Morph | 2020-06-28 | 1 | -75/+69 |
| | | | | | | |||||
| * | | | | | maxwell_to_gl: Add 32 bit component sizes to (un)signed scaled formats | Morph | 2020-06-28 | 1 | -30/+4 |
| | | | | | | | | | | | | | | | | | | | | | | | | Add 32 bit component sizes to (un)signed scaled formats and group (un)signed normalized, scaled, and integer formats together. | ||||
* | | | | | | macro: Add support for "middle methods" on the code cache (#4112) | David | 2020-06-30 | 1 | -8/+27 |
| | | | | | | | | | | | | | | | | | | Macro code is just uploaded sequentially from a starting address, however that does not mean the entry point for the macro is at that address. This PR adds preliminary support for executing macros in the middle of our cached code. | ||||
* | | | | | | Merge pull request #4140 from ReinUsesLisp/validation-layers | Rodrigo Locatti | 2020-06-29 | 3 | -5/+43 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | renderer_vulkan: Update validation layer name and test before enabling | ||||
| * | | | | | | renderer_vulkan: Update validation layer name and test before enabling | ReinUsesLisp | 2020-06-22 | 3 | -5/+43 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Update validation layer string to VK_LAYER_KHRONOS_validation. While we are at it, properly check for available validation layers before enabling them. | ||||
* | | | | | | | General: Tune the priority of main emulation threads so they have higher priority than less important helper threads. | Fernando Sahmkow | 2020-06-27 | 2 | -0/+3 |
| | | | | | | | |||||
* | | | | | | | General: Correct rebase, sync gpu and context management. | Fernando Sahmkow | 2020-06-27 | 5 | -2/+25 |
| | | | | | | | |||||
* | | | | | | | General: Setup yuzu threads' microprofile, naming and registry. | Fernando Sahmkow | 2020-06-27 | 1 | -1/+5 |
| | | | | | | | |||||
* | | | | | | | General: Recover Prometheus project from harddrive failure | Fernando Sahmkow | 2020-06-27 | 1 | -2/+3 |
| |/ / / / / |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This commit: Implements CPU Interrupts, Replaces Cycle Timing for Host Timing, Reworks the Kernel's Scheduler, Introduce Idle State and Suspended State, Recreates the bootmanager, Initializes Multicore system. | ||||
* | | | | | | Merge pull request #4147 from ReinUsesLisp/hset2-imm | bunnei | 2020-06-27 | 2 | -21/+75 |
|\ \ \ \ \ \ | |_|_|_|/ / |/| | | | | | shader/half_set: Implement HSET2_IMM | ||||
| * | | | | | shader/half_set: Implement HSET2_IMM | ReinUsesLisp | 2020-06-23 | 2 | -21/+75 |
| |/ / / / | | | | | | | | | | | | | | | | | | | | | | | | | | Add HSET2_IMM. Due to the complexity of the encoding avoid using BitField unions and read the relevant bits from the code itself. This is less error prone. | ||||
* | | | | | Merge pull request #4144 from FernandoS27/tt-fix | bunnei | 2020-06-27 | 1 | -0/+3 |
|\ \ \ \ \ | | | | | | | | | | | | | TextureCache: Fix case where layer goes off bound. | ||||
| * | | | | | TextureCache: Fix case where layer goes off bound. | Fernando Sahmkow | 2020-06-22 | 1 | -0/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The returned layer is expected to be between 0 and the depth of the surface, anything larger is off bounds. | ||||
* | | | | | | Merge pull request #4111 from ReinUsesLisp/preserve-contents-vk | bunnei | 2020-06-27 | 2 | -7/+58 |
|\ \ \ \ \ \ | |_|_|_|/ / |/| | | | | | vk_rasterizer: Don't preserve contents on full screen clears | ||||
| * | | | | | vk_rasterizer: Don't preserve contents on full screen clears | ReinUsesLisp | 2020-06-18 | 2 | -7/+58 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | There's no need to load contents from the CPU when a clear resets all the contents of the underlying memory. This is already implemented on OpenGL and the texture cache. | ||||
* | | | | | | Merge pull request #4151 from ReinUsesLisp/gl-invalidations | Rodrigo Locatti | 2020-06-26 | 2 | -6/+7 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | gl_shader_cache: Avoid use after move for program size | ||||
| * | | | | | | gl_shader_cache: Avoid use after move for program size | ReinUsesLisp | 2020-06-24 | 2 | -6/+7 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | All programs had a size of zero due to this bug, skipping invalidations. While we are at it, remove some unused forward declarations. | ||||
* | | | | | | | gl_device: Fix IsASTCSupported | David Marcec | 2020-06-25 | 1 | -1/+1 |
| |_|_|_|/ / |/| | | | | | | | | | | | | | | | | | Other targets were never actually checked | ||||
* | | | | | | Merge pull request #4105 from ReinUsesLisp/resident-buffers | bunnei | 2020-06-24 | 14 | -213/+277 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | gl_rasterizer: Use NV_vertex_buffer_unified_memory for vertex buffer robustness | ||||
| * | | | | | | buffer_cache: Use buffer methods instead of cache virtual methods | ReinUsesLisp | 2020-06-24 | 5 | -99/+90 |
| | | | | | | | |||||
| * | | | | | | gl_stream_buffer: Use InvalidateBufferData instead unmap and map | ReinUsesLisp | 2020-06-24 | 2 | -15/+5 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Making the stream buffer resident increases GPU usage significantly on some games. This seems to be addressed invalidating the stream buffer with InvalidateBufferData instead of using a Unmap + Map (with invalidation flags). | ||||
| * | | | | | | gl_rasterizer: Use NV_vertex_buffer_unified_memory for vertex buffer robustness | ReinUsesLisp | 2020-06-24 | 3 | -9/+39 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Switch games are allowed to bind less data than what they use in a vertex buffer, the expected behavior here is that these values are read as zero. At the moment of writing this only D3D12, OpenGL and NVN through NV_vertex_buffer_unified_memory support vertex buffer with a size limit. In theory this could be emulated on Vulkan creating a new VkBuffer for each (handle, offset, length) tuple and binding the expected data to it. This is likely going to be slow and memory expensive when used on the vertex buffer and we have to do it on all draws because we can't know without analyzing indices when a game is going to read vertex data out of bounds. This is not a problem on OpenGL's BufferAddressRangeNV because it takes a length parameter, unlike Vulkan's CmdBindVertexBuffers that only takes buffers and offsets (the length is implicit in VkBuffer). It isn't a problem on D3D12 either, because D3D12_VERTEX_BUFFER_VIEW on IASetVertexBuffers takes SizeInBytes as a parameter (although I am not familiar with robustness on D3D12). Currently this only implements buffer ranges for vertex buffers, although indices can also be affected. A KHR_robustness profile is not created, but Nvidia's driver reads out of bound vertex data as zero anyway, this might have to be changed in the future. - Fixes SMO random triangles when capturing an enemy, getting hit, or looking at the environment on certain maps. | ||||
| * | | | | | | gl_buffer_cache: Mark buffers as resident | ReinUsesLisp | 2020-06-24 | 10 | -67/+111 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Make stream buffer and cached buffers as resident and query their address. This allows us to use GPU addresses for several proprietary Nvidia extensions. | ||||
| * | | | | | | gl_device: Expose NV_vertex_buffer_unified_memory except on Turing | ReinUsesLisp | 2020-06-24 | 2 | -1/+30 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Expose NV_vertex_buffer_unified_memory when the driver supports it. This commit adds a function the determine if a GL_RENDERER is a Turing GPU. This is required because on Turing GPUs Nvidia's driver crashes when the buffer is marked as resident or on DeleteBuffers. Without a synchronous debug output (single threaded driver), it's likely that the driver will crash in the first blocking call. | ||||
| * | | | | | | gl_stream_buffer: Always use a non-coherent buffer | ReinUsesLisp | 2020-06-24 | 2 | -14/+10 |
| | | | | | | | |||||
| * | | | | | | gl_stream_buffer: Always use persistent memory maps | ReinUsesLisp | 2020-06-24 | 2 | -30/+14 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | yuzu no longer supports platforms without persistent maps. | ||||
* | | | | | | | Merge pull request #4083 from Morph1984/B10G11R11F | bunnei | 2020-06-24 | 1 | -9/+17 |
|\ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | decode/image: Implement B10G11R11F | ||||
| * | | | | | | | decode/image: Implement B10G11R11F | Morph | 2020-06-20 | 1 | -9/+17 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Used by Kirby Star Allies | ||||
* | | | | | | | | Merge pull request #4046 from ogniK5377/macro-hle-prod | Fernando Sahmkow | 2020-06-24 | 9 | -10/+219 |
|\ \ \ \ \ \ \ \ | |_|/ / / / / / |/| | | | | | | | Add support for HLEing Macros | ||||
| * | | | | | | | addressed issues | David Marcec | 2020-06-24 | 2 | -4/+7 |
| | | | | | | | | |||||
| * | | | | | | | clear mme draw mode | David Marcec | 2020-06-24 | 1 | -0/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | We already draw, so we can clear it | ||||
| * | | | | | | | Addressed issues | David Marcec | 2020-06-24 | 5 | -13/+17 |
| | | | | | | | | |||||
| * | | | | | | | Fix constbuffer for 0217920100488FF7 | David Marcec | 2020-06-24 | 1 | -6/+6 |
| | | | | | | | | |||||
| * | | | | | | | Macro HLE support | David Marcec | 2020-06-24 | 9 | -10/+209 |
| | |/ / / / / | |/| | | | | | |||||
* | | | | | | | Merge pull request #4129 from Morph1984/texture-shadow-lod-workaround | Rodrigo Locatti | 2020-06-24 | 3 | -7/+50 |
|\ \ \ \ \ \ \ | |/ / / / / / |/| | | | | | | gl_shader_decompiler: Workaround textureLod when GL_EXT_texture_shadow_lod is not available | ||||
| * | | | | | | gl_shader_decompiler: Enable GL_EXT_texture_shadow_lod if available | Morph | 2020-06-21 | 1 | -7/+43 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | Enable GL_EXT_texture_shadow_lod if available. If this extension is not available, such as on Intel/AMD proprietary drivers, use textureGrad as a workaround. | ||||
| * | | | | | | gl_device: Check for GL_EXT_texture_shadow_lod | Morph | 2020-06-21 | 2 | -0/+7 |
| | | | | | | | |||||
* | | | | | | | Merge pull request #4127 from lioncash/dst-typo | bunnei | 2020-06-23 | 1 | -1/+1 |
|\ \ \ \ \ \ \ | |_|_|_|/ / / |/| | | | | | | texture_cache: Fix incorrect address used in a DeduceSurface() call | ||||
| * | | | | | | texture_cache: Fix incorrect address used in a DeduceSurface() call | Lioncash | 2020-06-20 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | Previously the source was being deduced twice in a row. | ||||
* | | | | | | | Merge pull request #4110 from ReinUsesLisp/direct-upload-sets | Rodrigo Locatti | 2020-06-22 | 3 | -42/+30 |
|\ \ \ \ \ \ \ | |_|_|_|_|/ / |/| | | | | | | vk_update_descriptor: Upload descriptor sets data directly | ||||
| * | | | | | | vk_update_descriptor: Upload descriptor sets data directly | ReinUsesLisp | 2020-06-18 | 3 | -42/+30 |
| | |_|/ / / | |/| | | | | | | | | | | | | | | | | | | | | | | Instead of copying to a temporary payload before sending the update task to the worker thread, insert elements to the payload directly. | ||||
* | | | | | | Merge pull request #4122 from lioncash/hide | bunnei | 2020-06-22 | 4 | -28/+31 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | video_core: Eliminate some variable shadowing | ||||
| * | | | | | | memory_manager: Eliminate variable shadowing | Lioncash | 2020-06-20 | 2 | -24/+28 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Renames some variables to prevent ones in inner scopes from shadowing outer-scoped variables. The Copy* functions have no shadowing, but we rename them anyways to remain consistent with the other functions. | ||||
| * | | | | | | macro_jit_x64: Eliminate variable shadowing in Compile_ProcessResult() | Lioncash | 2020-06-20 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | We can reduce the capture scope so that it's not possible for both "reg" variables to clash with one another. While we're at it, we can prevent unnecessary copies while we're at it. | ||||
| * | | | | | | buffer_cache: Eliminate local variable shadowing | Lioncash | 2020-06-20 | 1 | -2/+1 |
| | |/ / / / | |/| | | | | | | | | | | | | | | | | We can just make use of the instance in the scope above this one. | ||||
* | | | | | | Merge pull request #4126 from lioncash/noexcept | bunnei | 2020-06-22 | 2 | -3/+2 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | vulkan/wrapper: Remove noexcept from GetSurfaceCapabilitiesKHR() | ||||
| * | | | | | | vulkan/wrapper: Remove noexcept from GetSurfaceCapabilitiesKHR() | Lioncash | 2020-06-20 | 2 | -3/+2 |
| |/ / / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Check() can throw an exception if the Vulkan result isn't successful. We remove the check so that std::terminate isn't outright called and allows for better debugging (should it ever actually fail). | ||||
* | | | | | | Merge pull request #4120 from lioncash/arb | bunnei | 2020-06-21 | 1 | -32/+31 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | gl_arb_decompiler: Avoid several string copies | ||||
| * | | | | | | gl_arb_decompiler: Avoid several string copies | Lioncash | 2020-06-20 | 1 | -32/+31 |
| |/ / / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Variables that are marked as const cannot have the move constructor invoked when returning from a function (the move constructor requires a non-const variable so it can "steal" the resources from it. | ||||
* | | | | | | macro_jit_x64: Use ecx for shift register | MerryMage | 2020-06-20 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | shl/shr only accept cl as their second argument | ||||
* | | | | | | Merge pull request #4125 from lioncash/macro-shift | merry | 2020-06-20 | 1 | -6/+7 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | macro_jit_x64: Amend readability of Compile_ExtractShiftLeftRegister() | ||||
| * | | | | | | macro_jit_x64: Correct readability of Compile_ExtractShiftLeftImmediate() | Lioncash | 2020-06-20 | 1 | -3/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | Previously dst wasn't being used. | ||||
| * | | | | | | macro_jit_x64: Correct readability of Compile_ExtractShiftLeftRegister() | Lioncash | 2020-06-20 | 1 | -3/+4 |
| |/ / / / / | | | | | | | | | | | | | | | | | | | Previously dst wasn't being used. | ||||
* | | | | | | Merge pull request #4123 from lioncash/unused-var | merry | 2020-06-20 | 1 | -2/+1 |
|\ \ \ \ \ \ | |_|_|/ / / |/| | | | | | macro_jit_x64: Remove unused variable | ||||
| * | | | | | macro_jit_x64: Remove unused variable | Lioncash | 2020-06-20 | 1 | -2/+1 |
| |/ / / / | | | | | | | | | | | | | | | | | | | | | Removes a completely unused label and marks another variable as unused, given it seems like it has potential uses in the future. | ||||
* | | | | | Merge pull request #4099 from MerryMage/macOS-build | bunnei | 2020-06-20 | 5 | -14/+16 |
|\ \ \ \ \ | | | | | | | | | | | | | Fix compilation on macOS | ||||
| * | | | | | vk_rasterizer: BindTransformFeedbackBuffersEXT accepts a size of type VkDeviceSize | MerryMage | 2020-06-18 | 1 | -1/+1 |
| | | | | | | |||||
| * | | | | | renderer_vulkan: Fix macOS GetBundleDirectory reference | MerryMage | 2020-06-18 | 1 | -1/+3 |
| | | | | | | |||||
| * | | | | | memory_util: boost hashes are size_t | MerryMage | 2020-06-18 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * boost::hash_value returns a size_t * boost::hash_combine takes a size_t& argument | ||||
| * | | | | | Rename PAGE_SHIFT to PAGE_BITS | MerryMage | 2020-06-18 | 2 | -10/+10 |
| | |/ / / | |/| | | | | | | | | | | | | | macOS header files #define PAGE_SHIFT | ||||
* | | | | | Merge pull request #4087 from MerryMage/macrojit-inline-Read | bunnei | 2020-06-20 | 2 | -14/+22 |
|\ \ \ \ \ | |_|/ / / |/| | | | | macro_jit_x64: Inline Engines::Maxwell3D::GetRegisterValue | ||||
| * | | | | macro_jit_x64: Remove unused function Read | MerryMage | 2020-06-19 | 1 | -8/+4 |
| | | | | | |||||
| * | | | | macro_jit_x64: Inline Engines::Maxwell3D::GetRegisterValue | MerryMage | 2020-06-17 | 2 | -6/+18 |
| | | | | | |||||
* | | | | | Merge pull request #4090 from MerryMage/macrojit-bugs | bunnei | 2020-06-19 | 1 | -2/+5 |
|\ \ \ \ \ | |_|_|/ / |/| | | | | macro_jit_x64: Optimization correctness | ||||
| * | | | | macro_jit_x64: Optimization implicitly assumes same destination | MerryMage | 2020-06-17 | 1 | -1/+2 |
| | | | | | |||||
| * | | | | macro_jit_x64: Should not skip zero registers for certain ALU ops | MerryMage | 2020-06-17 | 1 | -1/+3 |
| | | | | | | | | | | | | | | | | | | | | The code generated for these ALU ops assume src_a and src_b are always valid. | ||||
* | | | | | vk_sampler_cache: Emulate GL_LINEAR/NEAREST minification filters | Morph | 2020-06-18 | 1 | -2/+4 |
| | | | | | | | | | | | | | | | | | | | | Emulate GL_LINEAR/NEAREST minification filters using minLod = 0 and maxLod = 0.25 during sampler creation | ||||
* | | | | | maxwell_to_vk: Reorder filter cases and correct mipmap_filter=None | Morph | 2020-06-18 | 1 | -17/+15 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | maxwell_to_vk: Reorder filtering modes to start with None, then Nearest, then Linear. maxwell_to_vk: Logs filter modes under UNREACHABLE_MSG instead of UNIMPLEMENTED_MSG, since any unknown filter modes are invalid and not unimplemented. maxwell_to_vk: Return VK_SAMPLER_MIPMAP_MODE_NEAREST instead of VK_SAMPLER_MIPMAP_MODE_LINEAR when mipmap_filter is None with the description from the VkSamplerCreateInfo(3) man page. | ||||
* | | | | | maxwell_to_gl: Miscellaneous changes | Morph | 2020-06-18 | 1 | -48/+34 |
| |_|/ / |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | maxwell_to_gl: Log unimplemented features under UNIMPLEMENTED_MSG instead of LOG_ERROR to bring into parity with maxwell_to_vk maxwell_to_gl: Deduplicate logging in VertexType(), merging them into one. maxwell_to_gl: Return GL_NEAREST instead of GL_LINEAR if an unknown texture filter mode is encountered. maxwell_to_gl: Log the mipmap filter mode if an unknown value is passed in. maxwell_to_gl: Reorder filtering modes to start with None, then Nearest, then Linear. | ||||
* | | | | Merge pull request #4092 from Morph1984/image-bindings | Rodrigo Locatti | 2020-06-18 | 1 | -6/+14 |
|\ \ \ \ | |_|/ / |/| | | | gl_device: Reserve 4 image bindings for fragment stage | ||||
| * | | | gl_device: Reserve at least 4 image bindings for fragment stage | Morph | 2020-06-16 | 1 | -6/+14 |
| | | | | | | | | | | | | | | | | | | | | | | | | Due to the limitation of GL_MAX_IMAGE_UNITS being low (8) on Intel's and Nvidia's proprietary drivers, we have to reserve an appropriate amount of image bindings for each of the stages. So far games have been observed to use 4 image bindings on the fragment stage (Kirby Star Allies) and 1 on the vertex stage (TWD series). No games thus far in my limited testing used more than 4 images concurrently and across all currently active programs. This fixes shader compilation errors on Kirby Star Allies on OpenGL (GLSL/GLASM) | ||||
* | | | | Merge pull request #4086 from MerryMage/abi | bunnei | 2020-06-17 | 1 | -6/+6 |
|\ \ \ \ | |_|/ / |/| | | | xbyak_abi: Cleanup | ||||
| * | | | xbyak_abi: Remove *GPS variants of stack manipulation functions | MerryMage | 2020-06-15 | 1 | -6/+6 |
| | | | | |||||
* | | | | Merge pull request #4089 from MerryMage/macrojit-cleanup-1 | bunnei | 2020-06-17 | 2 | -51/+18 |
|\ \ \ \ | | | | | | | | | | | macro_jit_x64: Cleanup | ||||
| * | | | | macro_jit_x64: Remove NEXT_PARAMETER | MerryMage | 2020-06-15 | 1 | -5/+2 |
| | | | | | | | | | | | | | | | | | | | | Not required, as PARAMETERS can just be incremented directly. | ||||
| * | | | | macro_jit_x64: Remove unused function Compile_WriteCarry | MerryMage | 2020-06-15 | 2 | -9/+0 |
| | | | | | |||||
| * | | | | macro_jit_x64: Select better registers | MerryMage | 2020-06-15 | 1 | -8/+8 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | All registers are now callee-save registers. RBX and RBP selected for STATE and RESULT because these are most commonly accessed; this is to avoid the REX prefix. RBP not used for STATE because there are some SIB restrictions, RBX emits smaller code. | ||||
| * | | | | macro_jit_x64: Remove REGISTERS | MerryMage | 2020-06-15 | 1 | -7/+3 |
| | | | | | | | | | | | | | | | | | | | | Unnecessary since this is just an offset from STATE. | ||||
| * | | | | macro_jit_x64: Remove JITState::parameters | MerryMage | 2020-06-15 | 2 | -6/+3 |
| | | | | | | | | | | | | | | | | | | | | This can be passed in as an argument instead. | ||||
| * | | | | macro_jit_x64: Remove METHOD_ADDRESS_64 | MerryMage | 2020-06-15 | 1 | -2/+1 |
| | | | | | | | | | | | | | | | | | | | | Unnecessary variable. | ||||
| * | | | | macro_jit_x64: Remove RESULT_64 | MerryMage | 2020-06-15 | 2 | -16/+3 |
| |/ / / | | | | | | | | | | | | | This Reg64 codepath has the exact same behaviour as the Reg32 one. | ||||
* | | | | Merge pull request #4041 from ReinUsesLisp/arb-decomp | bunnei | 2020-06-16 | 6 | -1/+2114 |
|\ \ \ \ | |_|/ / |/| | | | gl_arb_decompiler: Implement an assembly shader decompiler | ||||
| * | | | gl_arb_decompiler: Implement FSwizzleAdd | ReinUsesLisp | 2020-06-12 | 1 | -4/+27 |
| | | | | |||||
| * | | | gl_arb_decompiler: Implement an assembly shader decompiler | ReinUsesLisp | 2020-06-12 | 6 | -1/+2091 |
| | | | | | | | | | | | | | | | | | | | | | | | | Emit code compatible with NV_gpu_program5. This should emit code compatible with Fermi, but it wasn't tested on that architecture. Pascal has some issues not present on Turing GPUs. | ||||
* | | | | Merge pull request #4066 from ReinUsesLisp/shared-ptr-buf | Rodrigo Locatti | 2020-06-16 | 9 | -174/+150 |
|\ \ \ \ | | | | | | | | | | | buffer_cache: Avoid passing references of shared pointers and misc style changes | ||||
| * | | | | buffer_cache: Avoid passing references of shared pointers and misc style changes | ReinUsesLisp | 2020-06-09 | 9 | -174/+150 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Instead of using as template argument a shared pointer, use the underlying type and manage shared pointers explicitly. This can make removing shared pointers from the cache more easy. While we are at it, make some misc style changes and general improvements (like insert_or_assign instead of operator[] + operator=). | ||||
* | | | | | video_core/macro_jit_x64: Remove initializer in member variable | ReinUsesLisp | 2020-06-15 | 1 | -2/+2 |
| |_|/ / |/| | | | | | | | | | | | | | | | Fix build time issues on gcc. Confirmed through asan that avoiding this initialization is safe. | ||||
* | | | | Merge pull request #4064 from ReinUsesLisp/invalidate-buffers | bunnei | 2020-06-14 | 2 | -8/+19 |
|\ \ \ \ | | | | | | | | | | | gl_rasterizer: Mark vertex buffers as dirty after buffer cache invalidation | ||||
| * | | | | gl_rasterizer: Mark vertex buffers as dirty after buffer cache invalidation | ReinUsesLisp | 2020-06-09 | 1 | -1/+10 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Vertex buffers bindings become invalid after the stream buffer is invalidated. We were originally doing this, but it got lost at some point. - Fixes Animal Crossing: New Horizons, but it affects everything. | ||||
| * | | | | buffer_cache: Return stream buffer invalidation in Map instead of Unmap | ReinUsesLisp | 2020-06-09 | 1 | -7/+9 |
| | | | | | | | | | | | | | | | | | | | | | | | | | We have to invalidate whatever cache is being used before uploading the data, hence it makes more sense to return this on Map instead of Unmap. | ||||
* | | | | | Merge pull request #4049 from ReinUsesLisp/separate-samplers | bunnei | 2020-06-13 | 16 | -113/+273 |
|\ \ \ \ \ | | | | | | | | | | | | | shader/texture: Join separate image and sampler pairs offline | ||||
| * | | | | | shader/texture: Join separate image and sampler pairs offline | ReinUsesLisp | 2020-06-05 | 16 | -88/+234 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Games using D3D idioms can join images and samplers when a shader executes, instead of baking them into a combined sampler image. This is also possible on Vulkan. One approach to this solution would be to use separate samplers on Vulkan and leave this unimplemented on OpenGL, but we can't do this because there's no consistent way of determining which constant buffer holds a sampler and which one an image. We could in theory find the first bit and if it's in the TIC area, it's an image; but this falls apart when an image or sampler handle use an index of zero. The used approach is to track for a LOP.OR operation (this is done at an IR level, not at an ISA level), track again the constant buffers used as source and store this pair. Then, outside of shader execution, join the sample and image pair with a bitwise or operation. This approach won't work on games that truly use separate samplers in a meaningful way. For example, pooling textures in a 2D array and determining at runtime what sampler to use. This invalidates OpenGL's disk shader cache :) - Used mostly by D3D ports to Switch | ||||
| * | | | | | shader/track: Move bindless tracking to a separate function | ReinUsesLisp | 2020-06-05 | 2 | -25/+39 |
| | | | | | | |||||
* | | | | | | Merge pull request #3986 from ReinUsesLisp/shader-cache | bunnei | 2020-06-13 | 14 | -417/+364 |
|\ \ \ \ \ \ | |_|_|_|/ / |/| | | | | | shader_cache: Implement a generic runtime shader cache | ||||
| * | | | | | rasterizer_cache: Remove files and includes | ReinUsesLisp | 2020-06-07 | 7 | -269/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The rasterizer cache is no longer used. Each cache has its own generic implementation optimized for the cached data. | ||||
| * | | | | | vk_pipeline_cache: Use generic shader cache | ReinUsesLisp | 2020-06-07 | 5 | -58/+55 |
| | | | | | | | | | | | | | | | | | | | | | | | | Trivial port the generic shader cache to Vulkan. | ||||
| * | | | | | gl_shader_cache: Use generic shader cache | ReinUsesLisp | 2020-06-07 | 4 | -93/+80 |
| | | | | | | | | | | | | | | | | | | | | | | | | Trivially port the generic shader cache to OpenGL. | ||||
| * | | | | | shader_cache: Implement a generic shader cache | ReinUsesLisp | 2020-06-07 | 2 | -0/+229 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Implement a generic shader cache for fast lookups and invalidations. Invalidations are cheap but expensive when a shader is invalidated. Use two mutexes instead of one to avoid locking invalidations for lookups and vice versa. When a shader has to be removed, lookups are locked as expected. | ||||
* | | | | | | Merge pull request #4027 from ReinUsesLisp/3d-slices | bunnei | 2020-06-10 | 10 | -131/+205 |
|\ \ \ \ \ \ | |_|_|_|/ / |/| | | | | | texture_cache: Implement rendering to 3D textures | ||||
| * | | | | | texture_cache: Port original code management for 2D vs 3D textures | ReinUsesLisp | 2020-06-08 | 2 | -16/+35 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Handle blits to images as 2D, even when they have block depth. - Fixes rendering issues on Luigi's Mansion 3 | ||||
| * | | | | | texture_cache: Simplify blit code | ReinUsesLisp | 2020-06-08 | 1 | -9/+7 |
| | | | | | | |||||
| * | | | | | texture_cache: Handle 3D texture blits with one layer | ReinUsesLisp | 2020-06-08 | 3 | -5/+10 |
| | | | | | | |||||
| * | | | | | texture_cache: Implement rendering to 3D textures | ReinUsesLisp | 2020-06-08 | 10 | -139/+191 |
| | |_|/ / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This allows rendering to 3D textures with more than one slice. Applications are allowed to render to more than one slice of a texture using gl_Layer from a VTG shader. This also requires reworking how 3D texture collisions are handled, for now, this commit allows rendering to slices but not to miplevels. When a render target attempts to write to a mipmap, we fallback to the previous implementation (copying or flushing as needed). - Fixes color correction 3D textures on UE4 games (rainbow effects). - Allows Xenoblade games to render to 3D textures directly. | ||||
* | | | | | Merge pull request #4040 from ReinUsesLisp/nv-transform-feedback | bunnei | 2020-06-08 | 3 | -1/+96 |
|\ \ \ \ \ | | | | | | | | | | | | | gl_rasterizer: Use NV_transform_feedback for XFB on assembly shaders | ||||
| * | | | | | gl_rasterizer: Use NV_transform_feedback for XFB on assembly shaders | ReinUsesLisp | 2020-06-04 | 3 | -1/+96 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | NV_transform_feedback, NV_transform_feedback2 and ARB_transform_feedback3 with NV_transform_feedback interactions allows implementing transform feedbacks as dynamic state. Maxwell implements transform feedbacks as dynamic state, so using these extensions with TransformFeedbackStreamAttribsNV allows us to properly emulate transform feedbacks without having to recompile shaders when the state changes. | ||||
* | | | | | | Merge pull request #4052 from ReinUsesLisp/debug-output | bunnei | 2020-06-08 | 1 | -4/+2 |
|\ \ \ \ \ \ | |_|/ / / / |/| | | | | | renderer_opengl: Only enable DEBUG_OUTPUT when graphics debugging is enabled | ||||
| * | | | | | renderer_opengl: Only enable DEBUG_OUTPUT when graphics debugging is enabled | ReinUsesLisp | 2020-06-06 | 1 | -4/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Avoids logging when it's not relevant. This can potentially reduce driver's internal thread overhead. | ||||
* | | | | | | Merge pull request #4034 from ReinUsesLisp/storage-texels | Rodrigo Locatti | 2020-06-07 | 8 | -91/+143 |
|\ \ \ \ \ \ | |_|_|/ / / |/| | | | | | vk_rasterizer: Implement storage texels and atomic image operations | ||||
| * | | | | | vk_shader_decompiler: Implement atomic image operations | ReinUsesLisp | 2020-06-02 | 1 | -40/+24 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Implement atomic operations on images. On GLSL these are atomicImage* functions (e.g. atomicImageAdd). | ||||
| * | | | | | vk_rasterizer: Implement storage texels | ReinUsesLisp | 2020-06-02 | 8 | -52/+120 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This is the equivalent of an image buffer on OpenGL. - Used by Octopath Traveler | ||||
* | | | | | | gl_device: Black list NVIDIA 443.24 for fast buffer uploads | ReinUsesLisp | 2020-06-06 | 1 | -2/+10 |
| |/ / / / |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Skip fast buffer uploads on Nvidia 443.24 Vulkan beta driver on OpenGL. This driver throws the following error when calling BufferSubData or BufferData on buffers that are candidates for fast constant buffer uploads. This is the equivalens to push constants on Vulkan, except that they can access the full buffer. The error: Unknown internal debug message. The NVIDIA OpenGL driver has encountered an out of memory error. This application might behave inconsistently and fail. If this error persists on future drivers, we might have to look deeper into this issue. For now, we can black list it and log it as a temporary solution. | ||||
* | | | | | Merge pull request #4013 from ReinUsesLisp/skip-no-xfb | bunnei | 2020-06-05 | 1 | -0/+7 |
|\ \ \ \ \ | | | | | | | | | | | | | vk_rasterizer: Skip transform feedbacks when extension is unavailable | ||||
| * | | | | | vk_rasterizer: Skip transform feedbacks when extension is unavailable | ReinUsesLisp | 2020-05-29 | 1 | -0/+7 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Avoids calling transform feedback procedures when VK_EXT_transform_feedback is not available. | ||||
* | | | | | | Merge pull request #4031 from Morph1984/fix-gs-outputs | bunnei | 2020-06-04 | 1 | -1/+13 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | gl_shader_decompiler: Fix geometry shader outputs on Intel drivers | ||||
| * | | | | | | gl_shader_decompiler: Declare gl_Layer and gl_ViewportIndex within gl_PerVertex for vertex and tessellation shaders | Morph | 2020-06-01 | 1 | -6/+16 |
| | | | | | | | |||||
| * | | | | | | gl_shader_decompiler: Fix geometry shader outputs for Intel drivers | Morph | 2020-06-01 | 1 | -13/+15 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | On Intel's proprietary drivers, gl_Layer and gl_ViewportIndex are not allowed members of gl_PerVertex block, causing the shader to fail to compile. Fix this by declaring these variables outside of gl_PerVertex. | ||||
* | | | | | | | Merge pull request #4009 from ogniK5377/macro-jit-prod | bunnei | 2020-06-04 | 9 | -187/+1018 |
|\ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | video_core: Implement Macro JIT | ||||
| * | | | | | | | Default init labels and use initializer list for macro engine | David Marcec | 2020-06-04 | 2 | -2/+2 |
| | | | | | | | | |||||
| * | | | | | | | Mark parameters as const | David Marcec | 2020-06-03 | 8 | -11/+11 |
| | | | | | | | | |||||
| * | | | | | | | Pass by reference instead of copying parameters | David Marcec | 2020-06-02 | 4 | -7/+9 |
| | | | | | | | | |||||
| * | | | | | | | Favor switch case over jump table | David Marcec | 2020-05-30 | 2 | -18/+26 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Easier to read and will emit a jump table automatically. | ||||
| * | | | | | | | Implement macro JIT | David Marcec | 2020-05-30 | 9 | -189/+1010 |
| | | | | | | | | |||||
* | | | | | | | | Merge pull request #4012 from ReinUsesLisp/mipmap-overlaps | bunnei | 2020-06-03 | 1 | -28/+43 |
|\ \ \ \ \ \ \ \ | |_|_|_|_|/ / / |/| | | | | | | | texture_cache: Handle overlaps with multiple subresources | ||||
| * | | | | | | | texture_cache: More relaxed reconstruction | ReinUsesLisp | 2020-05-30 | 1 | -13/+9 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Only reupload textures when they've not been modified from the GPU. | ||||
| * | | | | | | | texture_cache: Only copy textures that were modified from host | ReinUsesLisp | 2020-05-30 | 1 | -2/+6 |
| | | | | | | | | |||||
| * | | | | | | | texture_cache: Reload textures when number of resources mismatch | ReinUsesLisp | 2020-05-30 | 1 | -0/+9 |
| | | | | | | | | |||||
| * | | | | | | | texture_cache: Handle overlaps with multiple subresources | ReinUsesLisp | 2020-05-29 | 1 | -27/+33 |
| | |_|/ / / / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Implement more surface reconstruct cases. Allow overlaps with more than one layer and mipmap and copies all of them to the new texture. - Fixes textures moving around objects on Xenoblade games | ||||
* | | | | | | | Merge pull request #4014 from ReinUsesLisp/astc-nvidia | bunnei | 2020-06-02 | 2 | -8/+19 |
|\ \ \ \ \ \ \ | |_|_|_|_|/ / |/| | | | | | | gl_device: Avoid devices with CAVEAT_SUPPORT on ASTC | ||||
| * | | | | | | gl_device: Avoid devices with CAVEAT_SUPPORT on ASTC | ReinUsesLisp | 2020-06-01 | 2 | -8/+19 |
| |/ / / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This avoids using Nvidia's ASTC decoder on OpenGL. The last time it was profiled, it was slower than yuzu's decoder. While we are at it, fix a bug in the texture cache when native ASTC is not supported. | ||||
* | | | | | | Merge pull request #4006 from ReinUsesLisp/squash-ubos | bunnei | 2020-06-02 | 7 | -79/+173 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | glsl: Squash constant buffers into a single SSBO when we hit the limit | ||||
| * | | | | | | glsl: Squash constant buffers into a single SSBO when we hit the limit | ReinUsesLisp | 2020-06-01 | 7 | -79/+173 |
| | |_|/ / / | |/| | | | | | | | | | | | | | | | | | | | | | | Avoids compilation errors at the cost of shader build times and runtime performance when a game hits the limit of uniform buffers we can use. | ||||
* | | | | | | Merge pull request #4016 from ReinUsesLisp/invocation-info | LC | 2020-06-02 | 1 | -1/+1 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | shader/other: Fix hardcoded value in S2R INVOCATION_INFO | ||||
| * | | | | | | shader/other: Fix hardcoded value in S2R INVOCATION_INFO | ReinUsesLisp | 2020-05-30 | 1 | -1/+1 |
| | |/ / / / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Geometry shaders built from Nvidia's compiler check for bits[16:23] to be less than or equal to 0 with VSETP to default to a "safe" value of 0x8000'0000 (safe from hardware's perspective). To avoid hitting this path in the shader, return 0x00ff'0000 from S2R INVOCATION_INFO. This seems to be the maximum number of vertices a geometry shader can emit in a primitive. | ||||
* | | | | | | Merge pull request #4033 from ReinUsesLisp/vk-r16ui | LC | 2020-06-02 | 2 | -71/+74 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | maxwell_to_vk: Add R16UI image format | ||||
| * | | | | | | maxwell_to_vk: Add R16UI image format | ReinUsesLisp | 2020-06-02 | 2 | -71/+74 |
| | |_|_|/ / | |/| | | | | | | | | | | | | | | | | - Used by Octopath Traveler | ||||
* | | | | | | Merge pull request #4001 from ReinUsesLisp/avoid-copies | bunnei | 2020-06-01 | 1 | -17/+23 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | buffer_cache: Avoid copying twice on certain cases | ||||
| * | | | | | | buffer_cache: Avoid copying twice on certain cases | ReinUsesLisp | 2020-05-28 | 1 | -17/+23 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Avoid copying to a staging buffer on non-granular memory addresses. Add a callable argument to StreamBufferUpload to be able to copy to the staging buffer directly from ReadBlockUnsafe. | ||||
* | | | | | | | Merge pull request #3998 from ReinUsesLisp/init-3d | bunnei | 2020-06-01 | 1 | -0/+4 |
|\ \ \ \ \ \ \ | |_|/ / / / / |/| | | | | | | maxwell_3d: Initialize more registers to their expected value | ||||
| * | | | | | | maxwell_3d: Initialize line widths | ReinUsesLisp | 2020-05-27 | 1 | -0/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | Initialize line widths to avoid setting a line width of zero. | ||||
| * | | | | | | maxwell_3d: Initialize polygon modes | ReinUsesLisp | 2020-05-27 | 1 | -0/+2 |
| |/ / / / / | | | | | | | | | | | | | | | | | | | | | | | | | NVN expects this to be initialized as Fill, otherwise games that never bind a rasterizer state will log an invalid polygon mode. | ||||
* | | | | | | Merge pull request #4005 from ReinUsesLisp/g24r8 | Rodrigo Locatti | 2020-06-01 | 1 | -1/+2 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | format_lookup_table: Implement G24S8 format as S8Z24 | ||||
| * | | | | | | format_lookup_table: Implement G24S8 format as S8Z24 | ReinUsesLisp | 2020-05-28 | 1 | -1/+2 |
| | | | | | | | |||||
* | | | | | | | Merge pull request #3996 from ReinUsesLisp/front-faces | bunnei | 2020-06-01 | 2 | -7/+21 |
|\ \ \ \ \ \ \ | |_|_|_|/ / / |/| | | | | | | fixed_pipeline_state,gl_rasterizer: Swap negative viewport checks for front faces | ||||
| * | | | | | | gl_rasterizer: Port front face flip check from Vulkan | ReinUsesLisp | 2020-05-26 | 1 | -5/+20 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | While Vulkan was assuming we had no negative viewports, OpenGL code was assuming we had them. Port the old code from Vulkan to OpenGL, checking if the first viewport is negative before flipping faces. This is not a complete implementation since we only check for the first viewport to be negative. That said, unless a game is using Vulkan, OpenGL and NVN games should be fine here, and we can always compare with our Vulkan backend to see if there's a difference. | ||||
| * | | | | | | fixed_pipeline_state: Remove unnecessary check for front faces flip | ReinUsesLisp | 2020-05-26 | 1 | -2/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The check to flip faces when viewports are negative were a left over from the old OpenGL code. This is not required on Vulkan where we have negative viewports. | ||||
* | | | | | | | Merge pull request #3930 from ReinUsesLisp/animal-borders | bunnei | 2020-06-01 | 4 | -17/+26 |
|\ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | vk_rasterizer: Implement constant attributes | ||||
| * | | | | | | | vk_rasterizer: Implement constant attributes | ReinUsesLisp | 2020-05-13 | 4 | -13/+26 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Constant attributes (in OpenGL known disabled attributes) are not supported on Vulkan, even with extensions. To emulate this behavior we return zero on reads from disabled vertex attributes in shader code. This has no caching cost because attribute formats are not dynamic state on Vulkan and we have to store it in the pipeline cache anyway. - Fixes Animal Crossing: New Horizons terrain borders | ||||
| * | | | | | | | vk_rasterizer: Remove buffer check in attribute selection | ReinUsesLisp | 2020-05-13 | 1 | -4/+0 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This was a left over from OpenGL when disabled buffers where not properly emulated. We no longer have to assert this as it is checked in vertex buffer initialization. | ||||
* | | | | | | | | Merge pull request #3958 from FernandoS27/gl-debug | bunnei | 2020-05-31 | 1 | -0/+3 |
|\ \ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | | | OpenGL: Enable Debug Context and Synchronous debugging when graphics debugging is enabled | ||||
| * | | | | | | | | OpenGL: Enable Debug Context and Synchronous debugging when graphics debugging is enabled. | Fernando Sahmkow | 2020-05-18 | 1 | -0/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This commit aims to help easing debugging of driver crashes without having to modify existing code. | ||||
* | | | | | | | | | Merge pull request #3999 from ReinUsesLisp/opt-tex-cache | bunnei | 2020-05-31 | 1 | -24/+29 |
|\ \ \ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | | | | | texture_cache: Optimize GetSurfacesInRegion | ||||
| * | | | | | | | | | texture_cache: Use unordered_map::find instead of operator[] on hot code | ReinUsesLisp | 2020-05-27 | 1 | -15/+19 |
| | | | | | | | | | | |||||
| * | | | | | | | | | texture_cache: Use small vector for surface vectors | ReinUsesLisp | 2020-05-27 | 1 | -9/+10 |
| | |_|_|_|/ / / / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This avoids most heap allocations when collecting surfaces into a vector. | ||||
* | | | | | | | | | gl_device: Enable compute shaders for Intel proprietary drivers | Morph | 2020-05-31 | 3 | -13/+0 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Previously we were disabling compute shaders on Intel's proprietary driver due to broken compute. This has been fixed in the latest Intel drivers. Re-enable compute for Intel proprietary drivers and remove the check for broken compute. | ||||
* | | | | | | | | | Merge pull request #3982 from ReinUsesLisp/membar-cts | bunnei | 2020-05-30 | 4 | -9/+27 |
|\ \ \ \ \ \ \ \ \ | |_|_|_|_|_|_|/ / |/| | | | | | | | | shader/other: Implement MEMBAR.CTS | ||||
| * | | | | | | | | shader/other: Implement MEMBAR.CTS | ReinUsesLisp | 2020-05-27 | 4 | -9/+27 |
| |/ / / / / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This silences an assertion we were hitting and uses workgroup memory barriers when the game requests it. | ||||
* | | | | | | | | Add xbyak external | David Marcec | 2020-05-30 | 1 | -1/+1 |
| | | | | | | | | |||||
* | | | | | | | | Merge pull request #4007 from ReinUsesLisp/reduce-logs | bunnei | 2020-05-29 | 1 | -6/+7 |
|\ \ \ \ \ \ \ \ | |_|_|_|_|_|/ / |/| | | | | | | | maxwell_3d: Reduce severity of logs that can be spammed | ||||
| * | | | | | | | maxwell_3d: Reduce severity of logs that can be spammed | ReinUsesLisp | 2020-05-28 | 1 | -6/+7 |
| | |_|_|_|/ / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | These logs were killing performance on some games when they were spammed. Reduce them to Debug severity. | ||||
* | | | | | | | Merge pull request #3991 from ReinUsesLisp/depth-sampling | bunnei | 2020-05-29 | 5 | -68/+83 |
|\ \ \ \ \ \ \ | |/ / / / / / |/| | | | | | | texture_cache: Implement depth stencil texture swizzles | ||||
| * | | | | | | texture_cache: Fix layered null surfaces | ReinUsesLisp | 2020-05-26 | 1 | -1/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Null texture cubes were not considered arrays, causing issues on Vulkan and OpenGL when creating views. | ||||
| * | | | | | | gl_texture_cache: Implement small texture view cache for swizzles | ReinUsesLisp | 2020-05-26 | 3 | -37/+44 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This fixes cases where the texture swizzle was applied twice on the same draw to a texture bound to two different slots. | ||||
| * | | | | | | texture_cache: Implement depth stencil texture swizzles | ReinUsesLisp | 2020-05-26 | 3 | -36/+42 |
| | |_|_|/ / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Stop ignoring image swizzles on depth and stencil images. This doesn't fix a known issue on Xenoblade Chronicles 2 where an OpenGL texture changes swizzles twice before being used. A proper fix would be having a small texture view cache for this like we do on Vulkan. | ||||
* | | | | | | Merge pull request #3993 from ReinUsesLisp/fix-zla | bunnei | 2020-05-28 | 1 | -0/+4 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | gl_shader_manager: Unbind GLSL program when binding a host pipeline | ||||
| * | | | | | | gl_shader_manager: Unbind GLSL program when binding a host pipeline | ReinUsesLisp | 2020-05-26 | 1 | -0/+4 |
| |/ / / / / | | | | | | | | | | | | | | | | | | | Fixes regression in Link's Awakening caused by 420cc13248350ef5c2d19e0b961cb4185cd16a8a | ||||
* | | | | | | Merge pull request #3961 from Morph1984/bgra8_srgb | bunnei | 2020-05-27 | 2 | -2/+3 |
|\ \ \ \ \ \ | |_|/ / / / |/| | | | | | maxwell_to_vk: Add format B8G8R8A8_SRGB and add Attachable capability for B8G8R8A8_UNORM | ||||
| * | | | | | maxwell_to_vk: Add format B8G8R8A8_SRGB | Morph | 2020-05-18 | 2 | -2/+3 |
| | |/ / / | |/| | | | | | | | | | | | | | | | | | | Add format B8G8R8A8_SRGB and add Attachable capability for B8G8R8A8_UNORM Used by Bravely Default II | ||||
* | | | | | Merge pull request #3981 from ReinUsesLisp/bar | bunnei | 2020-05-26 | 4 | -0/+33 |
|\ \ \ \ \ | | | | | | | | | | | | | shader/other: Implement BAR.SYNC 0x0 | ||||
| * | | | | | shader/other: Implement BAR.SYNC 0x0 | ReinUsesLisp | 2020-05-22 | 4 | -0/+33 |
| |/ / / / | | | | | | | | | | | | | | | | | | | | | Trivially implement this particular case of BAR. Unless games use OpenCL or CUDA barriers, we shouldn't hit any other case here. | ||||
* | | | | | Merge pull request #3980 from ReinUsesLisp/red-op | bunnei | 2020-05-26 | 1 | -2/+1 |
|\ \ \ \ \ | |_|/ / / |/| | | | | shader/memory: Implement non-addition operations in RED | ||||
| * | | | | shader/memory: Implement non-addition operations in RED | ReinUsesLisp | 2020-05-22 | 1 | -2/+1 |
| |/ / / | | | | | | | | | | | | | Trivially implement these instructions. They are used in Astral Chain. | ||||
* | | | | Merge pull request #3978 from ReinUsesLisp/write-rz | bunnei | 2020-05-26 | 2 | -4/+7 |
|\ \ \ \ | | | | | | | | | | | shader_decompiler: Visit source nodes even when they assign to RZ | ||||
| * | | | | shader_decompiler: Visit source nodes even when they assign to RZ | ReinUsesLisp | 2020-05-22 | 2 | -2/+6 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Some operations like atomicMin were ignored because they returned were being stored to RZ. This operations have a side effect and it was being ignored. | ||||
| * | | | | vk_shader_decompiler: Don't assert for void returns | ReinUsesLisp | 2020-05-22 | 1 | -2/+1 |
| |/ / / | | | | | | | | | | | | | | | | | Atomic instructions can be used without returning anything and this is valid code. Remove the assert. | ||||
* | | | | Merge pull request #3905 from FernandoS27/vulkan-fix | bunnei | 2020-05-24 | 4 | -7/+62 |
|\ \ \ \ | | | | | | | | | | | Correct a series of crashes and intructions on Async GPU and Vulkan Pipeline | ||||
| * | | | | RasterizerCache: Correct documentation. | Fernando Sahmkow | 2020-05-10 | 1 | -2/+2 |
| | | | | | |||||
| * | | | | VkPipelineCache: Use a null shader on invalid address. | Fernando Sahmkow | 2020-05-10 | 1 | -2/+1 |
| | | | | | |||||
| * | | | | VideoCore: Use SyncGuestMemory mechanism for Shader/Pipeline Cache invalidation. | Fernando Sahmkow | 2020-05-10 | 3 | -5/+61 |
| | | | | | |||||
* | | | | | Merge pull request #3964 from ReinUsesLisp/arb-integration | bunnei | 2020-05-24 | 12 | -109/+339 |
|\ \ \ \ \ | | | | | | | | | | | | | renderer_opengl: Add assembly program code paths | ||||
| * | | | | | renderer_opengl: Add assembly program code paths | ReinUsesLisp | 2020-05-19 | 12 | -109/+339 |
| | |/ / / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Add code required to use OpenGL assembly programs based on NV_gpu_program5. Decompilation for ARB programs is intended to be added in a follow up commit. This does **not** include ARB decompilation and it's not in an usable state. The intention behind assembly programs is to reduce shader stutter significantly on drivers supporting NV_gpu_program5 (and other required extensions). Currently only Nvidia's proprietary driver supports these extensions. Add a UI option hidden for now to avoid people enabling this option accidentally. This code path has some limitations that OpenGL compatibility doesn't have: - NV_shader_storage_buffer_object is limited to 16 entries for a single OpenGL context state (I don't know if this is an intended limitation, an specification issue or I am missing something). Currently causes issues on The Legend of Zelda: Link's Awakening. - NV_parameter_buffer_object can't bind buffers using an offset different to zero. The used workaround is to copy to a temporary buffer (this doesn't happen often so it's not an issue). On the other hand, it has the following advantages: - Shaders build a lot faster. - We have control over how floating point rounding is done over individual instructions (SPIR-V on Vulkan can't do this). - Operations on shared memory can be unsigned and signed. - Transform feedbacks are dynamic state (not yet implemented). - Parameter buffers (uniform buffers) are per stage, matching NVN and hardware's behavior. - The API to bind and create assembly programs makes sense, unlike ARB_separate_shader_objects. | ||||
* | | | | | Merge pull request #3979 from ReinUsesLisp/thread-group | bunnei | 2020-05-24 | 4 | -0/+72 |
|\ \ \ \ \ | | | | | | | | | | | | | shader/other: Implement thread comparisons (NV_shader_thread_group) | ||||
| * | | | | | shader/other: Implement thread comparisons (NV_shader_thread_group) | ReinUsesLisp | 2020-05-22 | 4 | -0/+72 |
| |/ / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Hardware S2R special registers match gl_Thread*MaskNV. We can trivially implement these using Nvidia's extension on OpenGL or naively stubbing them with the ARB instructions to match. This might cause issues if the host device warp size doesn't match Nvidia's. That said, this is unlikely on proper shaders. Refer to the attached url for more documentation about these flags. https://www.khronos.org/registry/OpenGL/extensions/NV/NV_shader_thread_group.txt | ||||
* | | | | | buffer_cache: Remove unused boost headers | ReinUsesLisp | 2020-05-21 | 1 | -2/+0 |
| | | | | | |||||
* | | | | | map_interval: Add interval allocator and drop hack | ReinUsesLisp | 2020-05-21 | 4 | -3/+79 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Drop the std::list hack to allocate memory indefinitely. Instead use a custom allocator that keeps references valid until destruction. This allocates fixed chunks of memory and puts pointers in a free list. When an allocation is no longer used put it back to the free list, this doesn't heap allocate because std::vector doesn't change the capacity. If the free list is empty, allocate a new chunk. | ||||
* | | | | | buffer_cache: Use boost::container::small_vector for maps in range | ReinUsesLisp | 2020-05-21 | 1 | -13/+15 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Most overlaps in the buffer cache only contain one mapped address. We can avoid close to all heap allocations once the buffer cache is warmed up by using a small_vector with a stack size of one. | ||||
* | | | | | buffer_cache: Use boost::intrusive::set for caching | ReinUsesLisp | 2020-05-21 | 6 | -30/+48 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Instead of using boost::icl::interval_map for caching, use boost::intrusive::set. interval_map is intended as a container where the keys can overlap with one another; we don't need this for caching buffers and a std::set-like data structure that allows us to search with lower_bound is enough. | ||||
* | | | | | buffer_cache: Remove shared pointers | ReinUsesLisp | 2020-05-21 | 2 | -70/+72 |
| | | | | | | | | | | | | | | | | | | | | | | | | | Removing shared pointers is a first step to be able to use intrusive objects and keep allocations close to one another in memory. | ||||
* | | | | | buffer_cache: Minor style changes | ReinUsesLisp | 2020-05-21 | 2 | -129/+65 |
|/ / / / | | | | | | | | | | | | | | | | | Minor style changes. Mostly done so I avoid editing it while doing other changes. | ||||
* | | | | DmaPusher: Remove dead code in step | David Marcec | 2020-05-16 | 2 | -9/+1 |
| | | | | |||||
* | | | | vk_rasterizer: Match OpenGL's FlushAndInvalidate behavior | ReinUsesLisp | 2020-05-16 | 1 | -1/+3 |
| | | | | | | | | | | | | | | | | | | | | Match OpenGL's behavior. This can fix or simplify bisecting issues on Vulkan. | ||||
* | | | | Merge pull request #3899 from ReinUsesLisp/float-comparisons | bunnei | 2020-05-13 | 7 | -136/+173 |
|\ \ \ \ | |_|/ / |/| | | | shader_ir: Add separate instructions for ordered and unordered comparisons and fix NE on GLSL | ||||
| * | | | gl_shader_decompiler: Properly emulate NaN behaviour on NE | ReinUsesLisp | 2020-05-10 | 1 | -0/+9 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | "Not equal" operators on GLSL seem to behave as unordered when we expect an ordered comparison. Manually emulate this checking for LGE values (numbers, not-NaNs). | ||||
| * | | | shader_ir: Separate float-point comparisons in ordered and unordered | ReinUsesLisp | 2020-05-09 | 7 | -135/+163 |
| | | | | | | | | | | | | | | | | | | | | This allows us to use native SPIR-V instructions without having to manually check for NAN. | ||||
* | | | | Merge pull request #3816 from ReinUsesLisp/vk-rasterizer-enable | bunnei | 2020-05-12 | 3 | -1/+3 |
|\ \ \ \ | |_|/ / |/| | | | vk_graphics_pipeline: Implement rasterizer_enable on Vulkan | ||||
| * | | | vk_graphics_pipeline: Implement rasterizer_enable on Vulkan | ReinUsesLisp | 2020-05-02 | 3 | -1/+3 |
| | | | | | | | | | | | | | | | | | | | | We can simply enable rasterizer discard matching the current pipeline key. | ||||
* | | | | Merge pull request #3839 from Morph1984/r8g8ui | Rodrigo Locatti | 2020-05-09 | 8 | -38/+52 |
|\ \ \ \ | | | | | | | | | | | texture: Implement R8G8UI | ||||
| * | | | | texture: Implement R8G8UI | Morph | 2020-04-30 | 8 | -38/+52 |
| | | | | | | | | | | | | | | | | | | | | - Used by The Walking Dead: The Final Season | ||||
* | | | | | Merge pull request #3842 from makigumo/maxwell_to_vk_vertexattribute_signed_int | bunnei | 2020-05-09 | 1 | -2/+14 |
|\ \ \ \ \ | | | | | | | | | | | | | maxwell_to_vk: implement missing signed int formats | ||||
| * | | | | | maxwell_to_vk: implement missing signed int formats | Dan | 2020-04-30 | 1 | -2/+14 |
| | | | | | | |||||
* | | | | | | Merge pull request #3885 from ReinUsesLisp/viewport-swizzles | bunnei | 2020-05-08 | 11 | -1/+121 |
|\ \ \ \ \ \ | |_|_|_|/ / |/| | | | | | video_core: Implement viewport swizzles with NV_viewport_swizzle | ||||
| * | | | | | vk_graphics_pipeline: Implement viewport swizzles with NV_viewport_swizzle | ReinUsesLisp | 2020-05-04 | 8 | -0/+84 |
| | | | | | | |||||
| * | | | | | gl_rasterizer: Implement viewport swizzles with NV_viewport_swizzle | ReinUsesLisp | 2020-05-04 | 2 | -0/+13 |
| | | | | | | |||||
| * | | | | | maxwell_3d: Add viewport swizzles | ReinUsesLisp | 2020-05-04 | 2 | -1/+24 |
| | | | | | | |||||
* | | | | | | Merge pull request #3884 from ReinUsesLisp/border-colors | bunnei | 2020-05-07 | 3 | -2/+44 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | vk_sampler_cache: Use VK_EXT_custom_border_color when available | ||||
| * | | | | | | vk_sampler_cache: Use VK_EXT_custom_border_color when available | ReinUsesLisp | 2020-05-05 | 3 | -2/+44 |
| |/ / / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This should fix grass interactions on Breath of the Wild on Vulkan. It is currently untested against validation layers. Nvidia's Windows 443.09 beta driver or Linux 440.66.12 is required for now. | ||||
* | | | | | | Merge pull request #3815 from FernandoS27/command-list-2 | bunnei | 2020-05-05 | 18 | -66/+198 |
|\ \ \ \ \ \ | |/ / / / / |/| | | | | | GPU: More optimizations to GPU Command List Processing and DMA Copy Optimizations | ||||
| * | | | | | Update src/video_core/gpu.cpp | bunnei | 2020-05-05 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | Co-authored-by: David <25727384+ogniK5377@users.noreply.github.com> | ||||
| * | | | | | Update src/video_core/gpu.cpp | bunnei | 2020-05-05 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | Co-authored-by: David <25727384+ogniK5377@users.noreply.github.com> | ||||
| * | | | | | Clang Format and Documentation. | Fernando Sahmkow | 2020-04-28 | 10 | -10/+20 |
| | | | | | | |||||
| * | | | | | MaxwellDMA: Optimize micro copies. | Fernando Sahmkow | 2020-04-28 | 3 | -0/+57 |
| | | | | | | |||||
| * | | | | | VideoCore/GPU: Delegate subchannel engines to the dma pusher. | Fernando Sahmkow | 2020-04-28 | 3 | -4/+49 |
| | | | | | | |||||
| * | | | | | VideoCore/Engines: Refactor Engines CallMethod. | Fernando Sahmkow | 2020-04-28 | 13 | -62/+82 |
| | | | | | | |||||
* | | | | | | Merge pull request #3808 from ReinUsesLisp/wait-for-idle | bunnei | 2020-05-03 | 13 | -16/+57 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | {maxwell_3d,buffer_cache}: Implement memory barriers using 3D registers | ||||
| * | | | | | | {maxwell_3d,buffer_cache}: Implement memory barriers using 3D registers | ReinUsesLisp | 2020-04-28 | 13 | -16/+57 |
| |/ / / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Drop MemoryBarrier from the buffer cache and use Maxwell3D's register WaitForIdle. To implement this on OpenGL we just call glMemoryBarrier with the necessary bits. Vulkan lacks this synchronization primitive, so we set an event and immediately wait for it. This is not a pretty solution, but it's what Vulkan can do without submitting the current command buffer to the queue (which ends up being more expensive on the CPU). | ||||
* | | | | | | Merge pull request #3732 from lioncash/header | bunnei | 2020-05-02 | 31 | -59/+3 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | vulkan: Remove unnecessary includes | ||||
| * | | | | | | vulkan: Remove unnecessary includes | Lioncash | 2020-04-29 | 31 | -59/+3 |
| | |_|/ / / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Reduces some header churn and reduces rebuilds when some header internals change. While we're at it we can also resolve a missing include in buffer_cache. | ||||
* | | | | | | Merge pull request #3809 from ReinUsesLisp/empty-index | bunnei | 2020-05-02 | 1 | -0/+3 |
|\ \ \ \ \ \ | |_|_|_|/ / |/| | | | | | vk_rasterizer: Skip index buffer setup when vertices are zero | ||||
| * | | | | | vk_rasterizer: Skip index buffer setup when vertices are zero | ReinUsesLisp | 2020-04-28 | 1 | -0/+3 |
| | |/ / / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Xenoblade 2 invokes a draw call with zero vertices. This is likely due to indirect drawing (glDrawArraysIndirect). This causes a crash in the staging buffer pool when trying to create a buffer with a size of zero. To workaround this, skip index buffer setup entirely when the number of indices is zero. | ||||
* | | | | | Merge pull request #3693 from ReinUsesLisp/clean-samplers | bunnei | 2020-05-02 | 13 | -340/+263 |
|\ \ \ \ \ | | | | | | | | | | | | | shader/texture: Support multiple unknown sampler properties | ||||
| * | | | | | shader/texture: Support multiple unknown sampler properties | ReinUsesLisp | 2020-04-23 | 2 | -62/+87 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This allows deducing some properties from the texture instruction before asking the runtime. By doing this we can handle type mismatches in some instructions from the renderer instead of the shader decoder. Fixes texelFetch issues with games using 2D texture instructions on a 1D sampler. | ||||
| * | | | | | shader_ir: Turn classes into data structures | ReinUsesLisp | 2020-04-23 | 13 | -299/+197 |
| | | | | | | |||||
* | | | | | | fixed_pipeline_state: explicitly use template keyword after 1f345ebe3a55 | Jan Beich | 2020-05-02 | 1 | -2/+4 |
| |_|_|/ / |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | In file included from src/video_core/renderer_opengl/renderer_opengl.cpp:25: In file included from src/./video_core/renderer_opengl/gl_rasterizer.h:26: In file included from src/./video_core/renderer_opengl/gl_fence_manager.h:11: src/./video_core/fence_manager.h:91:32: error: use 'template' keyword to treat 'Write' as a dependent template name memory_manager.Write<u32>(current_fence->GetAddress(), current_fence->GetPayload()); ^ template src/./video_core/fence_manager.h:137:32: error: use 'template' keyword to treat 'Write' as a dependent template name memory_manager.Write<u32>(current_fence->GetAddress(), current_fence->GetPayload()); ^ template | ||||
* | | | | | Merge pull request #3807 from ReinUsesLisp/fix-depth-clamp | bunnei | 2020-04-30 | 5 | -8/+5 |
|\ \ \ \ \ | | | | | | | | | | | | | maxwell_3d: Fix depth clamping register | ||||
| * | | | | | maxwell_3d: Fix depth clamping register | ReinUsesLisp | 2020-04-28 | 5 | -8/+5 |
| | |/ / / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Using deko3d as reference: https://github.com/devkitPro/deko3d/blob/4e47ba0013552e592a86ab7a2510d1e7dadf236a/source/maxwell/gpu_3d_state.cpp#L42 We were using bits 3 and 4 to determine depth clamping, but these are the same both enabled and disabled: state->depthClampEnable ? 0x101A : 0x181D The same happens on Nvidia's OpenGL driver, where they do something like this (default capabilities, GL 4.5 compatibility): (state & DEPTH_CLAMP) != 0 ? 0x201a : 0x281c There's always a difference between the first bits in this register, but bit 11 is consistently disabled on both deko3d/NVN and OpenGL. This commit changes yuzu's behaviour to use bit 11 to determine depth clamping. - Fixes depth issues on Super Mario Odyssey's intro. | ||||
* | | | | | Merge pull request #3799 from ReinUsesLisp/iadd-cc | bunnei | 2020-04-30 | 6 | -27/+83 |
|\ \ \ \ \ | | | | | | | | | | | | | shader: Implement P2R CC, IADD Rd.CC and IADD.X | ||||
| * | | | | | shader/arithmetic_integer: Fix tracking issue in temporary | ReinUsesLisp | 2020-04-28 | 1 | -4/+0 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This temporary is not needed as we mark Rd.CC + IADD.X as unimplemented. It caused issues when tracking global buffers. | ||||
| * | | | | | shader/arithmetic_integer: Fix edge case and mark IADD.X Rd.CC as unimplemented | ReinUsesLisp | 2020-04-26 | 1 | -1/+6 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | IADD.X Rd.CC requires some extra logic that is not currently implemented. Abort when this is hit. | ||||
| * | | | | | shader/arithmetic_integer: Change IAdd to UAdd to avoid signed overflow | ReinUsesLisp | 2020-04-26 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Signed integer addition overflow might be undefined behavior. It's free to change operations to UAdd and use unsigned integers to avoid potential bugs. | ||||
| * | | | | | shader/arithmetic_integer: Implement IADD.X | ReinUsesLisp | 2020-04-26 | 2 | -0/+10 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | IADD.X takes the carry flag and adds it to the result. This is generally used to emulate 64-bit operations with 32-bit registers. | ||||
| * | | | | | shader/arithmetic_integer: Implement CC for IADD | ReinUsesLisp | 2020-04-26 | 4 | -3/+42 |
| | | | | | | |||||
| * | | | | | decode/register_set_predicate: Implement CC | ReinUsesLisp | 2020-04-26 | 1 | -9/+14 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | P2R CC takes the state of condition codes and puts them into a register. We already have this implemented for PR (predicates). This commit implements CC over that. | ||||
| * | | | | | decode/register_set_predicate: Use move for shared pointers | ReinUsesLisp | 2020-04-26 | 1 | -16/+17 |
| | | | | | | | | | | | | | | | | | | | | | | | | Avoid atomic counters used by shared pointers. | ||||
* | | | | | | Merge pull request #3805 from ReinUsesLisp/preserve-contents | bunnei | 2020-04-30 | 4 | -41/+81 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | texture_cache: Reintroduce preserve_contents accurately | ||||
| * | | | | | | texture_cache: Reintroduce preserve_contents accurately | ReinUsesLisp | 2020-04-27 | 4 | -41/+81 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This reverts commit 94b0e2e5dae4e0bd0021ac2d8fe1ff904a93ee69. preserve_contents proved to be a meaningful optimization. This commit reintroduces it but properly implemented on OpenGL. We have to make sure the clear removes all the previous contents of the image. It's not currently implemented on Vulkan because we can do smart things there that's preferred to be introduced in a separate commit. | ||||
* | | | | | | | Merge pull request #3788 from FernandoS27/revert | bunnei | 2020-04-30 | 1 | -14/+6 |
|\ \ \ \ \ \ \ | |_|_|_|_|/ / |/| | | | | | | Revert: shader_decode: Fix LD, LDG when track constant buffer. | ||||
| * | | | | | | Revert: shader_decode: Fix LD, LDG when track constant buffer. | Fernando Sahmkow | 2020-04-24 | 1 | -14/+6 |
| | |_|_|/ / | |/| | | | | |||||
* | | | | | | Merge pull request #3784 from ReinUsesLisp/shader-memory-util | bunnei | 2020-04-28 | 9 | -159/+153 |
|\ \ \ \ \ \ | |_|_|_|/ / |/| | | | | | shader/memory_util: Deduplicate code | ||||
| * | | | | | shader/memory_util: Deduplicate code | ReinUsesLisp | 2020-04-26 | 9 | -159/+153 |
| |/ / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Deduplicate code shared between vk_pipeline_cache and gl_shader_cache as well as shader decoder code. While we are at it, fix a bug in gl_shader_cache where compute shaders had an start offset of a stage shader. | ||||
* | | | | | Merge pull request #3766 from ReinUsesLisp/renderpass-cache-key | Fernando Sahmkow | 2020-04-27 | 8 | -126/+129 |
|\ \ \ \ \ | | | | | | | | | | | | | vk_renderpass_cache: Pack renderpass cache key and unify keys | ||||
| * | | | | | vk_rasterizer: Pack texceptions and color formats on invalid formats | ReinUsesLisp | 2020-04-25 | 2 | -5/+19 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Sometimes for unknown reasons NVN games can bind a render target format of 0. This may be a yuzu bug. With the commits before this the formats were specified without being "packed", assuming all formats and texceptions will be written like in the color_attachments vector. To address this issue, iterate all render targets and pack them as they are valid. This way they will match color_attachments. - Fixes validation errors and graphical issues on Breath of the Wild. | ||||
| * | | | | | vk_rasterizer: Fix framebuffer creation validation errors | ReinUsesLisp | 2020-04-23 | 1 | -2/+4 |
| | | | | | | | | | | | | | | | | | | | | | | | | Framebuffer creation was ignoring the number of color attachments. | ||||
| * | | | | | vk_pipeline_cache: Unify pipeline cache keys into a single operation | ReinUsesLisp | 2020-04-23 | 5 | -47/+59 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This allows us to call Common::CityHash and std::memcmp only once for GraphicsPipelineCacheKey. While we are at it, do the same for compute. | ||||
| * | | | | | vk_renderpass_cache: Pack renderpass cache key to 12 bytes | ReinUsesLisp | 2020-04-23 | 4 | -84/+59 |
| |/ / / / | |||||
* | | | | | Merge pull request #3756 from ReinUsesLisp/integrated-devices | Fernando Sahmkow | 2020-04-27 | 5 | -35/+6 |
|\ \ \ \ \ | | | | | | | | | | | | | vk_memory_manager: Remove unified memory model flag | ||||
| * | | | | | vk_memory_manager: Remove unified memory model flag | ReinUsesLisp | 2020-04-22 | 5 | -35/+6 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | All drivers (even Intel) seem to have a device local memory type that is not host visible. Remove this flag so all devices follow the same path. This fixes a crash when trying to map to host device local memory on integrated devices. | ||||
* | | | | | | Merge pull request #3742 from FernandoS27/command-list | bunnei | 2020-04-27 | 14 | -15/+198 |
|\ \ \ \ \ \ | |_|_|/ / / |/| | | | | | Optimize GPU Command Lists and Introduce Fast GPU Time Option | ||||
| * | | | | | Clang Format. | Fernando Sahmkow | 2020-04-23 | 5 | -10/+17 |
| | | | | | | |||||
| * | | | | | GPU: Add Fast GPU Time Option. | Fernando Sahmkow | 2020-04-23 | 1 | -1/+5 |
| | | | | | | |||||
| * | | | | | Maxwell3D: Process Macros on MultiMethod. | Fernando Sahmkow | 2020-04-23 | 1 | -25/+47 |
| | | | | | | |||||
| * | | | | | DMAPusher: Propagate multimethod writes into the engines. | Fernando Sahmkow | 2020-04-23 | 14 | -14/+164 |
| | | | | | | |||||
* | | | | | | Merge pull request #3753 from ReinUsesLisp/ac-vulkan | Rodrigo Locatti | 2020-04-26 | 5 | -12/+53 |
|\ \ \ \ \ \ | |_|_|_|/ / |/| | | | | | {gl,vk}_rasterizer: Add lazy default buffer maker and use it for empty buffers | ||||
| * | | | | | vk_rasterizer: Add lazy default buffer maker and use it for empty buffers | ReinUsesLisp | 2020-04-22 | 3 | -4/+40 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Introduce a default buffer getter that lazily constructs an empty buffer. This is intended to match OpenGL's buffer 0. Use this for disabled vertex and uniform buffers. While we are at it, include vertex buffer usages for staging buffers to silence validation errors. | ||||
| * | | | | | gl_rasterizer: Fix buffers without size | ReinUsesLisp | 2020-04-22 | 3 | -8/+13 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | On NVN buffers can be enabled but have no size. According to deko3d and the behavior we see in Animal Crossing: New Horizons these buffers get the special address of 0x1000 and limit themselves to 0xfff. Implement buffers without a size by binding a null buffer to OpenGL without a side. https://github.com/devkitPro/deko3d/blob/1d1930beea093b5a663419e93b0649719a3ca5da/source/maxwell/gpu_3d_vbo.cpp#L62-L63 | ||||
* | | | | | | Merge pull request #3721 from ReinUsesLisp/sort-devices | bunnei | 2020-04-25 | 1 | -1/+20 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | vulkan/wrapper: Sort physical devices | ||||
| * | | | | | | vulkan/wrapper: Sort physical devices | ReinUsesLisp | 2020-04-19 | 1 | -1/+20 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Sort discrete GPUs over the rest, Nvidia over AMD, AMD over Intel, Intel over the rest. This gives us a somewhat consistent order when Optimus is removed (renderdoc does this when it's attached). This can break the configuration of users with an Intel GPU that manually remove Optimus on yuzu. That said, it's a very unlikely to happen. | ||||
* | | | | | | | Merge pull request #3734 from ReinUsesLisp/half-float-mods | bunnei | 2020-04-25 | 2 | -16/+37 |
|\ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | decode/arithmetic_half: Fix HADD2 and HMUL2 absolute and negation bits | ||||
| * | | | | | | | decode/arithmetic_half: Fix HADD2 and HMUL2 absolute and negation bits | ReinUsesLisp | 2020-04-23 | 2 | -16/+37 |
| |/ / / / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The encoding for negation and absolute value was wrong. Extracting is now done manually. Similar instructions having different encodings is the rule, not the exception. To keep sanity and readability I preferred to extract the desired bit manually. This is implemented against nxas: https://github.com/ReinUsesLisp/nxas/blob/8dbc38995711cc12206aa370145a3a02665fd989/table.h#L68 That is itself tested against nvdisasm (Nvidia's official disassembler). | ||||
* | | | | | | | Merge pull request #3749 from ReinUsesLisp/lea-imm | bunnei | 2020-04-24 | 1 | -2/+2 |
|\ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | shader/arithmetic_integer: Fix LEA_IMM encoding | ||||
| * | | | | | | | shader/arithmetic_integer: Fix LEA_IMM encoding | ReinUsesLisp | 2020-04-21 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The operand order in LEA_IMM was flipped compared to nvdisasm. Fix that using nxas as reference: https://github.com/ReinUsesLisp/nxas/blob/8dbc38995711cc12206aa370145a3a02665fd989/table.h#L122 | ||||
* | | | | | | | | Fix -Wdeprecated-copy warning. | Markus Wick | 2020-04-24 | 1 | -0/+1 |
| | | | | | | | | |||||
* | | | | | | | | Fix -Werror=conversion error. | Markus Wick | 2020-04-24 | 1 | -1/+1 |
| |_|_|_|_|/ / |/| | | | | | | |||||
* | | | | | | | kernel: memory: Improve implementation of device shared memory. (#3707) | bunnei | 2020-04-23 | 1 | -13/+5 |
| |_|_|/ / / |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * kernel: memory: Improve implementation of device shared memory. * fixup! kernel: memory: Improve implementation of device shared memory. * fixup! kernel: memory: Improve implementation of device shared memory. | ||||
* | | | | | | Merge pull request #3697 from lioncash/declarations | bunnei | 2020-04-23 | 4 | -6/+8 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | CMakeLists: Enable -Wmissing-declarations on Linux builds | ||||
| * | | | | | | General: Resolve warnings related to missing declarations | Lioncash | 2020-04-17 | 4 | -6/+8 |
| | | | | | | | |||||
* | | | | | | | Merge pull request #3677 from FernandoS27/better-sync | bunnei | 2020-04-23 | 32 | -41/+1118 |
|\ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | Introduce Predictive Flushing and Improve ASYNC GPU | ||||
| * | | | | | | | GL_Fence_Manager: use GL_TIMEOUT_IGNORED instead of a loop, | Fernando Sahmkow | 2020-04-23 | 1 | -2/+1 |
| | | | | | | | | |||||
| * | | | | | | | Address Feedback. | Fernando Sahmkow | 2020-04-22 | 3 | -24/+18 |
| | | | | | | | | |||||
| * | | | | | | | Async GPU: Correct flushing behavior to be similar to old async GPU behavior. | Fernando Sahmkow | 2020-04-22 | 3 | -0/+11 |
| | | | | | | | | |||||
| * | | | | | | | MaxwellDMA: Correct copying on accuracy level. | Fernando Sahmkow | 2020-04-22 | 1 | -2/+7 |
| | | | | | | | | |||||
| * | | | | | | | ShaderCache/PipelineCache: Cache null shaders. | Fernando Sahmkow | 2020-04-22 | 4 | -8/+31 |
| | | | | | | | | |||||
| * | | | | | | | Address Feedback. | Fernando Sahmkow | 2020-04-22 | 13 | -132/+117 |
| | | | | | | | | |||||
| * | | | | | | | Fix GCC error. | Fernando Sahmkow | 2020-04-22 | 2 | -6/+5 |
| | | | | | | | | |||||
| * | | | | | | | QueryCache: Only do async flushes on async gpu. | Fernando Sahmkow | 2020-04-22 | 1 | -1/+4 |
| | | | | | | | | |||||
| * | | | | | | | Async GPU: Only do reactive flushing on Extreme Level. | Fernando Sahmkow | 2020-04-22 | 1 | -1/+1 |
| | | | | | | | | |||||
| * | | | | | | | vk_fence_manager: Initial implementation | ReinUsesLisp | 2020-04-22 | 8 | -12/+222 |
| | | | | | | | | |||||
| * | | | | | | | QueryCache: Implement Async Flushes. | Fernando Sahmkow | 2020-04-22 | 5 | -12/+77 |
| | | | | | | | | |||||
| * | | | | | | | OpenGL: Guarantee writes to Buffers. | Fernando Sahmkow | 2020-04-22 | 3 | -4/+2 |
| | | | | | | | | |||||
| * | | | | | | | GPU: Implement Flush Requests for Async mode. | Fernando Sahmkow | 2020-04-22 | 6 | -8/+70 |
| | | | | | | | | |||||
| * | | | | | | | FenceManager: Manage syncpoints and rename fences to semaphores. | Fernando Sahmkow | 2020-04-22 | 11 | -25/+123 |
| | | | | | | | | |||||
| * | | | | | | | BufferCache: Refactor async managing. | Fernando Sahmkow | 2020-04-22 | 2 | -10/+27 |
| | | | | | | | | |||||
| * | | | | | | | FenceManager: Implement async buffer cache flushes on High settings | Fernando Sahmkow | 2020-04-22 | 6 | -10/+69 |
| | | | | | | | | |||||
| * | | | | | | | Rasterizer: Document SignalFence & ReleaseFences and setup skeletons on Vulkan. | Fernando Sahmkow | 2020-04-22 | 5 | -4/+35 |
| | | | | | | | | |||||
| * | | | | | | | GPU: Fix rebase errors. | Fernando Sahmkow | 2020-04-22 | 2 | -4/+4 |
| | | | | | | | | |||||
| * | | | | | | | Rasterizer: Disable fence managing in synchronous gpu. | Fernando Sahmkow | 2020-04-22 | 2 | -1/+11 |
| | | | | | | | | |||||
| * | | | | | | | ThreadManager: Sync async reads on accurate gpu. | Fernando Sahmkow | 2020-04-22 | 9 | -8/+48 |
| | | | | | | | | |||||
| * | | | | | | | FenceManager: Implement should wait. | Fernando Sahmkow | 2020-04-22 | 2 | -2/+17 |
| | | | | | | | | |||||
| * | | | | | | | GPU: Implement a Fence Manager. | Fernando Sahmkow | 2020-04-22 | 6 | -23/+208 |
| | | | | | | | | |||||
| * | | | | | | | OpenGL: Implement Fencing backend. | Fernando Sahmkow | 2020-04-22 | 12 | -19/+94 |
| | | | | | | | | |||||
| * | | | | | | | TextureCache: Flush linear textures after finishing rendering. | Fernando Sahmkow | 2020-04-22 | 1 | -2/+8 |
| | | | | | | | | |||||
| * | | | | | | | GPU: Delay Fences. | Fernando Sahmkow | 2020-04-22 | 6 | -2/+20 |
| | | | | | | | | |||||
| * | | | | | | | BufferCache: Implement OnCPUWrite and SyncGuestHost | Fernando Sahmkow | 2020-04-22 | 6 | -7/+67 |
| | | | | | | | | |||||
| * | | | | | | | GPU: Refactor synchronization on Async GPU | Fernando Sahmkow | 2020-04-22 | 11 | -7/+56 |
| | | | | | | | | |||||
| * | | | | | | | Texture Cache: Implement OnCPUWrite and SyncGuestHost | Fernando Sahmkow | 2020-04-22 | 2 | -3/+63 |
| | | | | | | | | |||||
| * | | | | | | | UI: Replasce accurate GPU option for GPU Accuracy Level | Fernando Sahmkow | 2020-04-22 | 3 | -6/+6 |
| | |_|_|_|/ / | |/| | | | | | |||||
* | | | | | | | vk_pipeline_cache: Fix unintentional memcpy into optional | ReinUsesLisp | 2020-04-23 | 1 | -2/+4 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The intention behind this was to assign a float to from an uint32_t, but it was unintentionally being copied directly into the std::optional. Copy to a temporary and assign that temporary to std::optional. This can be replaced with std::bit_cast<float> once we are in C++20. | ||||
* | | | | | | | Merge pull request #3653 from ReinUsesLisp/nsight-aftermath | Fernando Sahmkow | 2020-04-22 | 12 | -77/+389 |
|\ \ \ \ \ \ \ | |/ / / / / / |/| | | | | | | renderer_vulkan: Integrate Nvidia Nsight Aftermath on Windows | ||||
| * | | | | | | renderer_vulkan: Integrate Nvidia Nsight Aftermath on Windows | ReinUsesLisp | 2020-04-14 | 9 | -22/+360 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Adds optional support for Nsight Aftermath. It is enabled through ENABLE_NSIGHT_AFTERMATH in cmake. A path to the SDK has to be provided by the environment variable NSIGHT_AFTERMATH_SDK. Nsight Aftermath allows an application to generate "minidumps" of the GPU state when a device loss happens. By analysing these on Nsight we can know what a game was doing and why it triggered a device loss. The dump is generated inside %APPDATA%\yuzu\log\gpucrash and this directory is deleted every time a new instance is initialized with Nsight enabled. To enable it on yuzu there has a to be a driver and device capable of running Nsight Aftermath on Vulkan. That means only Turing based GPUs on the latest stable driver, beta drivers won't work for now. It is manually enabled in Configuration>Debug>Enable Graphics Debugging because when using all debugging capabilities there is a runtime cost. | ||||
| * | | | | | | renderer_vulkan: Remove Nvidia checkpoints | ReinUsesLisp | 2020-04-13 | 4 | -34/+0 |
| | | | | | | | |||||
| * | | | | | | renderer_vulkan: Catch device losses in more places | ReinUsesLisp | 2020-04-13 | 3 | -21/+29 |
| | | | | | | | |||||
* | | | | | | | Merge pull request #3714 from lioncash/copies | bunnei | 2020-04-22 | 1 | -3/+3 |
|\ \ \ \ \ \ \ | |_|_|_|_|/ / |/| | | | | | | gl_shader_decompiler: Avoid copies where applicable | ||||
| * | | | | | | gl_shader_decompiler: Avoid copies where applicable | Lioncash | 2020-04-18 | 1 | -3/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Avoids unnecessary reference count increments where applicable and also avoids reallocating a vector. Unlikely to make a huge difference, but given how trivial of an amendment it is, why not? | ||||
* | | | | | | | Merge pull request #3718 from ReinUsesLisp/better-pipeline-state | Rodrigo Locatti | 2020-04-21 | 6 | -479/+531 |
|\ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | fixed_pipeline_state: Pack structure, use memcmp and CityHash on it | ||||
| * | | | | | | | fixed_pipeline_state: Hash and compare the whole structure | ReinUsesLisp | 2020-04-19 | 2 | -105/+9 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Pad FixedPipelineState's size to 384 bytes to be a multiple of 16. Compare the whole struct with std::memcmp and hash with CityHash. Using CityHash instead of a naive hash should reduce the number of collisions. Improve used type traits to ensure this operation is safe. With these changes the improvements to the hashable pipeline state are: Optimized structure Hash: 89 ns Comparison: 103 ns Construction*: 164 ns Struct size: 384 bytes Original structure Hash: 148 ns Equal: 174 ns Construction*: 281 ns Size: 1384 bytes * Attribute state initialization is not measured These measures are averages taken with std::chrono::high_accuracy_clock on MSVC shipped on Visual Studio 16.6.0 Preview 2.1. | ||||
| * | | | | | | | fixed_pipeline_state: Pack blending state | ReinUsesLisp | 2020-04-19 | 3 | -98/+227 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Reduce FixedPipelineState's size to 364 bytes. | ||||
| * | | | | | | | fixed_pipeline_state: Pack rasterizer state | ReinUsesLisp | 2020-04-19 | 4 | -163/+155 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Reduce FixedPipelineState's size to 600 bytes. | ||||
| * | | | | | | | fixed_pipeline_state: Pack depth stencil state | ReinUsesLisp | 2020-04-19 | 3 | -97/+140 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Reduce FixedPipelineState's size to 632 bytes. | ||||
| * | | | | | | | fixed_pipeline_state: Pack attribute state | ReinUsesLisp | 2020-04-19 | 6 | -101/+85 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Reduce FixedPipelineState's size from 1384 to 664 bytes | ||||
* | | | | | | | | Merge pull request #3698 from lioncash/warning | bunnei | 2020-04-21 | 2 | -12/+13 |
|\ \ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | | | General: Resolve minor assorted warnings | ||||
| * | | | | | | | | decode/memory: Resolve unused variable warning | Lioncash | 2020-04-17 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Only the first element of the returned pair is ever used. | ||||
| * | | | | | | | | decode/texture: Resolve unused variable warnings. | Lioncash | 2020-04-17 | 1 | -5/+7 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Some variables aren't used, so we can remove these. Unfortunately, diagnostics are still reported on structured bindings even when annotated with [[maybe_unused]], so we need to unpack the elements that we want to use manually. | ||||
| * | | | | | | | | decode/texture: Collapse loop down into std::generate | Lioncash | 2020-04-17 | 1 | -3/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Same behavior, less code. | ||||
| * | | | | | | | | decode/texture: Eliminate trivial missing field initializer warnings | Lioncash | 2020-04-17 | 1 | -3/+4 |
| | |_|_|/ / / / | |/| | | | | | | | | | | | | | | | | | | | | | | We can just specify the initializers. | ||||
* | | | | | | | | Merge pull request #3695 from ReinUsesLisp/default-attributes | bunnei | 2020-04-21 | 1 | -0/+4 |
|\ \ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | | | maxwell_3d: Initialize format attributes constant as one | ||||
| * | | | | | | | | maxwell_3d: Initialize format attributes constant as one | ReinUsesLisp | 2020-04-17 | 1 | -0/+4 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | nouveau expects this to be true but it doesn't set it. | ||||
* | | | | | | | | | Merge pull request #3733 from ambasta/patch-2 | Mat M | 2020-04-21 | 1 | -1/+1 |
|\ \ \ \ \ \ \ \ \ | |_|_|_|_|_|/ / / |/| | | | | | | | | Initialize quad_indexed_pass before uint8_pass | ||||
| * | | | | | | | | Initialize quad_indexed_pass before uint8_pass | Amit Prakash Ambasta | 2020-04-20 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | Fixes Werror=reorder in gcc | ||||
* | | | | | | | | | Merge pull request #3700 from ReinUsesLisp/stream-buffer-sizes | Fernando Sahmkow | 2020-04-20 | 2 | -33/+48 |
|\ \ \ \ \ \ \ \ \ | |/ / / / / / / / |/| | | | | | | | | vk_stream_buffer: Fix out of memory on boot on recent Nvidia drivers | ||||
| * | | | | | | | | vk_stream_buffer: Fix out of memory on boot on recent Nvidia drivers | ReinUsesLisp | 2020-04-17 | 2 | -33/+48 |
| | |_|/ / / / / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Nvidia recently introduced a new memory type for data streaming (awesome!), but yuzu was assuming that all heaps had enough memory for the assumed stream buffer size (256 MiB). This worked fine on AMD but Nvidia's new memory heap was smaller than 256 MiB. This commit changes this assumption and allocates a bit less than the size of the preferred heap, with a maximum of 256 MiB (to avoid allocating all system memory on integrated devices). - Fixes a crash on NVIDIA 450.82.0.0 | ||||
* | | | | | | | | Merge pull request #3729 from lioncash/globals | Rodrigo Locatti | 2020-04-20 | 3 | -6/+11 |
|\ \ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | | | dma_pusher: Remove reliance on the global system instance | ||||
| * | | | | | | | | dma_pusher: Remove reliance on the global system instance | Lioncash | 2020-04-19 | 3 | -6/+11 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | With this, the video core is now has no calls to the global system instance at all. | ||||
* | | | | | | | | | Merge pull request #3694 from ReinUsesLisp/indexed-quads | bunnei | 2020-04-19 | 5 | -12/+280 |
|\ \ \ \ \ \ \ \ \ | |/ / / / / / / / |/| | | | | | | | | vk_compute_pass: Implement indexed quads | ||||
| * | | | | | | | | vk_compute_pass: Implement indexed quads | ReinUsesLisp | 2020-04-17 | 5 | -12/+280 |
| | |/ / / / / / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Implement indexed quads (GL_QUADS used with glDrawElements*) with a compute pass conversion. The compute shader converts from uint8/uint16/uint32 indices to uint32. The format is passed through push constants to avoid having different variants of the same shader. - Used by Fast RMX - Used by Xenoblade Chronicles 2 (it still has graphical due to synchronization issues on Vulkan) | ||||
* | | | | | | | | Merge pull request #3686 from lioncash/table | bunnei | 2020-04-19 | 1 | -3/+3 |
|\ \ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | | | texture_cache/format_lookup_table: Fix incorrect green, blue, and alpha indices | ||||
| * | | | | | | | | texture_cache/format_lookup_table: Fix incorrect green, blue, and alpha indices | Lioncash | 2020-04-16 | 1 | -3/+3 |
| | |_|_|_|_|_|/ | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Previously these were all using the red component to derive the indices, which is definitely not intentional. | ||||
* | | | | | | | | Merge pull request #3679 from lioncash/track | bunnei | 2020-04-19 | 1 | -5/+6 |
|\ \ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | | | track: Eliminate redundant copies | ||||
| * | | | | | | | | track: Eliminate redundant copies | Lioncash | 2020-04-16 | 1 | -5/+6 |
| |/ / / / / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Two variables can be references, while two others can be std::moved. Makes for 4 less atomic reference count increments and decrements. | ||||
* | | | | | | / | renderer_vulkan: assume X11 if not Windows/macOS after bf1d66b7c074 | Jan Beich | 2020-04-19 | 1 | -3/+3 |
| |_|_|_|_|_|/ |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Render.Vulkan <Error> video_core/renderer_vulkan/renderer_vulkan.cpp:CreateInstance:131: Presentation not supported on this platform Render.Vulkan <Error> video_core/renderer_vulkan/renderer_vulkan.cpp:CreateSurface:378: Presentation not supported on this platform Core <Critical> core/core.cpp:Load:199: Failed to initialize system (Error 5)! | ||||
* | | | | | | | Merge pull request #3716 from bunnei/fix-another-impl-fallthrough | Mat M | 2020-04-18 | 1 | -0/+1 |
|\ \ \ \ \ \ \ | |_|_|_|_|/ / |/| | | | | | | video_core: gl_shader_decompiler: Fix implicit fallthrough errors. | ||||
| * | | | | | | video_core: gl_shader_decompiler: Fix implicit fallthrough errors. | bunnei | 2020-04-18 | 1 | -0/+1 |
| | | | | | | | |||||
* | | | | | | | video_code: Fix implicit switch fallthrough. | Markus Wick | 2020-04-17 | 1 | -0/+2 |
| |_|_|/ / / |/| | | | | | | | | | | | | | | | | | | | | | | | Since yesterday, this breaks the build on linux. So let's fix it. | ||||
* | | | | | | Revert "gl_shader_cache: Use CompileDepth::FullDecompile on GLSL" | Rodrigo Locatti | 2020-04-17 | 1 | -3/+1 |
|/ / / / / | |||||
* | | | | | Merge pull request #3666 from bunnei/new-vmm | bunnei | 2020-04-17 | 4 | -75/+42 |
|\ \ \ \ \ | | | | | | | | | | | | | Implement a new virtual memory manager | ||||
| * | | | | | video_core: memory_manager: Updates for Common::PageTable changes. | bunnei | 2020-04-17 | 2 | -67/+34 |
| | | | | | | |||||
| * | | | | | core: memory: Move to Core::Memory namespace. | bunnei | 2020-04-17 | 2 | -8/+8 |
| | |_|/ / | |/| | | | | | | | | | | | | | - helpful to disambiguate Kernel::Memory namespace. | ||||
* | | | | | Merge pull request #3682 from lioncash/uam | bunnei | 2020-04-17 | 1 | -1/+1 |
|\ \ \ \ \ | |/ / / / |/| | | | | gl_query_cache: Resolve use-after-move in CachedQuery move assignment operator | ||||
| * | | | | gl_query_cache: Resolve use-after-move in CachedQuery move assignment operator | Lioncash | 2020-04-16 | 1 | -1/+1 |
| |/ / / | | | | | | | | | | | | | Avoids potential invalid junk data from being read. | ||||
* | | | | Merge pull request #3673 from lioncash/extra | bunnei | 2020-04-17 | 7 | -14/+20 |
|\ \ \ \ | |_|/ / |/| | | | CMakeLists: Specify -Wextra on linux builds | ||||
| * | | | CMakeLists: Specify -Wextra on linux builds | Lioncash | 2020-04-16 | 7 | -14/+20 |
| |/ / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Allows reporting more cases where logic errors may exist, such as implicit fallthrough cases, etc. We currently ignore unused parameters, since we currently have many cases where this is intentional (virtual interfaces). While we're at it, we can also tidy up any existing code that causes warnings. This also uncovered a few bugs as well. | ||||
* | | | Merge pull request #3600 from ReinUsesLisp/no-pointer-buf-cache | Fernando Sahmkow | 2020-04-17 | 14 | -228/+90 |
|\ \ \ | | | | | | | | | buffer_cache: Return handles instead of pointer to handles | ||||
| * | | | buffer_cache: Return handles instead of pointer to handles | ReinUsesLisp | 2020-04-16 | 14 | -228/+90 |
| | |/ | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | The original idea of returning pointers is that handles can be moved. The problem is that the implementation didn't take that in mind and made everything harder to work with. This commit drops pointer to handles and returns the handles themselves. While it is still true that handles can be invalidated, this way we get an old handle instead of a dangling pointer. This problem can be solved in the future with sparse buffers. | ||||
* | | | Merge pull request #3689 from lioncash/unused-var | Rodrigo Locatti | 2020-04-16 | 1 | -1/+0 |
|\ \ \ | | | | | | | | | decode/shift: Remove unused variable within Shift() | ||||
| * | | | decode/shift: Remove unused variable within Shift() | Lioncash | 2020-04-16 | 1 | -1/+0 |
| | |/ | |/| | | | | | | | | | | Removes a redundant variable that is already satisfied by the IsFull() utility function. | ||||
* | | | Merge pull request #3688 from lioncash/nequal | Rodrigo Locatti | 2020-04-16 | 2 | -0/+5 |
|\ \ \ | | | | | | | | | surface_view: Add missing operator!= to ViewParams | ||||
| * | | | surface_view: Add missing operator!= to ViewParams | Lioncash | 2020-04-16 | 2 | -0/+5 |
| |/ / | | | | | | | | | | Provides logical symmetry to the interface. | ||||
* | | | Merge pull request #3680 from lioncash/static | Rodrigo Locatti | 2020-04-16 | 1 | -1/+1 |
|\ \ \ | | | | | | | | | gl_device: Mark stage_swizzle as constexpr | ||||
| * | | | gl_device: Mark stage_swizzle as constexpr | Lioncash | 2020-04-16 | 1 | -1/+1 |
| |/ / | | | | | | | | | | Previously this was mutable even though it shouldn't be. | ||||
* | | | Merge pull request #3687 from lioncash/constness | Rodrigo Locatti | 2020-04-16 | 1 | -2/+2 |
|\ \ \ | | | | | | | | | surface_base: Make IsInside() a const member function | ||||
| * | | | surface_base: Make IsInside() a const member function | Lioncash | 2020-04-16 | 1 | -2/+2 |
| |/ / | | | | | | | | | | This doesn't modify internal state, so this can be made const. | ||||
* | | | Merge pull request #3685 from lioncash/copies | Rodrigo Locatti | 2020-04-16 | 1 | -3/+3 |
|\ \ \ | | | | | | | | | control_flow: Make use of std::move in TryInspectAddress() | ||||
| * | | | control_flow: Make use of std::move in TryInspectAddress() | Lioncash | 2020-04-16 | 1 | -3/+3 |
| |/ / | | | | | | | | | | Eliminates redundant atomic reference count increments and decrements. | ||||
* | | | Merge pull request #3683 from lioncash/docs | bunnei | 2020-04-16 | 2 | -5/+5 |
|\ \ \ | | | | | | | | | video_core: Amend doxygen comment references | ||||
| * | | | video_core: Amend doxygen comment references | Lioncash | 2020-04-16 | 2 | -5/+5 |
| |/ / | | | | | | | | | | Fixes broken documentation references. | ||||
* | | | Merge pull request #3649 from FernandoS27/3d-fix | Rodrigo Locatti | 2020-04-15 | 1 | -0/+6 |
|\ \ \ | |/ / |/| | | Texture Cache: Read current data when flushing a 3D segment. | ||||
| * | | Texture Cache: Read current data when flushing a 3D segment. | Fernando Sahmkow | 2020-04-15 | 1 | -0/+6 |
| |/ | | | | | | | | | This PR corrects flushing of 3D segments when data of other segments is mixed, this aims to preserve the data in place. | ||||
* | | Merge pull request #3612 from ReinUsesLisp/red | Fernando Sahmkow | 2020-04-15 | 5 | -70/+141 |
|\ \ | | | | | | | shader/memory: Implement RED.E.ADD and minor changes to ATOM | ||||
| * | | shader/memory: Implement RED.E.ADD | ReinUsesLisp | 2020-04-06 | 5 | -28/+99 |
| | | | | | | | | | | | | | | | | | | | | | | | | Implements a reduction operation. It's an atomic operation that doesn't return a value. This commit introduces another primitive because some shading languages might have a primitive for reduction operations. | ||||
| * | | shader/memory: Add "using std::move" | ReinUsesLisp | 2020-04-06 | 1 | -11/+13 |
| | | | |||||
| * | | shader/memory: Minor fixes in ATOM | ReinUsesLisp | 2020-04-06 | 1 | -32/+30 |
| | | | |||||
* | | | CMakeLists: Make -Wreorder a compile-time error | Lioncash | 2020-04-15 | 2 | -4/+5 |
| | | | | | | | | | | | | | | | | | | This can result in silent logic bugs within code, and given the amount of times these kind of warnings are caused, they should be flagged at compile-time so no new code is submitted with them. | ||||
* | | | Merge pull request #3662 from ReinUsesLisp/constant-attrs | Mat M | 2020-04-15 | 2 | -2/+6 |
|\ \ \ | | | | | | | | | gl_rasterizer: Implement constant vertex attributes | ||||
| * | | | gl_rasterizer: Implement constant vertex attributes | ReinUsesLisp | 2020-04-14 | 2 | -2/+6 |
| | |/ | |/| | | | | | | | | | | Credits go to gdkchan from Ryujinx for finding constant attributes are used in retail games. | ||||
* | | | Merge pull request #3668 from ReinUsesLisp/vtx-format-16ui | Mat M | 2020-04-15 | 1 | -0/+10 |
|\ \ \ | | | | | | | | | maxwell_to_vk: Add uint16 vertex formats | ||||
| * | | | maxwell_to_vk: Add uint16 vertex formats | ReinUsesLisp | 2020-04-15 | 1 | -0/+8 |
| | | | | |||||
| * | | | maxwell_to_vk: Add missing breaks | ReinUsesLisp | 2020-04-15 | 1 | -0/+2 |
| | | | | | | | | | | | | | | | | Avoid invalid fallbacks. | ||||
* | | | | Merge pull request #3656 from ReinUsesLisp/glsl-full-decompile | Mat M | 2020-04-15 | 1 | -1/+3 |
|\ \ \ \ | | | | | | | | | | | gl_shader_cache: Use CompileDepth::FullDecompile on GLSL | ||||
| * | | | | gl_shader_cache: Use CompileDepth::FullDecompile on GLSL | ReinUsesLisp | 2020-04-14 | 1 | -1/+3 |
| | |/ / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | From my testing on a Splatoon 2 shader that takes 3800ms on average to compile changing to FullDecompile reduces it to 900ms on average. The shader decoder will automatically fallback to a more naive method if it can't use full decompile. | ||||
* | | | | Merge pull request #3654 from ReinUsesLisp/fix-fb-attach | Mat M | 2020-04-15 | 1 | -1/+1 |
|\ \ \ \ | | | | | | | | | | | gl_texture_cache: Fix layered texture attachment base level | ||||
| * | | | | gl_texture_cache: Fix layered texture attachment base level | ReinUsesLisp | 2020-04-13 | 1 | -1/+1 |
| |/ / / | | | | | | | | | | | | | | | | | | | | | The base level is already included in the texture view. If we specify the base level in the texture again, this will end up in the incorrect level and potentially out of bounds. | ||||
* | | | | Merge pull request #3663 from ReinUsesLisp/fcmp-rc | Mat M | 2020-04-15 | 2 | -3/+6 |
|\ \ \ \ | | | | | | | | | | | shader/arithmetic: Add FCMP_CR variant | ||||
| * | | | | shader/arithmetic: Add FCMP_CR variant | ReinUsesLisp | 2020-04-15 | 2 | -3/+6 |
| |/ / / | | | | | | | | | | | | | Adds another variant of FCMP. | ||||
* | | | | Merge pull request #3664 from ReinUsesLisp/fe3h-black-squares | Mat M | 2020-04-15 | 1 | -2/+4 |
|\ \ \ \ | | | | | | | | | | | Revert "gl_shader_decompiler: Implement merges with bitfieldInsert" | ||||
| * | | | | Revert "gl_shader_decompiler: Implement merges with bitfieldInsert" | ReinUsesLisp | 2020-04-15 | 1 | -2/+4 |
| |/ / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This reverts commit 05cf27083608bebd3ee4c38f2f948c8f2030f881. Apparently the first approach using floats instead of bitfieldInert worked better for Fire Emblem: Three Houses. Reverting to get that behavior back. | ||||
* | / / | vk_blit_screen: Initialize all members in VkPipelineViewportStateCreateInfo | ReinUsesLisp | 2020-04-15 | 1 | -0/+2 |
| |/ / |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | When the dynamic state is specified, pViewports and pScissors are ignored, quoting the specification: pViewports is a pointer to an array of VkViewport structures, defining the viewport transforms. If the viewport state is dynamic, this member is ignored. That said, AMD's proprietary driver itself seem to read it regardless of what the specification says. | ||||
* | | | Merge pull request #3657 from ReinUsesLisp/viewport-zero | Rodrigo Locatti | 2020-04-15 | 1 | -3/+6 |
|\ \ \ | | | | | | | | | vk_rasterizer: Default to 1 viewports with a size of 0 | ||||
| * | | | vk_rasterizer: Default to 1 viewports with a size of 0 | ReinUsesLisp | 2020-04-14 | 1 | -3/+6 |
| |/ / | | | | | | | | | | Silence validation layer errors. | ||||
* / / | Texture Cache: Only do buffer copies on accurate GPU. (#3634) | Fernando Sahmkow | 2020-04-15 | 1 | -1/+3 |
|/ / | | | | | This is a simple optimization as Buffer Copies are mostly used for texture recycling. They are, however, useful when games abuse undefined behavior but most 3D APIs forbid it. | ||||
* | | Merge pull request #3636 from ReinUsesLisp/drop-vk-hpp | Rodrigo Locatti | 2020-04-13 | 51 | -2272/+2881 |
|\ \ | | | | | | | renderer_vulkan: Drop Vulkan-Hpp | ||||
| * | | renderer_vulkan: Drop Vulkan-Hpp | ReinUsesLisp | 2020-04-11 | 51 | -2272/+2881 |
| | | | |||||
* | | | Merge pull request #3651 from ReinUsesLisp/line-widths | Mat M | 2020-04-13 | 5 | -2/+33 |
|\ \ \ | | | | | | | | | gl_rasterizer: Implement line widths and smooth lines | ||||
| * | | | gl_rasterizer: Implement line widths and smooth lines | ReinUsesLisp | 2020-04-13 | 5 | -2/+33 |
| | | | | | | | | | | | | | | | | | | | | Implements "legacy" features from OpenGL present on hardware such as smooth lines and line width. | ||||
* | | | | Merge pull request #3638 from ReinUsesLisp/remove-preserve-contents | Mat M | 2020-04-13 | 3 | -47/+31 |
|\ \ \ \ | | | | | | | | | | | texture_cache: Remove preserve_contents | ||||
| * | | | | texture_cache: Remove preserve_contents | ReinUsesLisp | 2020-04-11 | 3 | -47/+31 |
| | |/ / | |/| | | | | | | | | | | | | | | | | | | | | | | preserve_contents was always true. We can't assume we don't have to preserve clears because scissored and color masked clears exist. This removes preserve_contents and assumes it as true at all times. | ||||
* | | | | Merge pull request #3631 from ReinUsesLisp/more-astc | Mat M | 2020-04-13 | 1 | -82/+159 |
|\ \ \ \ | | | | | | | | | | | texture/astc: More small ASTC optimizations | ||||
| * | | | | astc: Hard code bit depth changes to 8 and use fast replicate | ReinUsesLisp | 2020-04-09 | 1 | -21/+15 |
| | | | | | |||||
| * | | | | astc: Use boost's static_vector to avoid heap allocations | ReinUsesLisp | 2020-04-09 | 1 | -10/+14 |
| | | | | | |||||
| * | | | | astc: Implement a fast precompiled alternative for Replicate | ReinUsesLisp | 2020-04-09 | 1 | -2/+57 |
| | | | | | |||||
| * | | | | astc: Move Replicate to a constexpr LUT when possible | ReinUsesLisp | 2020-04-09 | 1 | -8/+38 |
| | | | | | |||||
| * | | | | astc: Make InputBitStream constexpr | ReinUsesLisp | 2020-04-09 | 1 | -11/+11 |
| | | | | | |||||
| * | | | | astc: OutputBitStream style changes and make it constexpr | ReinUsesLisp | 2020-04-09 | 1 | -32/+26 |
| | | | | | |||||
* | | | | | Merge pull request #3619 from ReinUsesLisp/i2i | Mat M | 2020-04-13 | 1 | -13/+100 |
|\ \ \ \ \ | | | | | | | | | | | | | shader/conversion: Implement I2I sign extension, saturation and selection | ||||
| * | | | | | shader/conversion: Implement I2I sign extension, saturation and selection | ReinUsesLisp | 2020-04-07 | 2 | -14/+101 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Reimplements I2I adding sign extension, saturation (clamp source value to the destination), selection and destination sizes that are not 32 bits wide. It doesn't implement CC yet. | ||||
* | | | | | | Merge pull request #3627 from ReinUsesLisp/layered-view | Mat M | 2020-04-13 | 1 | -2/+2 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | gl_texture_cache: Attach view instead of base texture for layered attchments | ||||
| * | | | | | | gl_texture_cache: Attach view instead of base texture for layered attachments | ReinUsesLisp | 2020-04-09 | 1 | -2/+2 |
| | |/ / / / | |/| | | | | | | | | | | | | | | | | This way we are not ignoring the base layer of the current texture. | ||||
* | | | | | | Merge pull request #3646 from ReinUsesLisp/fix-glsl-turing | Mat M | 2020-04-13 | 1 | -6/+6 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | gl_shader_decompiler: Improve generated code in HMergeH* | ||||
| * | | | | | | gl_shader_decompiler: Implement merges with bitfieldInsert | ReinUsesLisp | 2020-04-13 | 1 | -4/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This also fixes Turing issues but it avoids doing more bitcasts. This should improve the generated code while also avoiding more points where compilers can flush floats. | ||||
| * | | | | | | gl_shader_decompiler: Improve generated code in HMergeH* | ReinUsesLisp | 2020-04-12 | 1 | -6/+8 |
| | |_|/ / / | |/| | | | | | | | | | | | | | | | | | | | | | | Avoiding bitwise expressions, this fixes Turing issues in shaders using half float merges that affected several games. | ||||
* | | | | | | Merge pull request #3633 from ReinUsesLisp/clean-texdec | Mat M | 2020-04-13 | 1 | -14/+0 |
|\ \ \ \ \ \ | |_|_|_|/ / |/| | | | | | shader/texture: Remove type mismatches management from shader decoder | ||||
| * | | | | | shader/texture: Remove type mismatches management from shader decoder | ReinUsesLisp | 2020-04-10 | 1 | -14/+0 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Since commit e22816a5bb we handle type mismatches from the CPU. We don't need to hack our shader decoder due to game bugs anymore. Removed in this commit. | ||||
* | | | | | | Merge pull request #3578 from ReinUsesLisp/vmnmx | Fernando Sahmkow | 2020-04-12 | 3 | -1/+117 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | shader/video: Partially implement VMNMX | ||||
| * | | | | | | shader/video: Partially implement VMNMX | ReinUsesLisp | 2020-04-12 | 3 | -0/+116 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Implements the common usages for VMNMX. Inputs with a different size than 32 bits are not supported and sign mismatches aren't supported either. VMNMX works as follows: It grabs Ra and Rb and applies a maximum/minimum on them (this is defined by .MX), having in mind the input sign. This result can then be saturated. After the intermediate result is calculated, it applies another operation on it using Rc. These operations are merges, accumulations or another min/max pass. This instruction allows to implement with a more flexible approach GCN's min3 and max3 instructions (for instance). | ||||
| * | | | | | | shader_bytecode: Fix I2I_IMM encoding | ReinUsesLisp | 2020-03-28 | 1 | -1/+1 |
| | | | | | | | |||||
* | | | | | | | video_core: Add MSAA registers in 3D engine and TIC | ReinUsesLisp | 2020-04-12 | 2 | -6/+76 |
| |_|/ / / / |/| | | | | | | | | | | | | | | | | | | | | | | | This adds the registers used for multisampling. It doesn't implement anything for now. | ||||
* | | | | | | Merge pull request #3594 from ReinUsesLisp/vk-instance | bunnei | 2020-04-11 | 5 | -96/+302 |
|\ \ \ \ \ \ | |_|/ / / / |/| | | | | | yuzu: Drop SDL2 and Qt frontend Vulkan requirements | ||||
| * | | | | | yuzu: Drop SDL2 and Qt frontend Vulkan requirements | ReinUsesLisp | 2020-04-07 | 5 | -105/+238 |
| | | | | | | | | | | | | | | | | | | | | | | | | Create Vulkan instances and surfaces from the Vulkan backend. | ||||
| * | | | | | renderer_vulkan: Query device names from the backend | ReinUsesLisp | 2020-04-07 | 3 | -0/+73 |
| | |_|/ / | |/| | | | |||||
* | | | | | Merge pull request #3622 from ReinUsesLisp/srgb-texture-border | Fernando Sahmkow | 2020-04-10 | 3 | -28/+84 |
|\ \ \ \ \ | | | | | | | | | | | | | video_core/texture: Use a LUT to convert sRGB texture borders | ||||
| * | | | | | video_core/textures: Move GetMaxAnisotropy to cpp file | ReinUsesLisp | 2020-04-08 | 2 | -19/+23 |
| | | | | | | |||||
| * | | | | | video_core/texture: Use a LUT to convert sRGB texture borders | ReinUsesLisp | 2020-04-08 | 3 | -9/+61 |
| |/ / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This is a reversed look up table extracted from https://gist.github.com/rygorous/2203834#file-gistfile1-cpp-L41-L62 that is used in https://github.com/devkitPro/deko3d/blob/04d4e9e587fa3dc5447b43d273bc45f440226e41/source/maxwell/tsc_generate.cpp#L38 Games usually bind 0xFD expecting a float texture border of 1.0f. The conversion previous to this commit was multiplying the uint8 sRGB texture border color by 255. This is close to 1.0f but when that difference matters, some graphical glitches appear. This look up table is manually changed in the edges, clamping towards 0.0f and 1.0f. While we are at it, move this logic to its own translation unit. | ||||
* | | | | | Merge pull request #3610 from FernandoS27/gpu-caches | Rodrigo Locatti | 2020-04-09 | 30 | -424/+374 |
|\ \ \ \ \ | | | | | | | | | | | | | Refactor all the GPU Caches to use VAddr for cache addressing | ||||
| * | | | | | VkRasterizer: Eliminate Legacy code. | Fernando Sahmkow | 2020-04-09 | 1 | -1/+0 |
| | | | | | | |||||
| * | | | | | Memory: Correct GCC errors. | Fernando Sahmkow | 2020-04-09 | 2 | -2/+3 |
| | | | | | | |||||
| * | | | | | Memory: Address Feedback. | Fernando Sahmkow | 2020-04-08 | 3 | -4/+7 |
| | | | | | | |||||
| * | | | | | GPUMemoryManager: Improve safety of memory reads. | Fernando Sahmkow | 2020-04-08 | 3 | -55/+47 |
| | | | | | | |||||
| * | | | | | Clang Format. | Fernando Sahmkow | 2020-04-06 | 1 | -6/+3 |
| | | | | | | |||||
| * | | | | | Shader/Pipeline Cache: Use VAddr instead of physical memory for addressing. | Fernando Sahmkow | 2020-04-06 | 7 | -87/+62 |
| | | | | | | |||||
| * | | | | | Query Cache: Use VAddr instead of physical memory for adressing. | Fernando Sahmkow | 2020-04-06 | 3 | -23/+22 |
| | | | | | | |||||
| * | | | | | Buffer Cache: Use vAddr instead of physical memory. | Fernando Sahmkow | 2020-04-06 | 10 | -106/+129 |
| | | | | | | |||||
| * | | | | | Texture Cache: Use vAddr instead of physical memory for caching. | Fernando Sahmkow | 2020-04-06 | 5 | -130/+81 |
| | | | | | | |||||
| * | | | | | GPU: Setup Flush/Invalidate to use VAddr instead of CacheAddr | Fernando Sahmkow | 2020-04-06 | 13 | -61/+71 |
| |/ / / / | |||||
* | | | | | Merge pull request #3601 from ReinUsesLisp/some-shader-encodings | bunnei | 2020-04-09 | 3 | -5/+18 |
|\ \ \ \ \ | |_|_|/ / |/| | | | | video_core/shader: Add some instruction and S2R encodings | ||||
| * | | | | shader/other: Add error message for some S2R registers | ReinUsesLisp | 2020-04-04 | 1 | -0/+6 |
| | | | | | |||||
| * | | | | shader_bytecode: Rename MOV_SYS to S2R | ReinUsesLisp | 2020-04-04 | 2 | -5/+5 |
| | | | | | |||||
| * | | | | shader_bytecode: Add encoding for BAR | ReinUsesLisp | 2020-04-04 | 1 | -0/+2 |
| | | | | | |||||
| * | | | | shader_ir: Add error message for EXIT.FCSM_TR | ReinUsesLisp | 2020-04-04 | 1 | -0/+3 |
| | | | | | |||||
| * | | | | shader_bytecode: Add encoding for VOTE.VTG | ReinUsesLisp | 2020-04-04 | 1 | -0/+2 |
| | |_|/ | |/| | | |||||
* | | | | Merge pull request #3599 from ReinUsesLisp/revert-3499 | bunnei | 2020-04-07 | 1 | -4/+2 |
|\ \ \ \ | | | | | | | | | | | Revert "Merge pull request #3499 from ReinUsesLisp/depth-2d-array" | ||||
| * | | | | Revert "Merge pull request #3499 from ReinUsesLisp/depth-2d-array" | ReinUsesLisp | 2020-04-04 | 1 | -4/+2 |
| |/ / / | | | | | | | | | | | | | | | | | | | | | | | | | This reverts commit 41905ee467b24172ba93e3fcd665bb4e4806a45a, reversing changes made to 35145bd529c3517e2c366efc764a762092d96edf. It causes regressions in several games. | ||||
* | | | | Merge pull request #3489 from namkazt/patch-2 | Rodrigo Locatti | 2020-04-07 | 2 | -11/+353 |
|\ \ \ \ | |_|/ / |/| | | | shader: implement SULD.D bits32/64 | ||||
| * | | | address nit. | Nguyen Dac Nam | 2020-04-07 | 1 | -1/+1 |
| | | | | |||||
| * | | | Apply suggestions from code review | Nguyen Dac Nam | 2020-04-07 | 1 | -9/+9 |
| | | | | | | | | | | | | Co-Authored-By: Rodrigo Locatti <reinuseslisp@airmail.cc> | ||||
| * | | | shader_decode: SULD.D using std::pair instead of out parameter | namkazy | 2020-04-06 | 2 | -19/+15 |
| | | | | |||||
| * | | | shader_decode: SULD.D avoid duplicate code block. | namkazy | 2020-04-06 | 1 | -39/+2 |
| | | | | |||||
| * | | | shader_decode: SULD.D fix conversion error. | namkazy | 2020-04-06 | 1 | -3/+3 |
| | | | | |||||
| * | | | shader_decode: SULD.D implement bits64 and reverse shader ir init method to removed shader stage. | namkazy | 2020-04-06 | 5 | -46/+105 |
| | | | | |||||
| * | | | silent warning (conversion error) | namkazy | 2020-04-05 | 1 | -3/+2 |
| | | | | |||||
| * | | | shader_decode: SULD.D -> SINT actually same as UNORM. | namkazy | 2020-04-05 | 1 | -5/+4 |
| | | | | |||||
| * | | | shader_decode: SULD.D fix decode SNORM component | namkazy | 2020-04-05 | 1 | -10/+9 |
| | | | | |||||
| * | | | clang-format | namkazy | 2020-04-05 | 1 | -2/+2 |
| | | | | |||||
| * | | | shader_decode: get sampler descriptor from registry. | namkazy | 2020-04-05 | 1 | -77/+93 |
| | | | | |||||
| * | | | tweaking. | namkazy | 2020-04-05 | 1 | -3/+3 |
| | | | | |||||
| * | | | clang-format | Nguyen Dac Nam | 2020-04-05 | 1 | -2/+1 |
| | | | | |||||
| * | | | cleanup unuse params | namkazy | 2020-04-05 | 1 | -8/+6 |
| | | | | |||||
| * | | | cleanup debug code. | namkazy | 2020-04-05 | 1 | -14/+3 |
| | | | | |||||
| * | | | reimplement get component type, uncomment mistaken code | namkazy | 2020-04-05 | 1 | -18/+93 |
| | | | | |||||
| * | | | remove disable optimize | namkazy | 2020-04-05 | 1 | -2/+0 |
| | | | | |||||
| * | | | [wip] reimplement SULD.D | namkazy | 2020-04-05 | 1 | -22/+229 |
| | | | | |||||
| * | | | add shader stage when init shader ir | namkazy | 2020-04-05 | 4 | -9/+12 |
| | | | | |||||
| * | | | clang-fix | Nguyen Dac Nam | 2020-04-05 | 1 | -1/+1 |
| | | | | |||||
| * | | | shader: image - import PredCondition | Nguyen Dac Nam | 2020-04-05 | 1 | -0/+1 |
| | | | | |||||
| * | | | shader: SULD.D bits32 implement more complexer method. | Nguyen Dac Nam | 2020-04-05 | 1 | -4/+28 |
| | | | | |||||
| * | | | shader: SULD.D import StoreType | Nguyen Dac Nam | 2020-04-05 | 1 | -0/+1 |
| | | | | |||||
| * | | | shader: implement SULD.D bits32 | Nguyen Dac Nam | 2020-04-05 | 1 | -11/+27 |
| |/ / | |||||
* | | | Merge pull request #3513 from ReinUsesLisp/native-astc | Fernando Sahmkow | 2020-04-06 | 12 | -301/+247 |
|\ \ \ | | | | | | | | | video_core: Use native ASTC when available | ||||
| * | | | gl_texture_cache: Fix software ASTC fallback | ReinUsesLisp | 2020-04-01 | 1 | -7/+12 |
| | | | | |||||
| * | | | vk_device: Add missing ASTC queries | ReinUsesLisp | 2020-04-01 | 1 | -14/+29 |
| | | | | |||||
| * | | | video_core: Use native ASTC when available | ReinUsesLisp | 2020-04-01 | 10 | -281/+176 |
| | | | | |||||
| * | | | gl_device: Detect if ASTC is reported and expose it | ReinUsesLisp | 2020-04-01 | 2 | -0/+31 |
| | | | | |||||
* | | | | Merge pull request #3592 from ReinUsesLisp/ipa | Fernando Sahmkow | 2020-04-06 | 4 | -68/+74 |
|\ \ \ \ | | | | | | | | | | | shader_decompiler: Remove FragCoord.w hack and change IPA implementation | ||||
| * | | | | shader_decompiler: Remove FragCoord.w hack and change IPA implementation | ReinUsesLisp | 2020-04-02 | 4 | -68/+74 |
| |/ / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Credits go to gdkchan and Ryujinx. The pull request used for this can be found here: https://github.com/Ryujinx/Ryujinx/pull/1082 yuzu was already using the header for interpolation, but it was missing the FragCoord.w multiplication described in the linked pull request. This commit finally removes the FragCoord.w == 1.0f hack from the shader decompiler. While we are at it, this commit renames some enumerations to match Nvidia's documentation (linked below) and fixes component declaration order in the shader program header (z and w were swapped). https://github.com/NVIDIA/open-gpu-doc/blob/master/Shader-Program-Header/Shader-Program-Header.html | ||||
* | | | | Merge pull request #3589 from ReinUsesLisp/fix-clears | Fernando Sahmkow | 2020-04-06 | 1 | -2/+5 |
|\ \ \ \ | |_|/ / |/| | | | gl_rasterizer: Mark cleared textures as dirty | ||||
| * | | | gl_rasterizer: Mark cleared textures as dirty | ReinUsesLisp | 2020-03-31 | 1 | -2/+5 |
| | | | | | | | | | | | | | | | | | | | | Fixes a potential edge case where cleared textures read from the CPU were not flushed. | ||||
* | | | | shader/memory: Silence no return value warning | ReinUsesLisp | 2020-04-02 | 1 | -0/+3 |
| | | | | | | | | | | | | | | | | Silences a warning about control paths not all returning a value. | ||||
* | | | | Merge pull request #3552 from jroweboy/single-context | Rodrigo Locatti | 2020-04-02 | 16 | -93/+127 |
|\ \ \ \ | |_|/ / |/| | | | Refactor Context management (Fixes renderdoc on opengl issues) | ||||
| * | | | Address review and fix broken yuzu-tester build | James Rowe | 2020-03-26 | 3 | -5/+7 |
| | | | | |||||
| * | | | Frontend/GPU: Refactor context management | James Rowe | 2020-03-25 | 16 | -97/+129 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Changes the GraphicsContext to be managed by the GPU core. This eliminates the need for the frontends to fool around with tricky MakeCurrent/DoneCurrent calls that are dependent on the settings (such as async gpu option). This also refactors out the need to use QWidget::fromWindowContainer as that caused issues with focus and input handling. Now we use a regular QWidget and just access the native windowHandle() directly. Another change is removing the debug tool setting in FrameMailbox. Instead of trying to block the frontend until a new frame is ready, the core will now take over presentation and draw directly to the window if the renderer detects that its hooked by NSight or RenderDoc Lastly, since it was in the way, I removed ScopeAcquireWindowContext and replaced it with a simple subclass in GraphicsContext that achieves the same result | ||||
* | | | | Merge pull request #3591 from ReinUsesLisp/vk-wrapper-part2 | Rodrigo Locatti | 2020-04-01 | 2 | -0/+850 |
|\ \ \ \ | | | | | | | | | | | renderer_vulkan/wrapper: Add a Vulkan wrapper (part 2 of 2) | ||||
| * | | | | renderer_vulkan/wrapper: Add vkEnumerateInstanceExtensionProperties wrapper | ReinUsesLisp | 2020-04-01 | 2 | -0/+17 |
| | | | | | |||||
| * | | | | renderer_vulkan/wrapper: Add command buffer handle | ReinUsesLisp | 2020-04-01 | 1 | -0/+192 |
| | | | | | |||||
| * | | | | renderer_vulkan/wrapper: Add physical device handle | ReinUsesLisp | 2020-04-01 | 2 | -0/+123 |
| | | | | | |||||
| * | | | | renderer_vulkan/wrapper: Add device handle | ReinUsesLisp | 2020-04-01 | 2 | -0/+277 |
| | | | | | |||||
| * | | | | renderer_vulkan/wrapper: Add swapchain handle | ReinUsesLisp | 2020-04-01 | 2 | -0/+15 |
| | | | | | |||||
| * | | | | renderer_vulkan/wrapper: Add fence handle | ReinUsesLisp | 2020-04-01 | 1 | -0/+17 |
| | | | | | |||||
| * | | | | renderer_vulkan/wrapper: Add device memory handle | ReinUsesLisp | 2020-04-01 | 1 | -0/+15 |
| | | | | | |||||
| * | | | | renderer_vulkan/wrapper: Add pool handles | ReinUsesLisp | 2020-04-01 | 2 | -0/+47 |
| | | | | | |||||
| * | | | | renderer_vulkan/wrapper: Add buffer and image handles | ReinUsesLisp | 2020-04-01 | 2 | -0/+24 |
| | | | | | |||||
| * | | | | renderer_vulkan/wrapper: Add queue handle | ReinUsesLisp | 2020-04-01 | 2 | -0/+36 |
| | | | | | |||||
| * | | | | renderer_vulkan/wrapper: Add instance handle | ReinUsesLisp | 2020-04-01 | 2 | -0/+87 |
| | |/ / | |/| | | |||||
* | | | | Merge pull request #3561 from ReinUsesLisp/f2f-conversion | Fernando Sahmkow | 2020-03-31 | 1 | -5/+10 |
|\ \ \ \ | | | | | | | | | | | shader/conversion: Fix F2F rounding operations with different sizes | ||||
| * | | | | shader/conversion: Fix F2F rounding operations with different sizes | ReinUsesLisp | 2020-03-26 | 1 | -5/+10 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Rounding operations only matter when the conversion size of source and destination is the same, i.e. .F16.F16, .F32.F32 and .F64.F64. When there is a mismatch (.F16.F32), these bits are used for IEEE rounding, we don't emulate this because GLSL and SPIR-V don't support configuring it per operation. | ||||
* | | | | | Merge pull request #3577 from ReinUsesLisp/lea | Fernando Sahmkow | 2020-03-31 | 1 | -11/+4 |
|\ \ \ \ \ | | | | | | | | | | | | | shader/lea: Fix LEA implementation | ||||
| * | | | | | shader/lea: Simplify generated LEA code | ReinUsesLisp | 2020-03-28 | 1 | -3/+2 |
| | | | | | | |||||
| * | | | | | shader/lea: Fix op_a and op_b usages | ReinUsesLisp | 2020-03-27 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | They were swapped. | ||||
| * | | | | | shader/lea: Remove const and use move when possible | ReinUsesLisp | 2020-03-27 | 1 | -11/+5 |
| | |_|_|/ | |/| | | | |||||
* | | | | | Merge pull request #3506 from namkazt/patch-9 | Rodrigo Locatti | 2020-03-31 | 6 | -36/+119 |
|\ \ \ \ \ | |_|_|/ / |/| | | | | shader_decode: Implement partial ATOM/ATOMS instr | ||||
| * | | | | clang-format | Nguyen Dac Nam | 2020-03-31 | 1 | -2/+1 |
| | | | | | |||||
| * | | | | shader_decode: fix by suggestion | Nguyen Dac Nam | 2020-03-31 | 1 | -27/+22 |
| | | | | | |||||
| * | | | | clang-format | namkazy | 2020-03-30 | 1 | -3/+3 |
| | | | | | |||||
| * | | | | gl_decompiler: min/max op not implement yet | namkazy | 2020-03-30 | 1 | -0/+4 |
| | | | | | |||||
| * | | | | shader_decode: ATOM/ATOMS: add function to avoid code repetition | namkazy | 2020-03-30 | 2 | -70/+53 |
| | | | | | |||||
| * | | | | shader_decode: merge GlobalAtomicOp to AtomicOp | namkazy | 2020-03-30 | 1 | -13/+1 |
| | | | | | |||||
| * | | | | shader_decode: implement ATOM operation for S32 and U32 | Nguyen Dac Nam | 2020-03-30 | 1 | -6/+39 |
| | | | | | |||||
| * | | | | clang-format | namkazy | 2020-03-30 | 1 | -3/+3 |
| | | | | | |||||
| * | | | | shader_decode: implement ATOMS instr partial. | Nguyen Dac Nam | 2020-03-30 | 1 | -10/+42 |
| | | | | | |||||
| * | | | | vk_decompiler: add atomic op and handler function. | Nguyen Dac Nam | 2020-03-30 | 1 | -6/+25 |
| | | | | | |||||
| * | | | | gl_decompiler: add atomic op | Nguyen Dac Nam | 2020-03-30 | 1 | -0/+16 |
| | | | | | |||||
| * | | | | shader: node - update correct comment | Nguyen Dac Nam | 2020-03-30 | 1 | -15/+15 |
| | | | | | |||||
| * | | | | shader_decode: add Atomic op for common usage | Nguyen Dac Nam | 2020-03-30 | 1 | -1/+15 |
| |/ / / | |||||
* | | | | Merge pull request #3566 from ReinUsesLisp/vk-wrapper-part1 | Rodrigo Locatti | 2020-03-31 | 3 | -0/+889 |
|\ \ \ \ | | | | | | | | | | | renderer_vulkan/wrapper: Add a Vulkan wrapper (part 1 of 2) | ||||
| * | | | | renderer_vulkan/wrapper: Address feedback | ReinUsesLisp | 2020-03-28 | 1 | -3/+24 |
| | | | | | |||||
| * | | | | renderer_vulkan/wrapper: Add owning handles | ReinUsesLisp | 2020-03-27 | 1 | -0/+18 |
| | | | | | |||||
| * | | | | renderer_vulkan/wrapper: Add pool allocations owning templated class | ReinUsesLisp | 2020-03-27 | 1 | -0/+81 |
| | | | | | |||||
| * | | | | renderer_vulkan/wrapper: Add owning handle templated class | ReinUsesLisp | 2020-03-27 | 1 | -0/+144 |
| | | | | | |||||
| * | | | | renderer_vulkan/wrapper: Add destroy and free overload set | ReinUsesLisp | 2020-03-27 | 2 | -0/+133 |
| | | | | | |||||
| * | | | | renderer_vulkan/wrapper: Add dispatch table and loaders | ReinUsesLisp | 2020-03-27 | 2 | -0/+283 |
| | | | | | |||||
| * | | | | renderer_vulkan/wrapper: Add exception class | ReinUsesLisp | 2020-03-27 | 2 | -0/+34 |
| | | | | | |||||
| * | | | | renderer_vulkan/wrapper: Add ToString function for VkResult | ReinUsesLisp | 2020-03-27 | 3 | -0/+91 |
| | | | | | |||||
| * | | | | renderer_vulkan/wrapper: Add Vulakn wrapper and a span helper | ReinUsesLisp | 2020-03-27 | 2 | -0/+84 |
| | |/ / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The intention behind a Vulkan wrapper is to drop Vulkan-Hpp. The issues with Vulkan-Hpp are: - Regular breaks of the API. - Copy constructors that do the same as the aggregates (fixed recently) - External dynamic dispatch that is hard to remove - Alias KHR handles with non-KHR handles making it impossible to use smart handles on Vulkan 1.0 instances with extensions that were included on Vulkan 1.1. - Dynamic dispatchers silently change size depending on preprocessor definitions. Different files will have different dispatch definitions, generating all kinds of hard to debug memory issues. In other words, Vulkan-Hpp is not "production ready" for our needs and this wrapper aims to replace it without losing RAII and exception safety. | ||||
* | | | | Merge pull request #3560 from ReinUsesLisp/fix-stencil | bunnei | 2020-03-30 | 1 | -5/+2 |
|\ \ \ \ | |_|/ / |/| | | | gl_rasterizer: Synchronize stencil testing on clears | ||||
| * | | | gl_rasterizer: Update stencil test regardless of it being disabled | ReinUsesLisp | 2020-03-26 | 1 | -5/+1 |
| | | | | |||||
| * | | | gl_rasterizer: Synchronize stencil testing on clears | ReinUsesLisp | 2020-03-26 | 1 | -0/+1 |
| |/ / | |||||
* | | | Merge pull request #3565 from ReinUsesLisp/image-format | Fernando Sahmkow | 2020-03-27 | 1 | -4/+10 |
|\ \ \ | | | | | | | | | engines/const_buffer_engine_interface: Store image format and types | ||||
| * | | | engines/const_buffer_engine_interface: Store image format type | ReinUsesLisp | 2020-03-27 | 1 | -4/+10 |
| |/ / | | | | | | | | | | | | | | | | | | | This information is required to properly implement SULD.B. It might also be handy for all image operations, since it would allow us to implement them on devices that require the image format to be specified (on desktop, this would be AMD on OpenGL and Intel on OpenGL and Vulkan). | ||||
* / / | maxwell_to_vk: implement signedscaled vertex formats | Dan | 2020-03-27 | 1 | -0/+20 |
|/ / | |||||
* | | Merge pull request #3544 from makigumo/myfork/patch-2 | bunnei | 2020-03-26 | 1 | -4/+5 |
|\ \ | | | | | | | xmad: fix clang build error | ||||
| * | | xmad: fix clang build error | makigumo | 2020-03-23 | 1 | -4/+5 |
| | | | |||||
* | | | Merge pull request #3520 from ReinUsesLisp/legacy-varyings | bunnei | 2020-03-26 | 4 | -47/+119 |
|\ \ \ | |_|/ |/| | | gl_shader_decompiler: Implement legacy varyings | ||||
| * | | gl_shader_decompiler: Don't redeclare gl_VertexID and gl_InstanceID | ReinUsesLisp | 2020-03-18 | 1 | -8/+0 |
| | | | |||||
| * | | gl_shader_decompiler: Implement legacy varyings | ReinUsesLisp | 2020-03-16 | 1 | -6/+57 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | Legacy varyings are special attributes carried over in hardware from the OpenGL 1 and OpenGL 2 days. These were generally used instead of the generic attributes we use today. They are deprecated or removed from most APIs, but Nvidia still ships them in hardware. To implement these, this commit maps them 1:1 to OpenGL compatibility. | ||||
| * | | shader/shader_ir: Track usage in input attribute and of legacy varyings | ReinUsesLisp | 2020-03-16 | 3 | -34/+64 |
| | | | |||||
| * | | shader/shader_ir: Fix clip distance usage stores | ReinUsesLisp | 2020-03-16 | 1 | -2/+1 |
| | | | |||||
| * | | shader/shader_ir: Change declare output attribute to a switch | ReinUsesLisp | 2020-03-16 | 1 | -9/+9 |
| | | | |||||
* | | | Merge pull request #3543 from ReinUsesLisp/gl-depth-range | Fernando Sahmkow | 2020-03-23 | 1 | -4/+6 |
|\ \ \ | | | | | | | | | gl_rasterizer: Use transformed viewport for depth ranges | ||||
| * | | | gl_rasterizer: Use transformed viewport for depth ranges | ReinUsesLisp | 2020-03-22 | 1 | -4/+6 |
| | |/ | |/| | | | | | | | | | | | | | Implement depth ranges using the transformed viewport instead of the generic one. This matches the current Vulkan implementation but doesn't support negative depth ranges. An update to glad is required for this. | ||||
* | | | apply replay logic to all writes. remove replay from MacroInterpreter::Send (@fincs) | namkazy | 2020-03-22 | 2 | -12/+9 |
| | | | |||||
* | | | maxwell_3d: change declaration order | namkazy | 2020-03-22 | 1 | -1/+3 |
| | | | |||||
* | | | maxwell_3d: init shadow_state | namkazy | 2020-03-22 | 1 | -0/+2 |
| | | | |||||
* | | | maxwell_3d: this seem more correct. | namkazy | 2020-03-22 | 1 | -2/+2 |
| | | | |||||
* | | | maxwell_3d: update comments for shadow ram usage | namkazy | 2020-03-22 | 3 | -2/+6 |
| | | | |||||
* | | | marco_interpreter: write hw value when shadow ram requested | Nguyen Dac Nam | 2020-03-22 | 1 | -0/+6 |
| | | | |||||
* | | | maxwell_3d: track shadow ram ctrl and hw reg value | Nguyen Dac Nam | 2020-03-22 | 1 | -0/+10 |
| | | | |||||
* | | | maxwell_3d: implement MME shadow RAM | Nguyen Dac Nam | 2020-03-22 | 1 | -1/+14 |
|/ / | |||||
* | | Merge pull request #3505 from namkazt/patch-8 | bunnei | 2020-03-19 | 1 | -15/+48 |
|\ \ | | | | | | | shader_decode: implement XMAD mode CSfu | ||||
| * | | nit & remove some optional param | Nguyen Dac Nam | 2020-03-13 | 1 | -10/+11 |
| | | | |||||
| * | | shader_decode: implement XMAD mode CSfu | Nguyen Dac Nam | 2020-03-13 | 1 | -9/+41 |
| | | | |||||
* | | | vk_texture_cache: Silence misc warnings | ReinUsesLisp | 2020-03-19 | 1 | -3/+3 |
| | | | |||||
* | | | vk_staging_buffer_pool: Silence unused constant warning | ReinUsesLisp | 2020-03-19 | 1 | -1/+1 |
| | | | |||||
* | | | vk_rasterizer: Remove unused variable | ReinUsesLisp | 2020-03-19 | 1 | -2/+0 |
| | | | |||||
* | | | vk_pipeline_cache: Remove unused variable | ReinUsesLisp | 2020-03-19 | 1 | -1/+0 |
| | | | |||||
* | | | maxwell_to_vk: Sielence -Wswitch warning | ReinUsesLisp | 2020-03-19 | 1 | -0/+2 |
| | | | |||||
* | | | gl_shader_decompiler: Remove deprecated function and its usages | ReinUsesLisp | 2020-03-19 | 1 | -11/+8 |
| | | | |||||
* | | | gl_rasterizer: Silence misc warnings | ReinUsesLisp | 2020-03-19 | 1 | -7/+2 |
| | | | |||||
* | | | kepler_compute: Remove unused variables | ReinUsesLisp | 2020-03-19 | 1 | -8/+0 |
| | | | |||||
* | | | astc: Fix clang build issues | ReinUsesLisp | 2020-03-18 | 1 | -12/+12 |
| | | | |||||
* | | | Merge pull request #3510 from FernandoS27/dirty-write | Mat M | 2020-03-17 | 4 | -13/+0 |
|\ \ \ | | | | | | | | | DirtyFlags: relax need to set render_targets as dirty | ||||
| * | | | DirtyFlags: relax need to set render_targets as dirty | Fernando Sahmkow | 2020-03-14 | 4 | -13/+0 |
| | | | | | | | | | | | | | | | | | | | | The texture cache already takes care of setting a render target to dirty when invalidated. | ||||
* | | | | Merge pull request #3509 from ReinUsesLisp/astc-opts | Mat M | 2020-03-17 | 1 | -537/+519 |
|\ \ \ \ | | | | | | | | | | | astc: General changes and optimizations | ||||
| * | | | | astc: Fix typos from search and replace | ReinUsesLisp | 2020-03-14 | 1 | -3/+3 |
| | | | | | |||||
| * | | | | astc: Minor changes to InputBitStream | ReinUsesLisp | 2020-03-14 | 1 | -28/+34 |
| | | | | | |||||
| * | | | | astc: Pass val in Replicate by copy | ReinUsesLisp | 2020-03-14 | 1 | -1/+1 |
| | | | | | |||||
| * | | | | astc: Call std::vector:reserve on decodedClolorValues to avoid reallocating | ReinUsesLisp | 2020-03-14 | 1 | -0/+2 |
| | | | | | |||||
| * | | | | astc: Call std::vector::reserve on texelWeightValues to avoid reallocating | ReinUsesLisp | 2020-03-14 | 1 | -0/+2 |
| | | | | | |||||
| * | | | | astc: Create a LUT at compile time for encoding values | ReinUsesLisp | 2020-03-14 | 1 | -7/+19 |
| | | | | | |||||
| * | | | | astc: Make IntegerEncodedValue a trivial structure | ReinUsesLisp | 2020-03-14 | 1 | -212/+177 |
| | | | | | |||||
| * | | | | astc: Make IntegerEncodedValue constructor constexpr | ReinUsesLisp | 2020-03-14 | 1 | -5/+6 |
| | | | | | |||||
| * | | | | astc: Make IntegerEncodedValue trivially copyable | ReinUsesLisp | 2020-03-14 | 1 | -9/+2 |
| | | | | | |||||
| * | | | | astc: Rename C types to common_types | ReinUsesLisp | 2020-03-14 | 1 | -79/+78 |
| | | | | | |||||
| * | | | | astc: Move Popcnt to an anonymous namespace and make it constexpr | ReinUsesLisp | 2020-03-14 | 1 | -9/+13 |
| | | | | | |||||
| * | | | | astc: Use common types instead of stdint.h integer types | ReinUsesLisp | 2020-03-14 | 1 | -284/+282 |
| | | | | | |||||
| * | | | | astc: Use 'enum class' instead of 'enum' for EIntegerEncoding | ReinUsesLisp | 2020-03-14 | 1 | -25/+25 |
| | | | | | |||||
* | | | | | Merge pull request #3518 from ReinUsesLisp/scissor-clears | Mat M | 2020-03-17 | 4 | -45/+53 |
|\ \ \ \ \ | | | | | | | | | | | | | vk_rasterizer: Implement scissor clears and layered clears | ||||
| * | | | | | vk_rasterizer: Implement layered clears | ReinUsesLisp | 2020-03-15 | 1 | -2/+2 |
| | | | | | | |||||
| * | | | | | vk_rasterizer: Reimplement clears with vkCmdClearAttachments | ReinUsesLisp | 2020-03-15 | 4 | -45/+53 |
| | | | | | | |||||
* | | | | | | Merge pull request #3519 from ReinUsesLisp/int-formats | Mat M | 2020-03-17 | 1 | -0/+4 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | maxwell_to_vk: Implement RG32 and RGB32 integer vertex formats | ||||
| * | | | | | | maxwell_to_vk: Implement RG32 and RGB32 integer vertex formats | ReinUsesLisp | 2020-03-15 | 1 | -0/+4 |
| | |_|_|_|/ | |/| | | | | |||||
* | | | | | | Merge pull request #3498 from ReinUsesLisp/texel-fetch-glsl | bunnei | 2020-03-17 | 1 | -6/+9 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | gl_shader_decompiler: Add layer component to texelFetch | ||||
| * | | | | | | gl_shader_decompiler: Add layer component to texelFetch | ReinUsesLisp | 2020-03-12 | 1 | -6/+9 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | TexelFetch was not emitting the array component generating invalid GLSL. | ||||
* | | | | | | | renderer_opengl: Move some logic to an anonymous namespace | ReinUsesLisp | 2020-03-16 | 1 | -151/+151 |
| | | | | | | | |||||
* | | | | | | | renderer_opengl: Detect Nvidia Nsight as a debugging tool | ReinUsesLisp | 2020-03-16 | 3 | -7/+22 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | Use getenv to detect Nsight. | ||||
* | | | | | | | Merge pull request #3515 from ReinUsesLisp/vertex-vk-assert | Rodrigo Locatti | 2020-03-16 | 1 | -1/+1 |
|\ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | vk_rasterizer: Fix vertex range assert | ||||
| * | | | | | | | vk_rasterizer: Fix vertex range assert | ReinUsesLisp | 2020-03-15 | 1 | -1/+1 |
| | |/ / / / / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | End can be equal to start in CalculateVertexArraysSize. This is quite common when the vertex size is zero. | ||||
* | | | | | | | Merge pull request #3501 from ReinUsesLisp/rgba16-snorm | Rodrigo Locatti | 2020-03-16 | 8 | -69/+84 |
|\ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | video_core: Implement RGBA16_SNORM | ||||
| * | | | | | | | video_core: Implement RGBA16_SNORM | ReinUsesLisp | 2020-03-13 | 8 | -69/+84 |
| | |/ / / / / | |/| | | | | | | | | | | | | | | | | | | | Implement RGBA16_SNORM with the current API. Nothing special here. | ||||
* | | | | | | | Merge pull request #3502 from namkazt/patch-3 | Rodrigo Locatti | 2020-03-16 | 3 | -29/+53 |
|\ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | shader_decode: Reimplement BFE instructions | ||||
| * | | | | | | | clang-format | Nguyen Dac Nam | 2020-03-14 | 1 | -2/+1 |
| | | | | | | | | |||||
| * | | | | | | | nit | Nguyen Dac Nam | 2020-03-14 | 1 | -1/+1 |
| | | | | | | | | |||||
| * | | | | | | | clang-format | Nguyen Dac Nam | 2020-03-13 | 1 | -4/+8 |
| | | | | | | | | |||||
| * | | | | | | | Apply suggestions from code review | Nguyen Dac Nam | 2020-03-13 | 1 | -5/+5 |
| | | | | | | | | | | | | | | | | | | | | | | | | Co-Authored-By: Mat M. <mathew1800@gmail.com> | ||||
| * | | | | | | | shader_decode: BFE add ref of reverse parallel method. | Nguyen Dac Nam | 2020-03-13 | 1 | -0/+3 |
| | | | | | | | | |||||
| * | | | | | | | shader_decode: implement BREV on BFE | Nguyen Dac Nam | 2020-03-13 | 1 | -6/+25 |
| | | | | | | | | | | | | | | | | | | | | | | | | Implement reverse parallel follow: https://graphics.stanford.edu/~seander/bithacks.html#ReverseParallel | ||||
| * | | | | | | | shader_bytecode: update BFE instructions struct. | Nguyen Dac Nam | 2020-03-13 | 1 | -8/+3 |
| | | | | | | | | |||||
| * | | | | | | | node_helper: add IBitfieldExtract case | Nguyen Dac Nam | 2020-03-13 | 1 | -0/+2 |
| | | | | | | | | |||||
| * | | | | | | | shader_decode: Reimplement BFE instructions | Nguyen Dac Nam | 2020-03-13 | 1 | -25/+27 |
| | |_|_|_|_|/ | |/| | | | | | |||||
* | | | | | | | Merge pull request #3503 from makigumo/patch-2 | Rodrigo Locatti | 2020-03-16 | 1 | -0/+2 |
|\ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | maxwell_to_vk: add vertex format eA2B10G10R10UnormPack32 | ||||
| * | | | | | | | fix formatting | makigumo | 2020-03-13 | 1 | -1/+1 |
| | | | | | | | | |||||
| * | | | | | | | maxwell_to_vk: add vertex format eA2B10G10R10UnormPack32 | makigumo | 2020-03-13 | 1 | -1/+3 |
| |/ / / / / / | |||||
* | | | | | | | Merge pull request #3512 from bunnei/fix-renderdoc | Rodrigo Locatti | 2020-03-15 | 3 | -1/+38 |
|\ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | renderer_opengl: Keep frames synchronized when using a GPU debugger. | ||||
| * | | | | | | | renderer_opengl: Keep presentation frames in lock-step when GPU debugging. | bunnei | 2020-03-14 | 1 | -1/+32 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Fixes renderdoc with OpenGL renderer. | ||||
| * | | | | | | | gl_device: Add option to check GL_EXT_debug_tool. | bunnei | 2020-03-14 | 2 | -0/+6 |
| | |_|_|/ / / | |/| | | | | | |||||
* | | | | | | | vk_shader_decompiler: fix linux build | makigumo | 2020-03-15 | 1 | -1/+1 |
| |_|_|/ / / |/| | | | | | |||||
* | | | | | | Merge pull request #3508 from FernandoS27/page-table | bunnei | 2020-03-14 | 1 | -1/+1 |
|\ \ \ \ \ \ | |/ / / / / |/| | | | | | PageTable: move backing addresses to a children class as the CPU page table does not need them. | ||||
| * | | | | | PageTable: move backing addresses to a children class as the CPU page table does not need them. | Fernando Sahmkow | 2020-03-14 | 1 | -1/+1 |
| | |_|_|/ | |/| | | | | | | | | | | | | | | | | | | | | | | | This PR aims to reduce the memory usage in the CPU page table by moving GPU specific parameters into a child class. This saves 1Gb of Memory for most games. | ||||
* | | | | | Merge pull request #3500 from ReinUsesLisp/incompatible-types | Fernando Sahmkow | 2020-03-14 | 1 | -2/+39 |
|\ \ \ \ \ | | | | | | | | | | | | | texture_cache: Report incompatible textures as black | ||||
| * | | | | | texture_cache: Report incompatible textures as black | ReinUsesLisp | 2020-03-12 | 1 | -2/+39 |
| | |_|/ / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Some games bind incompatible texture types to certain types. For example Astral Chain binds a 2D texture with 1 layer (non-array) to a cubemap slot (that's how it's used in the shader). After testing this in hardware, the expected "undefined behavior" is to report all pixels as black. We already have a path for reporting black textures in the texture cache. When textures types are incompatible, this commit binds these kind of textures. This is done on the API agnostic texture cache so no extra code has to be inserted on OpenGL or Vulkan. As a side effect, this fixes invalidations of ASTC textures on Astral Chain. This happened because yuzu detected a cube texture and forced 6 faces, generating a texture larger than what the TIC reported. | ||||
* | | | | | Merge pull request #3499 from ReinUsesLisp/depth-2d-array | Fernando Sahmkow | 2020-03-14 | 1 | -2/+4 |
|\ \ \ \ \ | | | | | | | | | | | | | texture_cache/surface_params: Force depth=1 on 2D textures | ||||
| * | | | | | texture_cache/surface_params: Force depth=1 on 2D textures | ReinUsesLisp | 2020-03-12 | 1 | -2/+4 |
| |/ / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Sometimes games will sample a 2D array TIC with a 2D access in the shader. This causes bad interactions with the rest of the texture cache. To emulate what the game wants to do, force a depth=1 on 2D textures (not 2D arrays) and let the texture cache handle the rest. | ||||
* | | | | | vk/gl_shader_decompiler: Silence assertion on compute | ReinUsesLisp | 2020-03-13 | 2 | -6/+12 |
| | | | | | |||||
* | | | | | vk_shader_decompiler: Fix default varying regression | ReinUsesLisp | 2020-03-13 | 1 | -2/+6 |
| | | | | | |||||
* | | | | | maxwell_3d: Add padding words to XFB entries | ReinUsesLisp | 2020-03-13 | 1 | -2/+4 |
| | | | | | | | | | | | | | | | | | | | | | | | | | Use INSERT_UNION_PADDING_WORDS instead of alignas to ensure a size requirement. | ||||
* | | | | | gl_shader_decompiler: Fix implicit conversion errors | ReinUsesLisp | 2020-03-13 | 1 | -3/+3 |
| | | | | | |||||
* | | | | | vk_shader_decompiler: Fix implicit type conversion | Rodrigo Locatti | 2020-03-13 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | Co-Authored-By: Mat M. <mathew1800@gmail.com> | ||||
* | | | | | vk_rasterizer: Implement transform feedback binding zero | ReinUsesLisp | 2020-03-13 | 2 | -0/+46 |
| | | | | | |||||
* | | | | | vk_shader_decompiler: Add XFB decorations to generic varyings | ReinUsesLisp | 2020-03-13 | 1 | -16/+89 |
| | | | | | |||||
* | | | | | vk_device: Enable VK_EXT_transform_feedback when available | ReinUsesLisp | 2020-03-13 | 2 | -7/+40 |
| | | | | | |||||
* | | | | | vk_device: Shrink formatless capability name size | ReinUsesLisp | 2020-03-13 | 3 | -26/+23 |
| | | | | | |||||
* | | | | | shader/transform_feedback: Expose buffer stride | ReinUsesLisp | 2020-03-13 | 3 | -1/+4 |
| | | | | | |||||
* | | | | | vk_shader_decompiler: Use registry for specialization | ReinUsesLisp | 2020-03-13 | 4 | -31/+37 |
| | | | | | |||||
* | | | | | gl_rasterizer: Implement transform feedback bindings | ReinUsesLisp | 2020-03-13 | 3 | -10/+83 |
| | | | | | |||||
* | | | | | gl_shader_decompiler: Decorate output attributes with XFB layout | ReinUsesLisp | 2020-03-13 | 1 | -29/+105 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | We sometimes have to slice attributes in different parts. This is needed for example in instances where the game feedbacks 3 components but writes 4 from the shader (something that is possible with GL_NV_transform_feedback). | ||||
* | | | | | shader/transform_feedback: Add host API friendly TFB builder | ReinUsesLisp | 2020-03-13 | 3 | -0/+138 |
| |_|_|/ |/| | | | |||||
* | | | | Merge branch 'master' into shader-purge | Rodrigo Locatti | 2020-03-13 | 11 | -32/+148 |
|\ \ \ \ | | |/ / | |/| | | |||||
| * | | | Merge pull request #3491 from ReinUsesLisp/polygon-modes | bunnei | 2020-03-13 | 7 | -2/+99 |
| |\ \ \ | | |_|/ | |/| | | gl_rasterizer: Implement polygon modes and fill rectangles | ||||
| | * | | gl_rasterizer: Implement polygon modes and fill rectangles | ReinUsesLisp | 2020-03-10 | 7 | -2/+99 |
| | | | | |||||
| * | | | Merge pull request #3483 from namkazt/patch-1 | Fernando Sahmkow | 2020-03-13 | 1 | -1/+1 |
| |\ \ \ | | | | | | | | | | | vk_rasterizer: fix mistype on SetupGraphicsImages | ||||
| | * | | | vk_reasterizer: fix mistype on SetupGraphicsImages | Nguyen Dac Nam | 2020-03-08 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | This should use Maxwell3D engine. Fixed some GPU error on Kirby and maybe other games. | ||||
| * | | | | Merge pull request #3480 from ReinUsesLisp/vk-disabled-ubo | Fernando Sahmkow | 2020-03-13 | 2 | -1/+9 |
| |\ \ \ \ | | |_|_|/ | |/| | | | vk_rasterizer: Support disabled uniform buffers | ||||
| | * | | | vk_rasterizer: Support disabled uniform buffers | ReinUsesLisp | 2020-03-06 | 2 | -1/+9 |
| | | | | | |||||
| * | | | | gl_shader_manager: Fix interaction between graphics and compute | ReinUsesLisp | 2020-03-11 | 4 | -29/+39 |
| | |_|/ | |/| | | | | | | | | | | | | | | | | | | | | | | After a compute shader was set to the pipeline, no graphics shader was invoked again. To address this use glUseProgram to bind compute shaders (without state tracking) and call glUseProgram(0) when transitioning out of it back to the graphics pipeline. | ||||
* | | | | gl_shader_decompiler: Initialize gl_Position on vertex shaders | ReinUsesLisp | 2020-03-13 | 1 | -0/+4 |
| | | | | |||||
* | | | | gl_shader_decompiler: Add missing {} on smem GLSL emission | ReinUsesLisp | 2020-03-13 | 1 | -1/+1 |
| | | | | |||||
* | | | | gl_shader_decompiler: Fix regression in render target declarations | ReinUsesLisp | 2020-03-12 | 1 | -12/+2 |
| | | | | | | | | | | | | | | | | | | | | A previous commit introduced a way to declare as few render targets as possible. Turns out this introduced a regression in some games. | ||||
* | | | | engines/maxwell_3d: Add TFB registers and store them in shader registry | ReinUsesLisp | 2020-03-09 | 4 | -6/+45 |
| | | | | |||||
* | | | | shader/registry: Address feedback | ReinUsesLisp | 2020-03-09 | 3 | -13/+18 |
| | | | | |||||
* | | | | gl_shader_decompiler: Add identifier to decompiled code | ReinUsesLisp | 2020-03-09 | 3 | -8/+16 |
| | | | | |||||
* | | | | gl_shader_decompiler: Roll back to GLSL core 430 | ReinUsesLisp | 2020-03-09 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | RenderDoc won't build shaders if we use GLSL compatibility. | ||||
* | | | | const_buffer_engine_interface: Store component types | ReinUsesLisp | 2020-03-09 | 4 | -46/+27 |
| | | | | | | | | | | | | | | | | | | | | This is required for Vulkan. Sampling integer textures with float handles is illegal. | ||||
* | | | | yuzu/loading_screen: Remove unused shader progress mode | ReinUsesLisp | 2020-03-09 | 1 | -1/+0 |
| | | | | |||||
* | | | | gl_shader_cache: Reduce registry consistency to debug assert | ReinUsesLisp | 2020-03-09 | 1 | -3/+1 |
| | | | | | | | | | | | | | | | | | | | | Registry consistency is something that practically can't happen and it has a measurable runtime cost. Reduce it to a DEBUG_ASSERT. | ||||
* | | | | shader/registry: Cache tessellation state | ReinUsesLisp | 2020-03-09 | 3 | -3/+10 |
| | | | | |||||
* | | | | shader/registry: Store graphics and compute metadata | ReinUsesLisp | 2020-03-09 | 8 | -75/+176 |
| | | | | | | | | | | | | | | | | | | | | Store information GLSL forces us to provide but it's dynamic state in hardware (workgroup sizes, primitive topology, shared memory size). | ||||
* | | | | video_core: Rename "const buffer locker" to "registry" | ReinUsesLisp | 2020-03-09 | 15 | -93/+98 |
| | | | | |||||
* | | | | gl_shader_cache: Rework shader cache and remove post-specializations | ReinUsesLisp | 2020-03-09 | 17 | -1092/+544 |
|/ / / | | | | | | | | | | | | | Instead of pre-specializing shaders and then post-specializing them, drop the later and only "specialize" the shader while decoding it. | ||||
* | | | Merge pull request #3301 from ReinUsesLisp/state-tracker | Rodrigo Locatti | 2020-03-09 | 48 | -1889/+1563 |
|\ \ \ | | | | | | | | | video_core: Remove gl_state and use a state tracker based on dirty flags | ||||
| * | | | video_core/dirty_flags: Address feedback | ReinUsesLisp | 2020-02-28 | 1 | -4/+4 |
| | | | | |||||
| * | | | renderer_opengl: Fix edge-case where alpha testing might cull presentation | ReinUsesLisp | 2020-02-28 | 2 | -0/+7 |
| | | | | |||||
| * | | | gl_texture_cache: Remove blending disable on blits | ReinUsesLisp | 2020-02-28 | 1 | -5/+0 |
| | | | | | | | | | | | | | | | | | | | | Blending doesn't affect blits. Rasterizer discard does, update the commentaries. | ||||
| * | | | gl_rasterizer: Don't disable blending on clears | ReinUsesLisp | 2020-02-28 | 1 | -4/+0 |
| | | | | | | | | | | | | | | | | Blending doesn't affect clears. | ||||
| * | | | dirty_flags: Deduplicate code between OpenGL and Vulkan | ReinUsesLisp | 2020-02-28 | 5 | -77/+73 |
| | | | | |||||
| * | | | vk_rasterizer: Pass Maxwell registers to dynamic updates | ReinUsesLisp | 2020-02-28 | 2 | -26/+21 |
| | | | | |||||
| * | | | state_tracker: Remove type traits with named structures | ReinUsesLisp | 2020-02-28 | 4 | -18/+22 |
| | | | | |||||
| * | | | vk_state_tracker: Implement dirty flags for stencil properties | ReinUsesLisp | 2020-02-28 | 3 | -0/+21 |
| | | | | |||||
| * | | | vk_state_tracker: Implement dirty flags for depth bounds | ReinUsesLisp | 2020-02-28 | 3 | -0/+14 |
| | | | | |||||
| * | | | vk_state_tracker: Implement dirty flags for blend constants | ReinUsesLisp | 2020-02-28 | 3 | -0/+14 |
| | | | | |||||
| * | | | vk_state_tracker: Implement dirty flags for depth bias | ReinUsesLisp | 2020-02-28 | 3 | -0/+17 |
| | | | | |||||
| * | | | vk_state_tracker: Implement dirty flags for scissors | ReinUsesLisp | 2020-02-28 | 3 | -0/+14 |
| | | | | |||||
| * | | | vk_state_tracker: Initial implementation | ReinUsesLisp | 2020-02-28 | 10 | -52/+198 |
| | | | | | | | | | | | | | | | | Add support for render targets and viewports. | ||||
| * | | | gl_rasterizer: Remove num vertex buffers magic number | ReinUsesLisp | 2020-02-28 | 1 | -2/+4 |
| | | | | |||||
| * | | | gl_rasterizer: Only apply polygon offset clamp if enabled | ReinUsesLisp | 2020-02-28 | 1 | -3/+6 |
| | | | | |||||
| * | | | gl_state_tracker: Implement dirty flags for depth clamp enabling | ReinUsesLisp | 2020-02-28 | 3 | -3/+15 |
| | | | | |||||
| * | | | gl_rasterizer: Disable scissor 0 when scissor is not used on clear | ReinUsesLisp | 2020-02-28 | 1 | -0/+3 |
| | | | | |||||
| * | | | gl_rasterizer: Notify depth mask changes on clear | ReinUsesLisp | 2020-02-28 | 2 | -1/+6 |
| | | | | |||||
| * | | | gl_rasterizer: Minor sort changes to clearing | ReinUsesLisp | 2020-02-28 | 1 | -11/+9 |
| | | | | |||||
| * | | | maxwell_3d: Use two tables instead of three for dirty flags | ReinUsesLisp | 2020-02-28 | 1 | -1/+1 |
| | | | | |||||
| * | | | gl_state_tracker: Track state of index buffers | ReinUsesLisp | 2020-02-28 | 4 | -5/+23 |
| | | | | |||||
| * | | | gl_state_tracker: Implement dirty flags for clip control | ReinUsesLisp | 2020-02-28 | 5 | -15/+31 |
| | | | | |||||
| * | | | gl_state_tracker: Implement dirty flags for point sizes | ReinUsesLisp | 2020-02-28 | 3 | -4/+25 |
| | | | | |||||
| * | | | gl_state_tracker: Implement dirty flags for fragment color clamp | ReinUsesLisp | 2020-02-28 | 3 | -2/+14 |
| | | | | |||||
| * | | | gl_state_tracker: Implement dirty flags for logic op | ReinUsesLisp | 2020-02-28 | 4 | -2/+22 |
| | | | | |||||
| * | | | gl_state_tracker: Implement dirty flags for sRGB | ReinUsesLisp | 2020-02-28 | 5 | -2/+21 |
| | | | | |||||
| * | | | gl_state_tracker: Implement dirty flags for rasterize enable | ReinUsesLisp | 2020-02-28 | 5 | -2/+21 |
| | | | | |||||
| * | | | gl_state_tracker: Implement dirty flags for multisample | ReinUsesLisp | 2020-02-28 | 3 | -0/+13 |
| | | | | |||||
| * | | | gl_state_tracker: Implement dirty flags for alpha testing | ReinUsesLisp | 2020-02-28 | 4 | -6/+24 |
| | | | | |||||
| * | | | gl_state_tracker: Implement dirty flags for polygon offsets | ReinUsesLisp | 2020-02-28 | 4 | -2/+24 |
| | | | | |||||
| * | | | gl_state_tracker: Implement dirty flags for primitive restart | ReinUsesLisp | 2020-02-28 | 3 | -5/+19 |
| | | | | |||||
| * | | | gl_state_tracker: Implement dirty flags for stencil testing | ReinUsesLisp | 2020-02-28 | 4 | -3/+29 |
| | | | | |||||
| * | | | gl_state_tracker: Implement depth dirty flags | ReinUsesLisp | 2020-02-28 | 4 | -6/+31 |
| | | | | |||||
| * | | | gl_state_tracker: Implement dirty flags for front face and culling | ReinUsesLisp | 2020-02-28 | 4 | -7/+38 |
| | | | | |||||
| * | | | gl_state_tracker: Implement dirty flags for blending | ReinUsesLisp | 2020-02-28 | 5 | -14/+67 |
| | | | | |||||
| * | | | gl_state_tracker: Implement dirty flags for clip distances and shaders | ReinUsesLisp | 2020-02-28 | 7 | -14/+43 |
| | | | | |||||
| * | | | gl_state_tracker: Add dirty flags for buffers and divisors | ReinUsesLisp | 2020-02-28 | 4 | -22/+56 |
| | | | | |||||
| * | | | maxwell_3d: Change write dirty flags to a bitset | ReinUsesLisp | 2020-02-28 | 3 | -16/+16 |
| | | | | |||||
| * | | | gl_state_tracker: Implement dirty flags for vertex formats | ReinUsesLisp | 2020-02-28 | 4 | -9/+44 |
| | | | | |||||
| * | | | gl_state_tracker: Implement dirty flags for color masks | ReinUsesLisp | 2020-02-28 | 4 | -9/+53 |
| | | | | |||||
| * | | | gl_state_tracker: Implement dirty flags for scissors | ReinUsesLisp | 2020-02-28 | 5 | -10/+58 |
| | | | | |||||
| * | | | gl_state_tracker: Implement dirty flags for viewports | ReinUsesLisp | 2020-02-28 | 4 | -9/+54 |
| | | | | |||||
| * | | | renderer_opengl: Reintroduce dirty flags for render targets | ReinUsesLisp | 2020-02-28 | 9 | -13/+195 |
| | | | | |||||
| * | | | maxwell_3d: Flatten cull and front face registers | ReinUsesLisp | 2020-02-28 | 8 | -50/+47 |
| | | | | |||||
| * | | | video_core: Reintroduce dirty flags infrastructure | ReinUsesLisp | 2020-02-28 | 9 | -1/+71 |
| | | | | |||||
| * | | | gl_state: Remove completely | ReinUsesLisp | 2020-02-28 | 13 | -152/+4 |
| | | | | |||||
| * | | | gl_state: Remove program tracking | ReinUsesLisp | 2020-02-28 | 9 | -94/+62 |
| | | | | |||||
| * | | | gl_state: Remove framebuffer tracking | ReinUsesLisp | 2020-02-28 | 7 | -82/+23 |
| | | | | |||||
| * | | | gl_state: Remove image tracking | ReinUsesLisp | 2020-02-28 | 5 | -24/+12 |
| | | | | |||||
| * | | | gl_state: Remove texture and sampler tracking | ReinUsesLisp | 2020-02-28 | 5 | -60/+8 |
| | | | | |||||
| * | | | gl_state: Remove blend state tracking | ReinUsesLisp | 2020-02-28 | 5 | -104/+28 |
| | | | | |||||
| * | | | gl_state: Remove stencil test tracking | ReinUsesLisp | 2020-02-28 | 4 | -92/+18 |
| | | | | |||||
| * | | | gl_state: Remove clip control tracking | ReinUsesLisp | 2020-02-28 | 5 | -19/+8 |
| | | | | |||||
| * | | | gl_state: Remove clip distances tracking | ReinUsesLisp | 2020-02-28 | 4 | -29/+3 |
| | | | | |||||
| * | | | gl_state: Remove rasterizer disable tracking | ReinUsesLisp | 2020-02-28 | 6 | -13/+8 |
| | | | | |||||
| * | | | gl_state: Remove viewport and depth range tracking | ReinUsesLisp | 2020-02-28 | 7 | -101/+39 |
| | | | | |||||
| * | | | gl_state: Remove scissor test tracking | ReinUsesLisp | 2020-02-28 | 6 | -69/+12 |
| | | | | |||||
| * | | | gl_state: Remove color mask tracking | ReinUsesLisp | 2020-02-28 | 4 | -40/+12 |
| | | | | |||||
| * | | | gl_state: Remove clamp framebuffer color tracking | ReinUsesLisp | 2020-02-28 | 3 | -17/+6 |
| | | | | | | | | | | | | | | | | | | | | This commit doesn't reset it for screen draws because clamping doesn't change anything there. | ||||
| * | | | gl_state: Remove multisample tracking | ReinUsesLisp | 2020-02-28 | 3 | -16/+2 |
| | | | | |||||
| * | | | gl_state: Remove framebuffer sRGB tracking | ReinUsesLisp | 2020-02-28 | 6 | -21/+25 |
| | | | | |||||
| * | | | gl_state: Remove VAO cache and tracking | ReinUsesLisp | 2020-02-28 | 10 | -153/+53 |
| | | | | |||||
| * | | | gl_state: Remove depth clamp tracking | ReinUsesLisp | 2020-02-28 | 4 | -25/+13 |
| | | | | |||||
| * | | | gl_state: Remove depth tracking | ReinUsesLisp | 2020-02-28 | 4 | -34/+7 |
| | | | | |||||
| * | | | gl_state: Remove primitive restart tracking | ReinUsesLisp | 2020-02-28 | 3 | -18/+2 |
| | | | | |||||
| * | | | gl_state: Remove logic op tracker | ReinUsesLisp | 2020-02-28 | 4 | -24/+5 |
| | | | | |||||
| * | | | gl_state: Remove blend color tracking | ReinUsesLisp | 2020-02-28 | 3 | -18/+1 |
| | | | | |||||
| * | | | gl_state: Remove polygon offset tracking | ReinUsesLisp | 2020-02-28 | 4 | -39/+7 |
| | | | | |||||
| * | | | gl_state: Remove alpha test tracking | ReinUsesLisp | 2020-02-28 | 4 | -21/+4 |
| | | | | |||||
| * | | | gl_state: Remove cull mode tracking | ReinUsesLisp | 2020-02-28 | 4 | -19/+4 |
| | | | | |||||
| * | | | gl_state: Remove front face tracking | ReinUsesLisp | 2020-02-28 | 4 | -6/+5 |
| | | | | |||||
| * | | | gl_state: Remove point size tracking | ReinUsesLisp | 2020-02-28 | 3 | -22/+4 |
| | | | | |||||
| * | | | gl_rasterizer: Add oglEnablei helper | ReinUsesLisp | 2020-02-28 | 1 | -0/+4 |
| | | | | |||||
| * | | | gl_rasterizer: Add OpenGL enable/disable helper | ReinUsesLisp | 2020-02-28 | 1 | -0/+4 |
| | | | | |||||
| * | | | gl_rasterizer: Remove dirty flags | ReinUsesLisp | 2020-02-28 | 18 | -457/+7 |
| | | | | |||||
* | | | | textures: Fix anisotropy hack | ReinUsesLisp | 2020-03-08 | 1 | -14/+16 |
| | | | | | | | | | | | | | | | | Previous code could generate an anisotropy value way higher than x16. | ||||
* | | | | Merge pull request #3452 from Morph1984/anisotropic-filtering | bunnei | 2020-03-08 | 2 | -2/+24 |
|\ \ \ \ | |_|_|/ |/| | | | frontend/Graphics: Add "Advanced" graphics tab and experimental Anisotropic Filtering support | ||||
| * | | | Create an "Advanced" tab in the graphics configuration tab and add anisotropic filtering levels. | Morph | 2020-02-28 | 2 | -2/+24 |
| | | | | |||||
* | | | | Merge pull request #3481 from ReinUsesLisp/abgr5-storage | bunnei | 2020-03-08 | 1 | -1/+1 |
|\ \ \ \ | | | | | | | | | | | maxwell_to_vk: Remove Storage capability for A1B5G5R5U | ||||
| * | | | | maxwell_to_vk: Remove Storage capability for A1B5G5R5U | ReinUsesLisp | 2020-03-06 | 1 | -1/+1 |
| | |_|/ | |/| | | |||||
* | | | | Merge pull request #3463 from ReinUsesLisp/vk-toctou | bunnei | 2020-03-06 | 1 | -9/+12 |
|\ \ \ \ | |/ / / |/| | | | vk_swapchain: Silence TOCTOU race condition | ||||
| * | | | vk_swapchain: Silence TOCTOU race condition | ReinUsesLisp | 2020-02-26 | 1 | -9/+12 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | It's possible that the window is resized from the moment we ask for its size to the moment a swapchain is created, causing validation issues. To workaround this Vulkan issue request the capabilities again just before creating the swapchain, making the race condition less likely. | ||||
* | | | | Merge pull request #3451 from ReinUsesLisp/indexed-textures | bunnei | 2020-03-05 | 8 | -74/+107 |
|\ \ \ \ | | | | | | | | | | | vk_shader_decompiler: Implement indexed textures | ||||
| * | | | | vk_shader_decompiler: Implement indexed textures | ReinUsesLisp | 2020-02-24 | 6 | -54/+99 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Implement accessing textures through an index. It uses the same interface as OpenGL, the main difference is that Vulkan bindings are forced to be arrayed (the binding index doesn't change for stacked textures in SPIR-V). | ||||
| * | | | | shader: Simplify indexed sampler usages | ReinUsesLisp | 2020-02-24 | 2 | -20/+8 |
| | | | | | |||||
* | | | | | Merge pull request #3469 from namkazt/patch-1 | bunnei | 2020-03-05 | 1 | -6/+14 |
|\ \ \ \ \ | | | | | | | | | | | | | shader_decode: Fix LD, LDG when track constant buffer | ||||
| * | | | | | nit: move comment to right place. | Nguyen Dac Nam | 2020-02-29 | 1 | -2/+2 |
| | | | | | | |||||
| * | | | | | shader_decode: Fix LD, LDG when track constant buffer | Nguyen Dac Nam | 2020-02-28 | 1 | -4/+12 |
| | |_|/ / | |/| | | | |||||
* | | | | | Merge pull request #3455 from ReinUsesLisp/attr-scaled | bunnei | 2020-03-04 | 3 | -4/+40 |
|\ \ \ \ \ | |_|_|_|/ |/| | | | | video_core: Implement more scaled attribute formats | ||||
| * | | | | video_core: Implement more scaler attribute formats | ReinUsesLisp | 2020-02-24 | 3 | -4/+40 |
| | |/ / | |/| | | | | | | | | | | | | | | While changing this, fix assert in vk_shader_decompiler. We now know scaled formats are expected to be float in shaders attributes. | ||||
* | | | | renderer_opengl: Fix SRGB presentation frame tracking. | bunnei | 2020-02-28 | 2 | -5/+2 |
| |/ / |/| | | | | | | | | - Fixes SRGB in Super Smash Bros. Ultimate. | ||||
* | | | Merge pull request #3430 from bunnei/split-presenter | bunnei | 2020-02-28 | 10 | -47/+353 |
|\ \ \ | | | | | | | | | Port citra-emu/citra#4940: "Split Presentation thread from Render thread" | ||||
| * | | | renderer_opengl: Reduce swap chain size to 3. | bunnei | 2020-02-28 | 1 | -3/+2 |
| | | | | |||||
| * | | | renderer_opengl: Use more concise lock syntax. | bunnei | 2020-02-27 | 1 | -4/+4 |
| | | | | |||||
| * | | | renderer_opengl: Move Frame/FrameMailbox to OpenGL namespace. | bunnei | 2020-02-27 | 2 | -36/+42 |
| | | | | |||||
| * | | | renderer_opengl: Create gl_framebuffer_data if empty. | bunnei | 2020-02-26 | 1 | -1/+2 |
| | | | | |||||
| * | | | frontend: qt: bootmanager: Vulkan: Restore support for VK backend. | bunnei | 2020-02-26 | 2 | -9/+14 |
| | | | | |||||
| * | | | core: frontend: Refactor scope_acquire_window_context to scope_acquire_context. | bunnei | 2020-02-26 | 1 | -2/+2 |
| | | | | |||||
| * | | | renderer_opengl: Add texture mailbox support for presenter thread. | bunnei | 2020-02-26 | 3 | -35/+268 |
| | | | | |||||
| * | | | renderer_opengl: Add OGLRenderbuffer to resource/state management. | bunnei | 2020-02-26 | 4 | -0/+62 |
| | |/ | |/| | |||||
* | | | shader: FMUL switch to using LUT (#3441) | Nguyen Dac Nam | 2020-02-27 | 1 | -19/+14 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * shader: add FmulPostFactor LUT table * shader: FMUL apply LUT * Update src/video_core/engines/shader_bytecode.h Co-Authored-By: Mat M. <mathew1800@gmail.com> * nit: mistype * clang-format & add missing import * shader: remove post factor LUT. * shader: move post factor LUT to function and fix incorrect order. * clang-format * shader: FMUL: add static to post factor LUT * nit: typo Co-authored-by: Mat M. <mathew1800@gmail.com> | ||||
* | | | Merge pull request #3440 from namkazt/patch-6 | bunnei | 2020-02-26 | 1 | -36/+58 |
|\ \ \ | |/ / |/| | | shader: implement LOP3 fast replace for old function | ||||
| * | | nit: add const to where it need. | Nguyen Dac Nam | 2020-02-21 | 1 | -14/+14 |
| | | | |||||
| * | | shader: implement LOP3 fast replace for old function | Nguyen Dac Nam | 2020-02-21 | 1 | -36/+58 |
| | | | | | | | | | ref: https://devtalk.nvidia.com/default/topic/1070081/cuda-programming-and-performance/reverse-lut-for-lop3-lut/ | ||||
* | | | Merge pull request #3461 from ReinUsesLisp/r32i-rt | Mat M | 2020-02-25 | 2 | -0/+3 |
|\ \ \ | | | | | | | | | video_core/surface: Add R32_SINT render target format | ||||
| * | | | video_core/surface: Add R32_SINT render target format | ReinUsesLisp | 2020-02-25 | 2 | -0/+3 |
| | | | | |||||
* | | | | Merge pull request #3460 from ReinUsesLisp/unused-format-getter | Mat M | 2020-02-25 | 2 | -71/+0 |
|\ \ \ \ | |/ / / |/| | | | video_core/gpu: Remove unused functions | ||||
| * | | | video_core/gpu: Remove unused functions | ReinUsesLisp | 2020-02-25 | 2 | -71/+0 |
| | | | | |||||
* | | | | Merge pull request #3417 from ReinUsesLisp/r32i | bunnei | 2020-02-25 | 6 | -34/+46 |
|\ \ \ \ | |/ / / |/| | | | texture: Implement R32I | ||||
| * | | | texture: Implement R32I | ReinUsesLisp | 2020-02-15 | 6 | -34/+46 |
| | | | | |||||
* | | | | Merge pull request #3425 from ReinUsesLisp/layered-framebuffer | bunnei | 2020-02-24 | 8 | -51/+74 |
|\ \ \ \ | |_|_|/ |/| | | | texture_cache: Implement layered framebuffer attachments | ||||
| * | | | texture_cache: Implement layered framebuffer attachments | ReinUsesLisp | 2020-02-16 | 8 | -51/+74 |
| |/ / | | | | | | | | | | | | | | | | Layered framebuffer attachments is a feature that allows applications to write attach layered textures to a single attachment. What layer the fragments are written to is decided from the shader using gl_Layer. | ||||
* | | | Merge pull request #3424 from ReinUsesLisp/spirv-layer | bunnei | 2020-02-23 | 1 | -6/+30 |
|\ \ \ | | | | | | | | | vk_shader_decompiler: Implement Layer output attribute | ||||
| * | | | vk_shader_decompiler: Implement Layer output attribute | ReinUsesLisp | 2020-02-16 | 1 | -6/+30 |
| |/ / | | | | | | | | | | | | | | | | SPIR-V's Layer is GLSL's gl_Layer. It lets the application choose from a shader stage (vertex, tessellation or geometry) which framebuffer layer write the output fragments to. | ||||
* | | | Merge pull request #3422 from ReinUsesLisp/buffer-flush | bunnei | 2020-02-23 | 2 | -0/+11 |
|\ \ \ | | | | | | | | | surface_base: Implement texture buffer flushes | ||||
| * | | | surface_base: Implement texture buffer flushes | ReinUsesLisp | 2020-02-16 | 2 | -0/+11 |
| |/ / | | | | | | | | | | | | | Implement downloads to guest memory from texture buffers on the generic cache and OpenGL. | ||||
* | | | shader/texture: Fix illegal 3D texture assert | ReinUsesLisp | 2020-02-21 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | Fix typo in the illegal 3D texture assert logic. We care about catching arrayed 3D textures or 3D shadow textures, not regular 3D textures. | ||||
* | | | Merge pull request #3433 from namkazt/patch-1 | Rodrigo Locatti | 2020-02-21 | 1 | -3/+8 |
|\ \ \ | | | | | | | | | renderer_vulkan: Add the rest of case for TryConvertBorderColor | ||||
| * | | | fixups mistake auto commit. | Nguyen Dac Nam | 2020-02-18 | 1 | -9/+0 |
| | | | | |||||
| * | | | Update code structure | Nguyen Dac Nam | 2020-02-18 | 1 | -0/+7 |
| | | | | | | | | | | | | Co-Authored-By: Mat M. <mathew1800@gmail.com> | ||||
| * | | | renderer_vulkan: Add the rest of case for TryConvertBorderColor | Nguyen Dac Nam | 2020-02-18 | 1 | -3/+10 |
| | | | | |||||
* | | | | Merge pull request #3434 from namkazt/patch-2 | Rodrigo Locatti | 2020-02-21 | 3 | -2/+25 |
|\ \ \ \ | | | | | | | | | | | vk_shader: Implement ImageLoad | ||||
| * | | | | vk_device: remove left over from other branch | Nguyen Dac Nam | 2020-02-21 | 1 | -1/+0 |
| | | | | | |||||
| * | | | | clang-format | Nguyen Dac Nam | 2020-02-20 | 1 | -1/+1 |
| | | | | | |||||
| * | | | | shader_decompiler: only add StorageImageReadWithoutFormat when available | Nguyen Dac Nam | 2020-02-20 | 1 | -1/+4 |
| | | | | | |||||
| * | | | | shader_decompiler: add check in case of device not support ShaderStorageImageReadWithoutFormat | Nguyen Dac Nam | 2020-02-19 | 1 | -0/+4 |
| | | | | | |||||
| * | | | | vk_device: setup shaderStorageImageReadWithoutFormat | Nguyen Dac Nam | 2020-02-19 | 1 | -0/+5 |
| | | | | | |||||
| * | | | | vk_device: add check for shaderStorageImageReadWithoutFormat | Nguyen Dac Nam | 2020-02-19 | 1 | -0/+7 |
| | | | | | |||||
| * | | | | vk_shader: add Capability StorageImageReadWithoutFormat | Nguyen Dac Nam | 2020-02-19 | 1 | -0/+1 |
| | | | | | |||||
| * | | | | vk_shader: Implement function ImageLoad (Used by Kirby Start Allies) | Nguyen Dac Nam | 2020-02-19 | 1 | -2/+6 |
| |/ / / | | | | | | | | | | | | | Please enter the commit message for your changes. Lines starting | ||||
* | | | | Merge pull request #3435 from namkazt/patch-3 | Rodrigo Locatti | 2020-02-21 | 2 | -1/+4 |
|\ \ \ \ | | | | | | | | | | | vulkan: add DXT23_SRGB | ||||
| * | | | | add vertex UnsignedInt size RGBA | Nguyen Dac Nam | 2020-02-18 | 1 | -0/+2 |
| | | | | | |||||
| * | | | | add eBc2SrgbBlock to formats | Nguyen Dac Nam | 2020-02-18 | 1 | -0/+1 |
| | | | | | |||||
| * | | | | vulkan: add DXT23_SRGB | Nguyen Dac Nam | 2020-02-18 | 1 | -1/+1 |
| |/ / / | |||||
* | | | | Merge pull request #3423 from ReinUsesLisp/no-match-3d | bunnei | 2020-02-21 | 1 | -8/+11 |
|\ \ \ \ | |_|_|/ |/| | | | texture_cache: Avoid matches in 3D textures | ||||
| * | | | texture_cache: Avoid matches in 3D textures | ReinUsesLisp | 2020-02-16 | 1 | -8/+11 |
| | |/ | |/| | | | | | | | | | | Code before this commit was trying to match 3D textures with another target. Fix that. | ||||
* | | | Merge pull request #3438 from bunnei/gpu-mem-manager-fix | bunnei | 2020-02-21 | 1 | -1/+10 |
|\ \ \ | | | | | | | | | video_core: memory_manager: Flush/invalidate asynchronously when possible. | ||||
| * | | | video_core: memory_manager: Flush/invalidate asynchronously on Unmap. | bunnei | 2020-02-20 | 1 | -1/+10 |
| | | | | | | | | | | | | | | | | - Minor perf improvement. | ||||
* | | | | Merge pull request #3415 from ReinUsesLisp/texture-code | bunnei | 2020-02-20 | 1 | -43/+28 |
|\ \ \ \ | | | | | | | | | | | shader/texture: Allow 2D shadow arrays and simplify code | ||||
| * | | | | shader/texture: Allow 2D shadow arrays and simplify code | ReinUsesLisp | 2020-02-15 | 1 | -43/+28 |
| | |/ / | |/| | | | | | | | | | | | | | | | | | | | | | | Shadow sampler 2D arrays are supported on OpenGL, so there's no reason to forbid these. Enable textureLod usage on these. Minor style changes. | ||||
* | | | | Merge pull request #3414 from ReinUsesLisp/maxwell-3d-draw | bunnei | 2020-02-19 | 6 | -36/+6 |
|\ \ \ \ | | | | | | | | | | | maxwell_3d: Unify draw methods | ||||
| * | | | | maxwell_3d: Unify draw methods | ReinUsesLisp | 2020-02-14 | 6 | -36/+6 |
| |/ / / | | | | | | | | | | | | | | | | | Pass instanced state of a draw invocation as an argument instead of having two separate virtual methods. | ||||
* | | | | Merge pull request #3411 from ReinUsesLisp/specific-funcs | bunnei | 2020-02-19 | 1 | -8/+28 |
|\ \ \ \ | | | | | | | | | | | gl_rasterizer: Use the least generic OpenGL draw function possible | ||||
| * | | | | gl_rasterizer: Use the least generic OpenGL draw function possible | ReinUsesLisp | 2020-02-14 | 1 | -8/+28 |
| | | | | | | | | | | | | | | | | | | | | This may help some implementations. | ||||
* | | | | | shader_conversion: I2F : add Assert for case src_size is Short | Nguyen Dac Nam | 2020-02-19 | 1 | -0/+3 |
| | | | | | |||||
* | | | | | fix warning | Nguyen Dac Nam | 2020-02-19 | 1 | -1/+1 |
| | | | | | |||||
* | | | | | clang-format fix | Nguyen Dac Nam | 2020-02-19 | 1 | -1/+1 |
| | | | | | |||||
* | | | | | shader_conversion: add conversion I2F for Short | Nguyen Dac Nam | 2020-02-19 | 1 | -9/+6 |
| |_|/ / |/| | | | |||||
* | | | | Merge pull request #3410 from ReinUsesLisp/vk-draw-index | bunnei | 2020-02-19 | 1 | -4/+13 |
|\ \ \ \ | | | | | | | | | | | vk_shader_decompiler: Fix vertex id and instance id | ||||
| * | | | | vk_shader_decompiler: Fix vertex id and instance id | ReinUsesLisp | 2020-02-14 | 1 | -4/+13 |
| |/ / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Vulkan's VertexIndex and InstanceIndex don't match with hardware. This is because Nvidia implements gl_VertexID and gl_InstanceID. The math that relates these is: gl_VertexIndex = gl_BaseVertex + gl_VertexID gl_InstanceIndex = gl_InstanceIndex + gl_InstanceID To emulate it using what Vulkan's SPIR-V offers (the *Index variants) this commit substracts gl_Base* from gl_*Index to obtain the OpenGL and hardware's equivalent. | ||||
* | | | | Merge pull request #3409 from ReinUsesLisp/host-queries | Fernando Sahmkow | 2020-02-18 | 18 | -51/+1043 |
|\ \ \ \ | |_|_|/ |/| | | | query_cache: Implement a query cache and query 21 (samples passed) | ||||
| * | | | query_cache: Address feedback | ReinUsesLisp | 2020-02-14 | 2 | -16/+18 |
| | | | | |||||
| * | | | query_cache: Fix ambiguity in CacheAddr getter | ReinUsesLisp | 2020-02-14 | 1 | -4/+5 |
| | | | | |||||
| * | | | query_cache: Add a recursive mutex for concurrent usage | ReinUsesLisp | 2020-02-14 | 1 | -0/+6 |
| | | | | |||||
| * | | | vk_query_cache: Implement generic query cache on Vulkan | ReinUsesLisp | 2020-02-14 | 11 | -20/+327 |
| | | | | |||||
| * | | | query_cache: Abstract OpenGL implementation | ReinUsesLisp | 2020-02-14 | 4 | -339/+394 |
| | | | | | | | | | | | | | | | | | | | | | | | | Abstract the current OpenGL implementation into the VideoCommon namespace and reimplement it on top of that. Doing this avoids repeating code and logic in the Vulkan implementation. | ||||
| * | | | gl_query_cache: Optimize query cache | ReinUsesLisp | 2020-02-14 | 6 | -79/+217 |
| | | | | | | | | | | | | | | | | Use a custom cache instead of relying on a ranged cache. | ||||
| * | | | gl_query_cache: Implement host queries using a deferred cache | ReinUsesLisp | 2020-02-14 | 7 | -86/+328 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Instead of waiting immediately for executed commands, defer the query until the guest CPU reads it. This way we get closer to what the guest program is doing. To archive this we have to build a dependency queue, because host APIs (like OpenGL and Vulkan) use ranged queries instead of counters like NVN. Waiting for queries implicitly uses fences and this requires a command being queued, otherwise the driver will lock waiting until a timeout. To fix this when there are no commands queued, we explicitly call glFlush. | ||||
| * | | | gl_rasterizer: Sort method declarations | ReinUsesLisp | 2020-02-14 | 1 | -16/+15 |
| | | | | |||||
| * | | | gl_rasterizer: Add queued commands counter | ReinUsesLisp | 2020-02-14 | 2 | -0/+16 |
| | | | | | | | | | | | | | | | | | | | | | | | | Keep track of the queued OpenGL commands that can signal a fence if waited on. As a side effect, we avoid calls to glFlush when no commands are queued. | ||||
| * | | | maxwell_3d: Slow implementation of passed samples (query 21) | ReinUsesLisp | 2020-02-14 | 8 | -17/+201 |
| | | | | | | | | | | | | | | | | Implements GL_SAMPLES_PASSED by waiting immediately for queries. | ||||
| * | | | gl_resource_manager: Add managed query class | ReinUsesLisp | 2020-02-14 | 2 | -0/+42 |
| | | | | |||||
* | | | | Revert "video_core: memory_manager: Use GPU interface for cache functions." | bunnei | 2020-02-15 | 3 | -9/+14 |
| |_|/ |/| | | |||||
* | | | Merge pull request #3379 from ReinUsesLisp/cbuf-offset | bunnei | 2020-02-14 | 3 | -5/+5 |
|\ \ \ | |/ / |/| | | shader/decode: Fix constant buffer offsets | ||||
| * | | shader/decode: Fix constant buffer offsets | ReinUsesLisp | 2020-02-05 | 3 | -5/+5 |
| | | | | | | | | | | | | | | | | | | Some instances were using cbuf34.offset instead of cbuf34.GetOffset(). This returned the an invalid offset. Address those instances and rename offset to "shifted_offset" to avoid future bugs. | ||||
* | | | Merge pull request #3395 from FernandoS27/queries | bunnei | 2020-02-14 | 4 | -52/+73 |
|\ \ \ | |_|/ |/| | | GPU: Refactor queries implementation and correct GPU Clock. | ||||
| * | | GPU: Address Feedback. | Fernando Sahmkow | 2020-02-13 | 2 | -11/+10 |
| | | | |||||
| * | | GPU: Implement GPU Clock correctly. | Fernando Sahmkow | 2020-02-10 | 3 | -2/+17 |
| | | | |||||
| * | | Maxwell3D: Correct query reporting. | Fernando Sahmkow | 2020-02-10 | 2 | -51/+58 |
| | | | |||||
* | | | Merge pull request #3376 from ReinUsesLisp/point-sprite | bunnei | 2020-02-11 | 4 | -1/+9 |
|\ \ \ | |/ / |/| | | gl_rasterizer: Implement GL_POINT_SPRITE | ||||
| * | | gl_rasterizer: Implement GL_POINT_SPRITE | ReinUsesLisp | 2020-02-04 | 4 | -1/+9 |
| | | | | | | | | | | | | | | | | | | OpenGL core defaults to GL_POINT_SPRITE, meanwhile on OpenGL compatibility we have to explicitly enable it. This fixes gl_PointCoord's behaviour. | ||||
* | | | Merge pull request #3372 from ReinUsesLisp/fix-back-stencil | bunnei | 2020-02-10 | 1 | -3/+3 |
|\ \ \ | | | | | | | | | maxwell_3d: Fix stencil back mask | ||||
| * | | | maxwell_3d: Fix stencil back mask | ReinUsesLisp | 2020-02-02 | 1 | -3/+3 |
| | | | | |||||
* | | | | Merge pull request #3387 from bunnei/gpu-mpscqueue | bunnei | 2020-02-09 | 1 | -1/+1 |
|\ \ \ \ | | | | | | | | | | | gpu_thread: Use MPSCQueue for GPU commands. | ||||
| * | | | | gpu_thread: Use MPSCQueue for GPU commands. | bunnei | 2020-02-08 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | - Necessary for multiple service threads. | ||||
* | | | | | video_core: memory_manager: Use GPU interface for cache functions. | bunnei | 2020-02-08 | 3 | -14/+9 |
|/ / / / | |||||
* | | | | Merge pull request #3378 from ReinUsesLisp/uscaled | bunnei | 2020-02-08 | 1 | -8/+12 |
|\ \ \ \ | | | | | | | | | | | maxwell_to_gl: Implement R8G8_USCALED | ||||
| * | | | | maxwell_to_gl: Implement R8G8_USCALED | ReinUsesLisp | 2020-02-05 | 1 | -0/+8 |
| | | | | | |||||
| * | | | | maxwell_to_gl: Reduce unimplemented formats to LOG_ERROR | ReinUsesLisp | 2020-02-05 | 1 | -8/+4 |
| | |_|/ | |/| | | |||||
* | | | | Merge pull request #3369 from ReinUsesLisp/shf | bunnei | 2020-02-08 | 2 | -11/+122 |
|\ \ \ \ | | | | | | | | | | | shader/shift: Implement SHF | ||||
| * | | | | shader/shift: Implement SHIFT_RIGHT_{IMM,R} | ReinUsesLisp | 2020-02-02 | 1 | -26/+58 |
| | | | | | | | | | | | | | | | | | | | | Shifts a pair of registers to the right and returns the low register. | ||||
| * | | | | shader/shift: Implement SHF_LEFT_{IMM,R} | ReinUsesLisp | 2020-02-02 | 2 | -10/+89 |
| | | | | | | | | | | | | | | | | | | | | Shifts a pair of registers to the left and returns the high register. | ||||
* | | | | | Merge pull request #3362 from ReinUsesLisp/fix-instanced | bunnei | 2020-02-07 | 2 | -106/+28 |
|\ \ \ \ \ | |_|/ / / |/| | | | | gl_rasterizer: Fix instanced draw arrays | ||||
| * | | | | gl_rasterizer: Fix instanced draw arrays | ReinUsesLisp | 2020-01-30 | 2 | -106/+28 |
| |/ / / | | | | | | | | | | | | | | | | | | | | | glDrawArrays was being used when the draw had a base instance specified. This commit removes the draw parameters abstraction and fixes the mentioned issue. | ||||
* | | | | vk_rasterizer: Use noexcept variants of std::bitset | ReinUsesLisp | 2020-02-04 | 1 | -4/+5 |
| | | | | | | | | | | | | | | | | Removes bounds checking from "texceptions" instances. | ||||
* | | | | Merge pull request #3357 from ReinUsesLisp/bfi-rc | bunnei | 2020-02-04 | 2 | -2/+7 |
|\ \ \ \ | |_|_|/ |/| | | | shader/bfi: Implement register-constant buffer variant | ||||
| * | | | shader/bfi: Implement register-constant buffer variant | ReinUsesLisp | 2020-01-27 | 2 | -2/+7 |
| | | | | | | | | | | | | | | | | | | | | It's the same as the variant that was implemented, but it takes the operands from another source. | ||||
* | | | | Merge pull request #3356 from ReinUsesLisp/fcmp | bunnei | 2020-02-04 | 2 | -1/+17 |
|\ \ \ \ | | | | | | | | | | | shader/arithmetic: Implement FCMP | ||||
| * | | | | shader/arithmetic: Implement FCMP | ReinUsesLisp | 2020-01-27 | 2 | -1/+17 |
| |/ / / | | | | | | | | | | | | | | | | | Compares the third operand with zero, then selects between the first and second. | ||||
* | | | | Merge pull request #3337 from ReinUsesLisp/vulkan-staged | bunnei | 2020-02-03 | 6 | -7/+294 |
|\ \ \ \ | | | | | | | | | | | yuzu: Implement Vulkan frontend | ||||
| * | | | | yuzu: Implement Vulkan frontend | ReinUsesLisp | 2020-01-29 | 3 | -1/+280 |
| | | | | | | | | | | | | | | | | | | | | | | | | | Adds a Qt and SDL2 frontend for Vulkan. It also finishes the missing bits on Vulkan initialization. | ||||
| * | | | | settings: Add settings for graphics backend | ReinUsesLisp | 2020-01-29 | 1 | -2/+4 |
| | | | | | |||||
| * | | | | shader/other: Fix skips for SYNC and BRK | ReinUsesLisp | 2020-01-29 | 1 | -2/+2 |
| | | | | | |||||
| * | | | | shader/other: Stub S2R LaneId | ReinUsesLisp | 2020-01-29 | 1 | -1/+4 |
| | | | | | |||||
| * | | | | buffer_cache: Delay buffer destructions | ReinUsesLisp | 2020-01-29 | 1 | -1/+4 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Delay buffer destruction some extra frames to avoid destroying buffers that are still being used from older frames. This happens on Nvidia's driver with mailbox. | ||||
* | | | | | shader: Remove curly braces initializers on shared pointers | ReinUsesLisp | 2020-02-02 | 5 | -12/+12 |
| |_|_|/ |/| | | | |||||
* | | | | Merge pull request #3282 from FernandoS27/indexed-samplers | bunnei | 2020-02-02 | 24 | -58/+610 |
|\ \ \ \ | | | | | | | | | | | Partially implement Indexed samplers in general and specific code in GLSL | ||||
| * | | | | Shader_IR: Address feedback. | Fernando Sahmkow | 2020-01-25 | 10 | -36/+40 |
| | | | | | |||||
| * | | | | Shader_IR: Change name of TrackSampler function so it does not confuse with the type. | Fernando Sahmkow | 2020-01-24 | 3 | -7/+10 |
| | | | | | |||||
| * | | | | Shader_IR: Corrections, styling and extras. | Fernando Sahmkow | 2020-01-24 | 1 | -2/+4 |
| | | | | | |||||
| * | | | | Shader_IR: Correct Custom Variable assignment. | Fernando Sahmkow | 2020-01-24 | 2 | -0/+4 |
| | | | | | |||||
| * | | | | Shader_IR: Propagate bindless index into the GL compiler. | Fernando Sahmkow | 2020-01-24 | 5 | -24/+54 |
| | | | | | |||||
| * | | | | Shader_IR: Implement Injectable Custom Variables to the IR. | Fernando Sahmkow | 2020-01-24 | 5 | -1/+70 |
| | | | | | |||||
| * | | | | GL Backend: Introduce indexed samplers into the GL backend | Fernando Sahmkow | 2020-01-24 | 2 | -10/+39 |
| | | | | | |||||
| * | | | | Shader_IR: deduce size of indexed samplers | Fernando Sahmkow | 2020-01-24 | 4 | -8/+60 |
| | | | | | |||||
| * | | | | Shader_IR: Setup Indexed Samplers on the IR | Fernando Sahmkow | 2020-01-24 | 1 | -20/+46 |
| | | | | | |||||
| * | | | | Shader_IR: Implement initial code for tracking indexed samplers. | Fernando Sahmkow | 2020-01-24 | 4 | -0/+139 |
| | | | | | |||||
| * | | | | Shader_IR: Address Feedback | Fernando Sahmkow | 2020-01-24 | 5 | -35/+37 |
| | | | | | |||||
| * | | | | Shader_IR: Allow constant access of guest driver. | Fernando Sahmkow | 2020-01-24 | 7 | -1/+18 |
| | | | | | |||||
| * | | | | Shader_IR: Address Feedback | Fernando Sahmkow | 2020-01-24 | 4 | -21/+29 |
| | | | | | |||||
| * | | | | Guest_driver: Correct compiling errors in GCC. | Fernando Sahmkow | 2020-01-24 | 2 | -1/+5 |
| | | | | | |||||
| * | | | | Shader_IR: Store Bound buffer on Shader Usage | Fernando Sahmkow | 2020-01-24 | 5 | -5/+41 |
| | | | | | |||||
| * | | | | GPU: Implement guest driver profile and deduce texture handler sizes. | Fernando Sahmkow | 2020-01-24 | 13 | -0/+127 |
| | | | | | |||||
* | | | | | Merge pull request #3347 from ReinUsesLisp/local-mem | bunnei | 2020-01-30 | 1 | -30/+55 |
|\ \ \ \ \ | |_|_|_|/ |/| | | | | shader/memory: Implement LDL.S16, LDS.S16, STL.S16 and STS.S16 | ||||
| * | | | | shader/memory: Implement STL.S16 and STS.S16 | ReinUsesLisp | 2020-01-25 | 1 | -3/+10 |
| | | | | | |||||
| * | | | | shader/memory: Implement unaligned LDL.S16 and LDS.S16 | ReinUsesLisp | 2020-01-25 | 1 | -5/+3 |
| | | | | | |||||
| * | | | | shader/memory: Move unaligned load/store to functions | ReinUsesLisp | 2020-01-25 | 1 | -18/+27 |
| | | | | | |||||
| * | | | | shader/memory: Implement LDL.S16 and LDS.S16 | ReinUsesLisp | 2020-01-25 | 1 | -12/+23 |
| | |_|/ | |/| | | |||||
* | | | | Merge pull request #3350 from ReinUsesLisp/atom | bunnei | 2020-01-29 | 5 | -39/+86 |
|\ \ \ \ | |_|_|/ |/| | | | shader/memory: Implement ATOM.ADD | ||||
| * | | | shader/memory: Implement ATOM.ADD | ReinUsesLisp | 2020-01-26 | 5 | -39/+86 |
| |/ / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | ATOM operates atomically on global memory. For now only add ATOM.ADD since that's what was found in commercial games. This asserts for ATOM.ADD.S32 (handling the others as unimplemented), although ATOM.ADD.U32 shouldn't be any different. This change forces us to change the default type on SPIR-V storage buffers from float to uint. We could also alias the buffers, but it's simpler for now to just use uint. While we are at it, abstract the code to avoid repetition. | ||||
* | | | Merge pull request #3355 from ReinUsesLisp/break-down | bunnei | 2020-01-29 | 1 | -1/+1 |
|\ \ \ | | | | | | | | | texture_cache/surface_base: Fix layered break down | ||||
| * | | | texture_cache/surface_base: Fix layered break down | ReinUsesLisp | 2020-01-27 | 1 | -1/+1 |
| |/ / | | | | | | | | | | | | | Layered break downs was passing "layer" as a "depth" parameter. This commit addresses that. | ||||
* | | | Merge pull request #3358 from ReinUsesLisp/implicit-texture-cache | bunnei | 2020-01-29 | 1 | -3/+6 |
|\ \ \ | | | | | | | | | gl_texture_cache: Silence implicit sign cast warnings | ||||
| * | | | gl_texture_cache: Silence implicit sign cast warnings | ReinUsesLisp | 2020-01-28 | 1 | -3/+6 |
| |/ / | |||||
* | | | Merge pull request #3359 from ReinUsesLisp/assert-point-size | bunnei | 2020-01-28 | 1 | -1/+0 |
|\ \ \ | | | | | | | | | gl_shader_decompiler: Remove UNIMPLEMENTED for gl_PointSize | ||||
| * | | | gl_shader_decompiler: Remove UNIMPLEMENTED for gl_PointSize | ReinUsesLisp | 2020-01-28 | 1 | -1/+0 |
| |/ / | | | | | | | | | | This was implemented by a previous commit and it's no longer required. | ||||
* / / | gl_texture_cache: Properly implement depth/stencil sampling | ReinUsesLisp | 2020-01-27 | 1 | -4/+27 |
|/ / | | | | | | | | | | | This addresses the long standing issue of compatibility vs. core profiles on OpenGL, properly implementing depth vs. stencil sampling depending on the texture swizzle. | ||||
* | | Merge pull request #3344 from ReinUsesLisp/vk-botw | bunnei | 2020-01-24 | 3 | -19/+16 |
|\ \ | |/ |/| | vk_shader_decompiler: Disable default values on unwritten render targets | ||||
| * | vk_shader_decompiler: Disable default values on unwritten render targets | ReinUsesLisp | 2020-01-24 | 3 | -19/+16 |
| | | | | | | | | | | | | | | | | | | Some games like The Legend of Zelda: Breath of the Wild assign render targets without writing them from the fragment shader. This generates Vulkan validation errors, so silence these I previously introduced a commit to set "vec4(0, 0, 0, 1)" for these attachments. The problem is that this is not what games expect. This commit reverts that change. | ||||
* | | Merge pull request #3273 from FernandoS27/txd-array | bunnei | 2020-01-24 | 1 | -5/+12 |
|\ \ | | | | | | | Shader_IR: Implement TXD Array. | ||||
| * | | Shader_IR: Implement TXD Array. | Fernando Sahmkow | 2020-01-04 | 1 | -5/+12 |
| | | | | | | | | | | | | | | | This commit extends the compilation of TXD to support array samplers on TXD. | ||||
* | | | gl_shader_cache: Disable fastmath on Nvidia | ReinUsesLisp | 2020-01-21 | 1 | -0/+4 |
| |/ |/| | |||||
* | | Merge pull request #3330 from ReinUsesLisp/vk-blit-screen | Fernando Sahmkow | 2020-01-21 | 5 | -6/+754 |
|\ \ | | | | | | | vk_blit_screen: Initial implementation | ||||
| * | | vk_blit_screen: Address feedback | ReinUsesLisp | 2020-01-20 | 4 | -22/+25 |
| | | | |||||
| * | | vk_blit_screen: Initial implementation | ReinUsesLisp | 2020-01-20 | 3 | -0/+745 |
| | | | | | | | | | | | | | | | This abstraction takes care of presenting accelerated and non-accelerated or "framebuffer" images to the Vulkan swapchain. | ||||
* | | | Merge pull request #3328 from ReinUsesLisp/vulkan-atoms | bunnei | 2020-01-20 | 1 | -3/+11 |
|\ \ \ | | | | | | | | | vk_shader_decompiler: Implement UAtomicAdd (ATOMS) on SPIR-V | ||||
| * | | | vk_shader_decompiler: Implement UAtomicAdd (ATOMS) on SPIR-V | ReinUsesLisp | 2020-01-19 | 1 | -3/+11 |
| | | | | | | | | | | | | | | | | Also updates sirit to include atomic instructions. | ||||
* | | | | Merge pull request #3322 from ReinUsesLisp/vk-front-face | bunnei | 2020-01-20 | 2 | -2/+2 |
|\ \ \ \ | |_|/ / |/| | | | vk_graphics_pipeline: Set front facing properly | ||||
| * | | | vk_graphics_pipeline: Set front facing properly | ReinUsesLisp | 2020-01-18 | 2 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | Front face was being forced to a certain value when cull face is disabled. Set a default value on initialization and drop the forcefully set front facing value with culling disabled. | ||||
* | | | | Merge pull request #3313 from ReinUsesLisp/vk-rasterizer | bunnei | 2020-01-20 | 4 | -1/+1466 |
|\ \ \ \ | |_|/ / |/| | | | vk_rasterizer: Implement Vulkan's rasterizer | ||||
| * | | | vk_rasterizer: Address feedback | ReinUsesLisp | 2020-01-18 | 2 | -25/+32 |
| | | | | |||||
| * | | | vk_rasterizer: Implement Vulkan's rasterizer | ReinUsesLisp | 2020-01-17 | 3 | -1/+1386 |
| | | | | | | | | | | | | | | | | | | | | | | | | This abstraction is Vulkan's equivalent to OpenGL's rasterizer. It takes care of joining all parts of the backend and rendering accordingly on demand. | ||||
| * | | | renderer_vulkan: Add header as placeholder | ReinUsesLisp | 2020-01-17 | 2 | -0/+73 |
| | | | | |||||
* | | | | Merge pull request #3317 from ReinUsesLisp/gl-decomp-cc-decomp | Fernando Sahmkow | 2020-01-19 | 1 | -27/+5 |
|\ \ \ \ | | | | | | | | | | | gl_shader_decompiler: Fix decompilation of condition codes | ||||
| * | | | | gl_shader_decompiler: Fix decompilation of condition codes | ReinUsesLisp | 2020-01-18 | 1 | -27/+5 |
| | |/ / | |/| | | | | | | | | | | | | | | Use Visit instead of reimplementing it. Fixes unimplemented negations for condition codes. | ||||
* | | | | gl_state: Use bool instead of GLboolean | ReinUsesLisp | 2020-01-18 | 2 | -3/+3 |
| | | | | | | | | | | | | | | | | | | | | This fixes template resolution considering GLboolean an integer instead of a bool. | ||||
* | | | | Merge pull request #3305 from ReinUsesLisp/point-size-program | bunnei | 2020-01-18 | 4 | -2/+13 |
|\ \ \ \ | | | | | | | | | | | gl_state: Implement PROGRAM_POINT_SIZE | ||||
| * | | | | gl_state: Implement PROGRAM_POINT_SIZE | ReinUsesLisp | 2020-01-15 | 4 | -2/+13 |
| | | | | | | | | | | | | | | | | | | | | | | | | | For gl_PointSize to have effect we have to activate GL_PROGRAM_POINT_SIZE. | ||||
* | | | | | Merge pull request #3312 from ReinUsesLisp/atoms-u32 | bunnei | 2020-01-18 | 5 | -3/+74 |
|\ \ \ \ \ | | | | | | | | | | | | | shader/memory: Implement ATOMS.ADD.U32 | ||||
| * | | | | | shader/memory: Implement ATOMS.ADD.U32 | ReinUsesLisp | 2020-01-16 | 5 | -3/+74 |
| | | | | | | |||||
* | | | | | | Merge pull request #3306 from ReinUsesLisp/gl-texture | bunnei | 2020-01-17 | 1 | -9/+7 |
|\ \ \ \ \ \ | |_|_|/ / / |/| | | | | | gl_texture_cache: Minor fixes and style changes | ||||
| * | | | | | gl_texture_cache: Use local variables to simplify DownloadTexture | ReinUsesLisp | 2020-01-14 | 1 | -6/+4 |
| | | | | | | |||||
| * | | | | | gl_texture_cache: Fix format for RGBX16F | ReinUsesLisp | 2020-01-14 | 1 | -1/+1 |
| | | | | | | |||||
| * | | | | | gl_texture_cache: Use Snorm internal format for RG8S | ReinUsesLisp | 2020-01-14 | 1 | -1/+1 |
| | | | | | | |||||
| * | | | | | gl_texture_cache: Use Snorm internal format for ABGR8S | ReinUsesLisp | 2020-01-14 | 1 | -1/+1 |
| | |/ / / | |/| | | | |||||
* | | | | | Merge pull request #3311 from ReinUsesLisp/z32fx24s8 | bunnei | 2020-01-17 | 1 | -1/+1 |
|\ \ \ \ \ | |_|_|/ / |/| | | | | format_lookup_table: Fix ZF32_X24S8 component types | ||||
| * | | | | format_lookup_table: Fix ZF32_X24S8 component types | ReinUsesLisp | 2020-01-16 | 1 | -1/+1 |
| | |/ / | |/| | | | | | | | | | | | | | | | | | | Component types for ZF32_X24S8 were using UNORM. Drivers will set FLOAT, UINT, UNORM, UNORM; causing a format mismatch. This commit addresses that. | ||||
* | | | | Merge pull request #3300 from ReinUsesLisp/vk-texture-cache | bunnei | 2020-01-17 | 5 | -5/+724 |
|\ \ \ \ | |/ / / |/| | | | vk_texture_cache: Implement generic texture cache on Vulkan | ||||
| * | | | vk_texture_cache: Address feedback | ReinUsesLisp | 2020-01-16 | 2 | -22/+8 |
| | | | | |||||
| * | | | vk_texture_cache: Fix typo in commentary | Rodrigo Locatti | 2020-01-16 | 1 | -1/+1 |
| | | | | | | | | | | | | Co-Authored-By: MysticExile <30736337+MysticExile@users.noreply.github.com> | ||||
| * | | | vk_texture_cache: Implement generic texture cache on Vulkan | ReinUsesLisp | 2020-01-14 | 4 | -1/+733 |
| | | | | | | | | | | | | | | | | | | | | It currently ignores PBO linearizations since these should be dropped as soon as possible on OpenGL. | ||||
| * | | | texture_cache/surface_params: Make GetNumLayers public | ReinUsesLisp | 2020-01-14 | 1 | -4/+5 |
| | | | | |||||
* | | | | Merge pull request #3308 from lioncash/private | bunnei | 2020-01-16 | 1 | -2/+2 |
|\ \ \ \ | | | | | | | | | | | maxwell_3d: Make dirty_pointers private | ||||
| * | | | | maxwell_3d: Make dirty_pointers private | Lioncash | 2020-01-16 | 1 | -2/+2 |
| | |/ / | |/| | | | | | | | | | | | | | | This isn't used outside of the class itself, so we can make it private for the time being. | ||||
* | | | | Merge pull request #3304 from lioncash/fwd-decl | bunnei | 2020-01-16 | 2 | -15/+16 |
|\ \ \ \ | |/ / / |/| | | | renderer_opengl/utils: Forward declare private structs | ||||
| * | | | renderer_opengl/utils: Remove unused header inclusions | Lioncash | 2020-01-15 | 1 | -3/+0 |
| | | | | | | | | | | | | | | | | Nothing from these headers are used, so they can be removed. | ||||
| * | | | renderer_opengl/utils: Forward declare private structs | Lioncash | 2020-01-15 | 2 | -12/+16 |
| | | | | | | | | | | | | | | | | | | | | Keeps the definitions hidden and allows changes to the structs without needing to recompile all users of classes containing said structs. | ||||
* | | | | Merge pull request #3303 from lioncash/reorder | Rodrigo Locatti | 2020-01-14 | 1 | -1/+1 |
|\ \ \ \ | | | | | | | | | | | control_flow: Silence -Wreorder warning for CFGRebuildState | ||||
| * | | | | control_flow: Silence -Wreorder warning for CFGRebuildState | Lioncash | 2020-01-14 | 1 | -1/+1 |
| |/ / / | | | | | | | | | | | | | | | | | Organizes the initializer list in the same order that the variables would actually be initialized in. | ||||
* | | | | gl_shader_cache: Remove unused STAGE_RESERVED_UBOS constant | Lioncash | 2020-01-14 | 1 | -3/+0 |
| | | | | | | | | | | | | | | | | Given this isn't used, this can be removed entirely. | ||||
* | | | | gl_shader_cache: std::move entries in CachedShader constructor | Lioncash | 2020-01-14 | 1 | -3/+4 |
| | | | | | | | | | | | | | | | | Avoids several reallocations of std::vector instances where applicable. | ||||
* | | | | gl_shader_cache: Remove unused entries variable in BuildShader() | Lioncash | 2020-01-14 | 1 | -1/+0 |
|/ / / | | | | | | | | | | Eliminates a few unnecessary constructions of std::vectors. | ||||
* | | | Merge pull request #3287 from ReinUsesLisp/ldg-stg-16 | bunnei | 2020-01-14 | 2 | -34/+52 |
|\ \ \ | | | | | | | | | shader_ir/memory: Implement u16 and u8 for STG and LDG | ||||
| * | | | shader_ir/memory: Implement u16 and u8 for STG and LDG | ReinUsesLisp | 2020-01-09 | 2 | -34/+52 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | Using the same technique we used for u8 on LDG, implement u16. In the case of STG, load memory and insert the value we want to set into it with bitfieldInsert. Then set that value. | ||||
* | | | | Merge pull request #3288 from ReinUsesLisp/uncurse-aoffi | bunnei | 2020-01-14 | 1 | -10/+6 |
|\ \ \ \ | | | | | | | | | | | shader_ir/texture: Simplify AOFFI code | ||||
| * | | | | shader_ir/texture: Simplify AOFFI code | ReinUsesLisp | 2020-01-09 | 1 | -10/+6 |
| |/ / / | |||||
* | | | | Merge pull request #3290 from ReinUsesLisp/gl-clamp | bunnei | 2020-01-14 | 3 | -6/+11 |
|\ \ \ \ | |_|/ / |/| | | | maxwell_to_vk: Implement GL_CLAMP hacking Nvidia's driver | ||||
| * | | | maxwell_to_vk: Implement GL_CLAMP hacking Nvidia's driver | ReinUsesLisp | 2020-01-10 | 3 | -6/+11 |
| |/ / | | | | | | | | | | | | | | | | Nvidia's driver defaults invalid enumerations to GL_CLAMP. Vulkan doesn't expose GL_CLAMP through its API, but we can hack it on Nvidia's driver using the internal driver defaults. | ||||
* | | | vk_compute_pass: Address feedback | Rodrigo Locatti | 2020-01-11 | 1 | -0/+2 |
| | | | | | | | | | Comment hardcoded SPIR-V modules. | ||||
* | | | vk_compute_pass: Add compute passes to emulate missing Vulkan features | ReinUsesLisp | 2020-01-08 | 3 | -0/+416 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This currently only supports quad arrays and u8 indices. In the future we can remove quad arrays with a table written from the CPU, but this was used to bootstrap the other passes helpers and it was left in the code. The blob code is generated from the "shaders/" directory. Read the instructions there to know how to generate the SPIR-V. | ||||
* | | | vk_shader_util: Add helper to build SPIR-V shaders | ReinUsesLisp | 2020-01-08 | 3 | -0/+53 |
|/ / | |||||
* | | vk_pipeline_cache: Initial implementation | ReinUsesLisp | 2020-01-07 | 2 | -1/+460 |
| | | | | | | | | | | Given a pipeline key, this cache returns a pipeline abstraction (for graphics or compute). | ||||
* | | vk_graphics_pipeline: Initial implementation | ReinUsesLisp | 2020-01-07 | 4 | -0/+395 |
| | | | | | | | | | | | | | | | | | | This abstractio represents the state of the 3D engine at a given draw. Instead of changing individual bits of the pipeline how it's done in APIs like D3D11, OpenGL and NVN; on Vulkan we are forced to put everything together into a single, immutable object. It takes advantage of the few dynamic states Vulkan offers. | ||||
* | | vk_compute_pipeline: Initial implementation | ReinUsesLisp | 2020-01-07 | 4 | -0/+219 |
| | | | | | | | | This abstraction represents a Vulkan compute pipeline. | ||||
* | | vk_pipeline_cache: Add file and define descriptor update template filler | ReinUsesLisp | 2020-01-07 | 3 | -0/+67 |
| | | | | | | | | | | This function allows us to share code between compute and graphics pipelines compilation. | ||||
* | | fixed_pipeline_state: Add depth clamp | ReinUsesLisp | 2020-01-07 | 2 | -10/+18 |
| | | |||||
* | | vk_rasterizer: Add placeholder | ReinUsesLisp | 2020-01-07 | 2 | -0/+14 |
| | | |||||
* | | Merge pull request #3276 from ReinUsesLisp/pipeline-reqs | bunnei | 2020-01-06 | 5 | -1/+345 |
|\ \ | | | | | | | vk_update_descriptor/vk_renderpass_cache: Add pipeline cache dependencies | ||||
| * | | vk_renderpass_cache: Initial implementation | ReinUsesLisp | 2020-01-06 | 3 | -0/+199 |
| | | | | | | | | | | | | | | | The renderpass cache is used to avoid creating renderpasses on each draw. The hashed structure is not currently optimized. | ||||
| * | | vk_update_descriptor: Initial implementation | ReinUsesLisp | 2020-01-06 | 3 | -1/+146 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | The update descriptor is used to store in flat memory a large chunk of staging data used to update descriptor sets through templates. It provides a push interface to easily insert descriptors following the current pipeline. The order used in the descriptor update template has to be implicitly followed. We can catch bugs here using validation layers. | ||||
* | | | vk_stream_buffer/vk_buffer_cache: Avoid halting and use generic cache | ReinUsesLisp | 2020-01-06 | 4 | -62/+340 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The stream buffer before this commit once it was full (no more bytes to write before looping) waiting for all previous operations to finish. This was a temporary solution and had a noticeable performance penalty in performance (from what a profiler showed). To avoid this mark with fences usages of the stream buffer and once it loops wait for them to be signaled. On average this will never wait. Each fence knows where its usage finishes, resulting in a non-paged stream buffer. On the other side, the buffer cache is reimplemented using the generic buffer cache. It makes use of the staging buffer pool and the new stream buffer. | ||||
* | | | vk_memory_manager: Misc changes | ReinUsesLisp | 2020-01-06 | 2 | -88/+142 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * Allocate memory in discrete exponentially increasing chunks until the 128 MiB threshold. Allocations larger thant that increase linearly by 256 MiB (depending on the required size). This allows to use small allocations for small resources. * Move memory maps to a RAII abstraction. To optimize for debugging tools (like RenderDoc) users will map/unmap on usage. If this ever becomes a noticeable overhead (from my profiling it doesn't) we can transparently move to persistent memory maps without harming the API, getting optimal performance for both gameplay and debugging. * Improve messages on exceptional situations. * Fix typos "requeriments" -> "requirements". * Small style changes. | ||||
* | | | vk_buffer_cache: Temporarily remove buffer cache | ReinUsesLisp | 2020-01-06 | 2 | -226/+0 |
|/ / | | | | | | | This is intended for a follow up commit to avoid circular dependencies. | ||||
* | | Merge pull request #3257 from degasus/no_busy_loops | bunnei | 2020-01-06 | 2 | -4/+8 |
|\ \ | | | | | | | video_core: Block in WaitFence. | ||||
| * | | video_core: Block in WaitFence. | Markus Wick | 2019-12-30 | 2 | -4/+8 |
| | | | | | | | | | | | | | | | | | | | | | This function is called rarely and blocks quite often for a long time. So don't waste power and let the CPU sleep. This might also increase the performance as the other cores might be allowed to clock higher. | ||||
* | | | Merge pull request #3264 from ReinUsesLisp/vk-descriptor-pool | Fernando Sahmkow | 2020-01-05 | 3 | -0/+147 |
|\ \ \ | | | | | | | | | vk_descriptor_pool: Initial implementation | ||||
| * | | | Update src/video_core/renderer_vulkan/vk_descriptor_pool.cpp | Rodrigo Locatti | 2020-01-03 | 1 | -1/+1 |
| | | | | | | | | | | | | Co-Authored-By: Mat M. <mathew1800@gmail.com> | ||||
| * | | | vk_descriptor_pool: Initial implementation | ReinUsesLisp | 2020-01-01 | 3 | -0/+147 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Create a large descriptor pool where we allocate all our descriptors from. It has to be wide enough to support any pipeline, hence its large numbers. If the descritor pool is filled, we allocate more memory at that moment. This way we can take advantage of permissive drivers like Nvidia's that allocate more descriptors than what the spec requires. | ||||
* | | | | Merge pull request #3258 from FernandoS27/shader-amend | bunnei | 2020-01-04 | 5 | -2/+52 |
|\ \ \ \ | |_|_|/ |/| | | | Shader_IR: add the ability to amend code in the shader ir. | ||||
| * | | | Shader_IR: Address Feedback | Fernando Sahmkow | 2020-01-04 | 5 | -38/+19 |
| | | | | |||||
| * | | | Shader_IR: add the ability to amend code in the shader ir. | Fernando Sahmkow | 2019-12-30 | 5 | -3/+72 |
| | |/ | |/| | | | | | | | | | | | | | | | | This commit introduces a mechanism by which shader IR code can be amended and extended. This useful for track algorithms where certain information can derived from before the track such as indexes to array samplers. | ||||
* | | | yuzu: Remove Maxwell debugger | ReinUsesLisp | 2020-01-03 | 4 | -239/+0 |
| | | | | | | | | | | | | | | | This was carried from Citra and wasn't really used on yuzu. It also adds some runtime overhead. This commit removes it from yuzu's codebase. | ||||
* | | | Merge pull request #3243 from ReinUsesLisp/topologies | bunnei | 2020-01-02 | 1 | -4/+18 |
|\ \ \ | |_|/ |/| | | maxwell_to_gl: Implement missing primitive topologies | ||||
| * | | maxwell_to_gl: Implement missing primitive topologies | ReinUsesLisp | 2019-12-23 | 1 | -4/+18 |
| | | | | | | | | | | | | Many of these topologies are exclusively available in OpenGL. | ||||
* | | | Merge pull request #3239 from ReinUsesLisp/p2r | bunnei | 2020-01-01 | 2 | -17/+47 |
|\ \ \ | | | | | | | | | shader/p2r: Implement P2R Pr | ||||
| * | | | shader/p2r: Implement P2R Pr | ReinUsesLisp | 2019-12-20 | 1 | -1/+15 |
| | | | | | | | | | | | | | | | | | | | | P2R dumps predicate or condition codes state to a register. This is useful for unit testing. | ||||
| * | | | shader/r2p: Refactor P2R to support P2R | ReinUsesLisp | 2019-12-20 | 2 | -17/+33 |
| | | | | |||||
* | | | | Merge pull request #3248 from ReinUsesLisp/vk-image | Fernando Sahmkow | 2019-12-30 | 3 | -0/+192 |
|\ \ \ \ | | | | | | | | | | | vk_image: Add an image object abstraction | ||||
| * | | | | vk_image: Avoid unnecesary equals | Rodrigo Locatti | 2019-12-30 | 1 | -1/+1 |
| | | | | | |||||
| * | | | | vk_image: Add an image object abstraction | ReinUsesLisp | 2019-12-25 | 3 | -0/+192 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This object's job is to contain an image and manage its transitions. Since Nvidia hardware doesn't know what a transition is but Vulkan requires them anyway, we have to state track image subresources individually. To avoid the overhead of tracking each subresource in images with many subresources (think of cubemap arrays with several mipmaps), this commit tracks when subresources have diverged. As long as this doesn't happen we can check the state of the first subresource (that will be shared with all subresources) and update accordingly. Image transitions are deferred to the scheduler command buffer. | ||||
* | | | | | Merge pull request #3249 from ReinUsesLisp/vk-staging-buffer-pool | Fernando Sahmkow | 2019-12-30 | 3 | -0/+212 |
|\ \ \ \ \ | |_|_|_|/ |/| | | | | vk_staging_buffer_pool: Add a staging pool for temporary operations | ||||
| * | | | | vk_staging_buffer_pool: Initialize last epoch to zero | Rodrigo Locatti | 2019-12-29 | 1 | -1/+1 |
| | | | | | |||||
| * | | | | vk_staging_buffer_pool: Add a staging pool for temporary operations | ReinUsesLisp | 2019-12-25 | 3 | -0/+212 |
| |/ / / | | | | | | | | | | | | | | | | | | | | | | | | | The job of this abstraction is to provide staging buffers for temporary operations. Think of image uploads or buffer uploads to device memory. It automatically deletes unused buffers. | ||||
* | | | | Merge pull request #3250 from ReinUsesLisp/empty-fragment | Fernando Sahmkow | 2019-12-28 | 2 | -0/+7 |
|\ \ \ \ | | | | | | | | | | | gl_rasterizer: Allow rendering without fragment shader | ||||
| * | | | | gl_rasterizer: Allow rendering without fragment shader | ReinUsesLisp | 2019-12-26 | 2 | -0/+7 |
| |/ / / | | | | | | | | | | | | | | | | | Rendering without a fragment shader is usually used in depth-only passes. | ||||
* | | | | Merge pull request #3228 from ReinUsesLisp/ptp | bunnei | 2019-12-27 | 5 | -74/+142 |
|\ \ \ \ | |/ / / |/| | | | shader/texture: Implement AOFFI and PTP for TLD4 and TLD4S | ||||
| * | | | shader/texture: Implement TLD4.PTP | ReinUsesLisp | 2019-12-16 | 5 | -56/+120 |
| | | | | |||||
| * | | | shader/texture: Enable arrayed TLD4 | ReinUsesLisp | 2019-12-16 | 1 | -1/+0 |
| | | | | |||||
| * | | | gl_shader_decompiler: Rename "sepparate" to "separate" | ReinUsesLisp | 2019-12-16 | 1 | -3/+3 |
| | | | | |||||
| * | | | shader/texture: Implement AOFFI for TLD4S | ReinUsesLisp | 2019-12-16 | 1 | -13/+18 |
| | | | | |||||
| * | | | shader/texture: Remove unnecesary parenthesis | ReinUsesLisp | 2019-12-16 | 1 | -2/+2 |
| | | | | |||||
* | | | | Merge pull request #3244 from ReinUsesLisp/vk-fps | Fernando Sahmkow | 2019-12-25 | 4 | -6/+594 |
|\ \ \ \ | | | | | | | | | | | fixed_pipeline_state: Define structure and loaders | ||||
| * | | | | fixed_pipeline_state: Define symetric operator!= and mark as noexcept | ReinUsesLisp | 2019-12-24 | 2 | -40/+92 |
| | | | | | | | | | | | | | | | | | | | | Marks as noexcept Hash, operator== and operator!= for consistency. | ||||
| * | | | | fixed_pipeline_state: Define structure and loaders | ReinUsesLisp | 2019-12-23 | 3 | -0/+528 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The intention behind this hasheable structure is to describe the state of fixed function pipeline state that gets compiled to a single graphics pipeline state object. This is all dynamic state in OpenGL but Vulkan wants it in an immutable state, even if hardware can edit it freely. In this commit the structure is defined in an optimized state (it uses booleans, has paddings and many data entries that can be packed to single integers). This is intentional as an initial implementation that is easier to debug, implement and review. It will be optimized in later stages, or it might change if Vulkan gets more dynamic states. | ||||
| * | | | | maxwell_3d: Add depth bounds registers | ReinUsesLisp | 2019-12-23 | 1 | -6/+14 |
| | | | | | |||||
* | | | | | Merge pull request #3236 from ReinUsesLisp/rasterize-enable | bunnei | 2019-12-25 | 6 | -4/+28 |
|\ \ \ \ \ | |/ / / / |/| | | | | gl_rasterizer: Implement RASTERIZE_ENABLE | ||||
| * | | | | gl_rasterizer: Implement RASTERIZE_ENABLE | ReinUsesLisp | 2019-12-18 | 6 | -4/+28 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | RASTERIZE_ENABLE is the opposite of GL_RASTERIZER_DISCARD. Implement it naturally using this. NVN games expect rasterize to be enabled by default, reflect that in our initial GPU state. | ||||
* | | | | | Merge pull request #3241 from ReinUsesLisp/gl-shader-cache | bunnei | 2019-12-22 | 1 | -19/+14 |
|\ \ \ \ \ | |_|_|_|/ |/| | | | | gl_shader_cache: Style changes | ||||
| * | | | | gl_shader_cache: Update commentary for shared memory | ReinUsesLisp | 2019-12-21 | 1 | -9/+6 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Remove false commentary. Not dividing by 4 the size of shared memory is not a hack; it describes the number of integers, not bytes. While we are at it sort the generated code to put preprocessor lines on the top. | ||||
| * | | | | gl_shader_cache: Remove unused entry in GetPrimitiveDescription | ReinUsesLisp | 2019-12-21 | 1 | -11/+9 |
| | |_|/ | |/| | | |||||
* | | | | Merge pull request #3238 from ReinUsesLisp/vk-resource-manager | bunnei | 2019-12-22 | 4 | -1/+82 |
|\ \ \ \ | | | | | | | | | | | vk_resource_manager: Catch device losses and other changes | ||||
| * | | | | vk_resource_manager: Add entry to VKFence to test its usage | ReinUsesLisp | 2019-12-19 | 1 | -0/+8 |
| | | | | | |||||
| * | | | | vk_reosurce_manager: Add assert for releasing fences | ReinUsesLisp | 2019-12-19 | 1 | -0/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | Notify the programmer when a request to release a fence is invalid because the fence is already free. | ||||
| * | | | | vk_resource_manager: Implement VKFenceWatch move constructor | ReinUsesLisp | 2019-12-19 | 2 | -0/+32 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This allows us to put VKFenceWatch inside a std::vector without storing it in heap. On move we have to signal the fences where the new protected resource is, adding some overhead. | ||||
| * | | | | vk_device: Add entry to catch device losses | ReinUsesLisp | 2019-12-19 | 3 | -1/+40 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | VK_NV_device_diagnostic_checkpoints allows us to push data to a Vulkan queue and then query it even after a device loss. This allows us to push the current pipeline object and see what was the call that killed the device. | ||||
| * | | | | vk_device: Add query for RGBA8Uint | ReinUsesLisp | 2019-12-19 | 1 | -0/+1 |
| | | | | | |||||
* | | | | | Merge pull request #3203 from FernandoS27/tex-cache-fixes | bunnei | 2019-12-22 | 4 | -1/+144 |
|\ \ \ \ \ | | | | | | | | | | | | | Texture Cache: Add HLE methods for building 3D textures | ||||
| * | | | | | Texture Cache: Improve documentation | Fernando Sahmkow | 2019-12-22 | 2 | -4/+5 |
| | | | | | | |||||
| * | | | | | Texture Cache: Address Feedback | Fernando Sahmkow | 2019-12-22 | 2 | -11/+11 |
| | | | | | | |||||
| * | | | | | Texture Cache: Add HLE methods for building 3D textures within the GPU in certain scenarios. | Fernando Sahmkow | 2019-12-22 | 4 | -1/+143 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This commit adds a series of HLE methods for handling 3D textures in general. This helps games that generate 3D textures on every frame and may reduce loading times for certain games. | ||||
* | | | | | | Merge pull request #3237 from ReinUsesLisp/vk-shader-decompiler | Fernando Sahmkow | 2019-12-22 | 2 | -38/+49 |
|\ \ \ \ \ \ | |/ / / / / |/| | | | | | vk_shader_decompiler: Misc changes | ||||
| * | | | | | vk_shader_decompiler: Fix full decompilation | ReinUsesLisp | 2019-12-19 | 1 | -3/+5 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | When full decompilation was enabled, labels were not being inserted and instructions were misused. Fix these bugs. | ||||
| * | | | | | vk_shader_decompiler: Skip NDC correction when it is native | ReinUsesLisp | 2019-12-19 | 2 | -1/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Avoid changing gl_Position when the NDC used by the game is [0, 1] (Vulkan's native). | ||||
| * | | | | | vk_shader_decompiler: Normalize output fragment attachments | ReinUsesLisp | 2019-12-19 | 2 | -12/+12 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Some games write from fragment shaders to an unexistant framebuffer attachment or they don't write to one when it exists in the framebuffer. Fix this by skipping writes or adding zeroes. | ||||
| * | | | | | vk_shader_decompiler: Update sirit and implement Texture AOFFI | ReinUsesLisp | 2019-12-19 | 1 | -22/+30 |
| |/ / / / | |||||
* | | | | | Merge pull request #3230 from ReinUsesLisp/vk-emu-shaders | Fernando Sahmkow | 2019-12-22 | 4 | -0/+122 |
|\ \ \ \ \ | | | | | | | | | | | | | renderer_vulkan/shader: Add helper GLSL shaders | ||||
| * | | | | | renderer_vulkan/shader: Add helper GLSL shaders | ReinUsesLisp | 2019-12-16 | 4 | -0/+122 |
| | |_|/ / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | These shaders are used to specify code that is not dynamically generated in the Vulkan backend. Instead of packing it inside the build system, it's manually built and copied to the C++ file to avoid adding unnecessary build time dependencies. quad_array should be dropped in the future since it can be emulated with a memory pool generated from the CPU. | ||||
* | | | | | Merge pull request #3240 from ReinUsesLisp/decomp-cond-code | Fernando Sahmkow | 2019-12-22 | 1 | -23/+1 |
|\ \ \ \ \ | | | | | | | | | | | | | vk_shader_decompiler: Use Visit instead of reimplementing it | ||||
| * | | | | | vk_shader_decompiler: Use Visit instead of reimplementing it | ReinUsesLisp | 2019-12-21 | 1 | -23/+1 |
| | |_|/ / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | ExprCondCode visit implements the generic Visit. Use this instead of that one. As an intended side effect this fixes unwritten memory usages in cases when a negation of a condition code is used. | ||||
* | | | | | Merge pull request #3235 from ReinUsesLisp/ldg-u8 | bunnei | 2019-12-22 | 1 | -6/+32 |
|\ \ \ \ \ | |/ / / / |/| | | | | shader/memory: Implement LDG.U8 and unaligned U8 loads | ||||
| * | | | | shader/memory: Implement LDG.U8 and unaligned U8 loads | ReinUsesLisp | 2019-12-18 | 1 | -6/+32 |
| |/ / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | LDG can load single bytes instead of full integers or packs of integers. These have the advantage of loading bytes that are not aligned to 4 bytes. To emulate these this commit gets the byte being referenced (by doing "address & 3" and then using that to extract the byte from the loaded integer: result = bitfieldExtract(loaded_integer, (address % 4) * 8, 8) | ||||
* | | | | Merge pull request #3234 from ReinUsesLisp/i2f-u8-selector | bunnei | 2019-12-20 | 1 | -2/+13 |
|\ \ \ \ | | | | | | | | | | | shader/conversion: Implement byte selector in I2F | ||||
| * | | | | shader/conversion: Implement byte selector in I2F | ReinUsesLisp | 2019-12-18 | 1 | -2/+13 |
| |/ / / | | | | | | | | | | | | | | | | | | | | | | | | | I2F's byte selector is used to choose what bytes to convert to float. e.g. if the input is 0xaabbccdd and the selector is ".B3" it will convert 0xaa. The default (when it's not shown in nvdisasm) is ".B0", in that example the default would convert 0xdd to float. | ||||
* | | | | Merge pull request #3233 from ReinUsesLisp/mismatch-sizes | bunnei | 2019-12-20 | 1 | -4/+9 |
|\ \ \ \ | | | | | | | | | | | shader/texture: Properly shrink unused entries in size mismatches | ||||
| * | | | | shader/texture: Properly shrink unused entries in size mismatches | ReinUsesLisp | 2019-12-18 | 1 | -4/+9 |
| |/ / / | | | | | | | | | | | | | | | | | | | | | | | | | When a image format mismatches we were inserting zeroes to the texture itself. This was not handling cases were the mismatch uses less coordinates than the guest shader code. Address that by resizing the vector. | ||||
* | | | | Merge pull request #3232 from ReinUsesLisp/gl-decompiler-images | bunnei | 2019-12-19 | 1 | -0/+1 |
|\ \ \ \ | | | | | | | | | | | gl_shader_decompiler: Add missing DeclareImages | ||||
| * | | | | gl_shader_decompiler: Add missing DeclareImages | ReinUsesLisp | 2019-12-18 | 1 | -0/+1 |
| |/ / / | |||||
* | | | | Merge pull request #3231 from ReinUsesLisp/tld4s-encoding | bunnei | 2019-12-19 | 1 | -1/+1 |
|\ \ \ \ | |_|/ / |/| | | | shader_bytecode: Fix TLD4S encoding | ||||
| * | | | shader_bytecode: Fix TLD4S encoding | ReinUsesLisp | 2019-12-18 | 1 | -1/+1 |
| |/ / | |||||
* | | | Merge pull request #3221 from ReinUsesLisp/vk-scheduler | bunnei | 2019-12-19 | 2 | -37/+311 |
|\ \ \ | |/ / |/| | | vk_scheduler: Delegate commands to a worker thread and state track | ||||
| * | | vk_scheduler: Delegate commands to a worker thread and state track | ReinUsesLisp | 2019-12-13 | 2 | -37/+311 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Introduce a worker thread approach for delegating Vulkan work derived from dxvk's approach. https://github.com/doitsujin/dxvk Now that the scheduler is what handles all Vulkan work related to command streaming, store state tracking in itself. This way we can know when to reupload Vulkan dynamic state to the queue (since this one is invalidated between command buffers unlike NVN). We can also store the renderpass state and graphics pipeline bound to avoid redundant binds and renderpass begins/ends. | ||||
* | | | Merge pull request #3182 from ReinUsesLisp/renderer-opengl | bunnei | 2019-12-16 | 2 | -131/+115 |
|\ \ \ | |_|/ |/| | | renderer_opengl: Miscellaneous clean ups | ||||
| * | | renderer_opengl: Make ScreenRectVertex's constructor constexpr | ReinUsesLisp | 2019-11-29 | 1 | -12/+7 |
| | | | |||||
| * | | renderer_opengl: Remove C casts | ReinUsesLisp | 2019-11-29 | 1 | -4/+5 |
| | | | |||||
| * | | renderer_opengl: Use explicit binding for presentation shaders | ReinUsesLisp | 2019-11-29 | 2 | -34/+20 |
| | | | |||||
| * | | renderer_opengl: Drop macros for message decorations | ReinUsesLisp | 2019-11-29 | 1 | -21/+26 |
| | | | |||||
| * | | renderer_opengl: Move static definitions to anonymous namespace | ReinUsesLisp | 2019-11-29 | 1 | -62/+66 |
| | | | |||||
| * | | renderer_opengl: Move commentaries to header file | ReinUsesLisp | 2019-11-29 | 2 | -20/+13 |
| | | | |||||
* | | | Merge pull request #3219 from FernandoS27/fix-bindless | Rodrigo Locatti | 2019-12-16 | 4 | -47/+124 |
|\ \ \ | | | | | | | | | Corrections and fixes to TLD4S & bindless samplers failing | ||||
| * | | | Shader_IR: Correct TLD4S Depth Compare. | Fernando Sahmkow | 2019-12-12 | 2 | -9/+16 |
| | | | | |||||
| * | | | Shader_Ir: Correct TLD4S encoding and implement f16 flag. | Fernando Sahmkow | 2019-12-12 | 3 | -11/+15 |
| | | | | |||||
| * | | | Gl_Shader_compiler: Correct Depth Compare for Texture Gather operations. | Fernando Sahmkow | 2019-12-12 | 1 | -8/+21 |
| | | | | |||||
| * | | | Shader_Ir: default failed tracks on bindless samplers to null values. | Fernando Sahmkow | 2019-12-12 | 2 | -24/+77 |
| | | | | |||||
* | | | | Merge pull request #3222 from ReinUsesLisp/maxwell-to-vk | bunnei | 2019-12-15 | 3 | -111/+243 |
|\ \ \ \ | | | | | | | | | | | maxwell_to_vk: Use VK_EXT_index_type_uint8 and misc changes | ||||
| * | | | | maxwell_to_vk: Improve image format table and add more formats | ReinUsesLisp | 2019-12-13 | 2 | -89/+127 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | A1B5G5R5 uses A1R5G5B5. This is flipped with image view swizzles; flushing is still not properly implemented on Vulkan for this particular format. | ||||
| * | | | | maxwell_to_vk: Implement more vertex formats | ReinUsesLisp | 2019-12-13 | 1 | -7/+81 |
| | | | | | |||||
| * | | | | maxwell_to_vk: Implement more primitive topologies | ReinUsesLisp | 2019-12-13 | 2 | -2/+11 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Add an extra argument to query device capabilities in the future. The intention behind this is to use native quads, quad strips, line loops and polygons if these are released for Vulkan. | ||||
| * | | | | maxwell_to_vk: Approach GL_CLAMP closer to the GL spec | ReinUsesLisp | 2019-12-13 | 3 | -9/+17 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The OpenGL spec defines GL_CLAMP's formula similarly to CLAMP_TO_EDGE and CLAMP_TO_BORDER depending on the filter mode used. It doesn't exactly behave like this, but it's the closest we can get with what Vulkan offers without emulating it by injecting shader code. | ||||
| * | | | | maxwell_to_vk: Use VK_EXT_index_type_uint8 when available | ReinUsesLisp | 2019-12-13 | 2 | -4/+7 |
| | |_|/ | |/| | | |||||
* | | | | Merge pull request #3213 from ReinUsesLisp/intel-mesa | bunnei | 2019-12-14 | 1 | -1/+4 |
|\ \ \ \ | | | | | | | | | | | gl_device: Enable compute shaders for Intel Mesa drivers | ||||
| * | | | | gl_device: Enable compute shaders for Intel Mesa drivers | ReinUsesLisp | 2019-12-11 | 1 | -1/+4 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Previously we naively checked for "Intel" in GL_VENDOR, but this includes both Intel's proprietary driver and the mesa driver. Re-enable compute shaders for mesa. | ||||
* | | | | | Merge pull request #3212 from ReinUsesLisp/fix-smem-lmem | bunnei | 2019-12-14 | 1 | -2/+2 |
|\ \ \ \ \ | |_|/ / / |/| | | | | gl_shader_cache: Add missing new-line on emitted GLSL | ||||
| * | | | | gl_shader_cache: Add missing new-line on emitted GLSL | ReinUsesLisp | 2019-12-11 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Add missing new-line. This caused shaders using local memory and shared memory to inject a preprocessor GLSL line after an expression (resulting in invalid code). It looked like this: shared uint smem[8];#define LOCAL_MEMORY_SIZE 16 It should look like this (addressed by this commit): shared uint smem[8]; \#define LOCAL_MEMORY_SIZE 16 | ||||
* | | | | | Merge pull request #3217 from jhol/fix-boost-include | bunnei | 2019-12-12 | 1 | -0/+1 |
|\ \ \ \ \ | |_|_|/ / |/| | | | | Added missing include | ||||
| * | | | | Added missing include | Joel Holdsworth | 2019-12-11 | 1 | -0/+1 |
| | |/ / | |/| | | |||||
* | | | | Gl_Rasterizer: Skip Tesselation Control and Eval stages as they are un implemented. | Fernando Sahmkow | 2019-12-11 | 1 | -0/+8 |
| | | | | | | | | | | | | | | | | | | | | This commit ensures the OGL backend does not execute tesselation shader stages as they are currently unimplemented. | ||||
* | | | | Merge pull request #3210 from ReinUsesLisp/memory-barrier | bunnei | 2019-12-11 | 5 | -1/+46 |
|\ \ \ \ | |/ / / |/| | | | shader: Implement MEMBAR.GL | ||||
| * | | | shader: Implement MEMBAR.GL | ReinUsesLisp | 2019-12-10 | 5 | -1/+46 |
| |/ / | | | | | | | | | | Implement using memoryBarrier in GLSL and OpMemoryBarrier on SPIR-V. | ||||
* / / | Maxwell3D: Implement Depth Mode. | Fernando Sahmkow | 2019-12-11 | 4 | -8/+15 |
|/ / | | | | | | | | | This commit finishes adding depth mode that was reverted before due to other unresolved issues. | ||||
* | | vk_shader_decompiler: Fix build issues on old gcc versions | ReinUsesLisp | 2019-12-10 | 1 | -2/+3 |
| | | |||||
* | | vk_shader_decompiler: Reduce YNegate's severity | ReinUsesLisp | 2019-12-10 | 1 | -1/+1 |
| | | |||||
* | | shader_ir/other: Implement S2R InvocationId | ReinUsesLisp | 2019-12-10 | 4 | -0/+9 |
| | | |||||
* | | vk_shader_decompiler: Misc changes | ReinUsesLisp | 2019-12-10 | 2 | -697/+1648 |
| | | | | | | | | | | | | | | | | Update Sirit and its usage in vk_shader_decompiler. Highlights: - Implement tessellation shaders - Implement geometry shaders - Implement some missing features - Use native half float instructions when available. | ||||
* | | shader: Keep track of shaders using warp instructions | ReinUsesLisp | 2019-12-10 | 2 | -0/+8 |
| | | |||||
* | | shader_ir/memory: Implement patch stores | ReinUsesLisp | 2019-12-10 | 4 | -20/+38 |
| | | |||||
* | | vk_device: Misc changes | ReinUsesLisp | 2019-12-09 | 2 | -117/+276 |
| | | | | | | | | | | | | | | | | | | - Setup more features and requirements. - Improve logging for missing features. - Collect telemetry parameters. - Add queries for more image formats. - Query push constants limits. - Optionally enable some extensions. | ||||
* | | Merge pull request #3198 from ReinUsesLisp/tessellation-maxwell | bunnei | 2019-12-09 | 1 | -2/+37 |
|\ \ | | | | | | | maxwell_3d: Add tessellation state entries | ||||
| * | | maxwell_3d: Add tessellation tess level registers | ReinUsesLisp | 2019-12-07 | 1 | -1/+6 |
| | | | |||||
| * | | maxwell_3d: Add tessellation mode register | ReinUsesLisp | 2019-12-07 | 1 | -1/+28 |
| | | | |||||
| * | | maxwell_3d: Add patch vertices register | ReinUsesLisp | 2019-12-07 | 1 | -1/+4 |
| | | | |||||
* | | | externals: Update Vulkan-Headers | ReinUsesLisp | 2019-12-09 | 2 | -2/+14 |
| | | | |||||
* | | | vk_swapchain: Add support for swapping sRGB | ReinUsesLisp | 2019-12-07 | 2 | -24/+31 |
|/ / | | | | | | | | | We don't know until the game is running if it's using an sRGB color space or not. Add support for hot-swapping swapchain surface formats. | ||||
* | | shader_bytecode: Remove corrupted character | ReinUsesLisp | 2019-12-07 | 1 | -1/+1 |
| | | |||||
* | | Merge pull request #3109 from FernandoS27/new-instr | bunnei | 2019-12-07 | 6 | -8/+171 |
|\ \ | | | | | | | Implement FLO & TXD Instructions on GPU Shaders | ||||
| * | | Shader_IR: Address Feedback | Fernando Sahmkow | 2019-11-18 | 3 | -11/+9 |
| | | | |||||
| * | | Shader_IR: Implement TXD instruction. | Fernando Sahmkow | 2019-11-14 | 5 | -8/+120 |
| | | | |||||
| * | | Shader_IR: Implement FLO instruction. | Fernando Sahmkow | 2019-11-14 | 5 | -0/+35 |
| | | | |||||
| * | | Shader_Bytecode: Add encodings for FLO, SHF and TXD | Fernando Sahmkow | 2019-11-14 | 1 | -0/+18 |
| | | | |||||
* | | | Merge pull request #2987 from FernandoS27/texture-invalid | bunnei | 2019-12-02 | 3 | -32/+101 |
|\ \ \ | | | | | | | | | Texture_Cache: Redo invalid Surfaces handling. | ||||
| * | | | Texture_Cache: Redo invalid Surfaces handling. | Fernando Sahmkow | 2019-11-20 | 3 | -32/+101 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | This commit aims to redo the full setup of invalid textures and guarantee correct behavior across backends in the case of finding one by using black dummy textures that match the target of the expected texture. | ||||
* | | | | Merge pull request #3184 from ReinUsesLisp/framebuffer-cache | bunnei | 2019-12-01 | 4 | -72/+69 |
|\ \ \ \ | | | | | | | | | | | gl_framebuffer_cache: Optimize framebuffer cache management | ||||
| * | | | | gl_framebuffer_cache: Optimize framebuffer key | ReinUsesLisp | 2019-11-29 | 3 | -46/+60 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Pack color attachment enumerations into a single u32. To determine the number of buffers, the highest color attachment with a shared pointer that doesn't point to null is used. | ||||
| * | | | | gl_rasterizer: Re-enable framebuffer cache for clear buffers | ReinUsesLisp | 2019-11-29 | 3 | -32/+15 |
| | |_|/ | |/| | | |||||
* / | | | texture_cache/surface_base: Fix out of bounds texture views | ReinUsesLisp | 2019-11-29 | 1 | -7/+4 |
|/ / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | Some texture views were being created out of bounds (with more layers or mipmaps than what the original texture has). This is because of a miscalculation in mipmap bounding. end_layer and end_mipmap are out of bounds (e.g. layer 6 in a cubemap), there's no need to add one more there. Fixes OpenGL errors and Vulkan crashes on Splatoon 2. | ||||
* | | | Merge pull request #3169 from lioncash/memory | bunnei | 2019-11-28 | 7 | -16/+31 |
|\ \ \ | | | | | | | | | core/memory: Deglobalize memory management code | ||||
| * | | | core/memory: Migrate over RasterizerMarkRegionCached() to the Memory class | Lioncash | 2019-11-27 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | This is only used within the accelerated rasterizer in two places, so this is also a very trivial migration. | ||||
| * | | | core/memory: Migrate over GetPointer() | Lioncash | 2019-11-27 | 4 | -7/+8 |
| | | | | | | | | | | | | | | | | | | | | With all of the interfaces ready for migration, it's trivial to migrate over GetPointer(). | ||||
| * | | | core: Prepare various classes for memory read/write migration | Lioncash | 2019-11-27 | 5 | -7/+21 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Amends a few interfaces to be able to handle the migration over to the new Memory class by passing the class by reference as a function parameter where necessary. Notably, within the filesystem services, this eliminates two ReadBlock() calls by using the helper functions of HLERequestContext to do that for us. | ||||
* | | | | Merge pull request #3174 from lioncash/optional | Rodrigo Locatti | 2019-11-28 | 1 | -2/+1 |
|\ \ \ \ | | | | | | | | | | | video_core/gpu_thread: Tidy up SwapBuffers() | ||||
| * | | | | video_core/gpu_thread: Tidy up SwapBuffers() | Lioncash | 2019-11-27 | 1 | -2/+1 |
| |/ / / | | | | | | | | | | | | | | | | | We can just use std::nullopt and std::make_optional to make this a little bit less noisy. | ||||
* | | | | video_core/const_buffer_locker: Make use of std::tie in HasEqualKeys() | Lioncash | 2019-11-27 | 1 | -2/+3 |
| | | | | | | | | | | | | | | | | Tidies it up a little bit visually. | ||||
* | | | | video_core/const_buffer_locker: Remove unused includes | Lioncash | 2019-11-27 | 2 | -2/+2 |
| | | | | |||||
* | | | | video_core/const_buffer_locker: Remove #pragma once from cpp file | Lioncash | 2019-11-27 | 1 | -2/+0 |
|/ / / | | | | | | | | | | Silences a compiler warning. | ||||
* | | | Merge pull request #3143 from ReinUsesLisp/indexing-bug | bunnei | 2019-11-27 | 2 | -48/+2 |
|\ \ \ | | | | | | | | | gl_device: Deduce indexing bug from device instead of heuristic | ||||
| * | | | gl_device: Deduce indexing bug from device instead of heuristic | ReinUsesLisp | 2019-11-25 | 2 | -48/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | The heuristic to detect AMD's driver was not working properly since it also included Intel. Instead of using heuristics to detect it, compare the GL_VENDOR string. | ||||
* | | | | gl_shader_decompiler: Fix casts from fp32 to f16 | ReinUsesLisp | 2019-11-26 | 1 | -1/+2 |
|/ / / | | | | | | | | | | Casts from f32 to f16 zeroes the higher half of the target register. | ||||
* | | | Merge pull request #3158 from ReinUsesLisp/srgb-blit | bunnei | 2019-11-25 | 1 | -0/+1 |
|\ \ \ | | | | | | | | | gl_texture_cache: Apply sRGB on blits | ||||
| * | | | gl_texture_cache: Apply sRGB on blits | ReinUsesLisp | 2019-11-24 | 1 | -0/+1 |
| | | | | | | | | | | | | | | | | | | | | glBlitFramebuffer keeps in mind GL_FRAMEBUFFER_SRGB's state. Enable this depending on the target surface pixel format. | ||||
* | | | | Merge pull request #3155 from bunnei/fix-asynch-gpu-wait | bunnei | 2019-11-25 | 1 | -17/+15 |
|\ \ \ \ | | | | | | | | | | | gpu_thread: Don't spin wait if there are no GPU commands. | ||||
| * | | | | gpu_thread: Don't spin wait if there are no GPU commands. | bunnei | 2019-11-23 | 1 | -17/+15 |
| |/ / / | |||||
* | | | | Merge pull request #3098 from ReinUsesLisp/shader-invalidations | bunnei | 2019-11-25 | 31 | -744/+742 |
|\ \ \ \ | | | | | | | | | | | gl_shader_cache: Miscellaneous changes to shaders | ||||
| * | | | | gl_device: Reserve base bindings on limited devices | ReinUsesLisp | 2019-11-23 | 1 | -36/+76 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | SSBOs and other resources are limited per pipeline on Intel and AMD. Heuristically reserve resources per stage having in mind the reported OpenGL limits. | ||||
| * | | | | gl_state: Skip null texture binds | ReinUsesLisp | 2019-11-23 | 1 | -1/+5 |
| | | | | | | | | | | | | | | | | | | | | glBindTextureUnit doesn't support null textures. Skip binding these. | ||||
| * | | | | gl_rasterizer: Disable compute shaders on Intel | ReinUsesLisp | 2019-11-23 | 3 | -0/+12 |
| | | | | | | | | | | | | | | | | | | | | | | | | | Intel's proprietary driver enters in a corrupt state when compute shaders are executed. For now, disable these. | ||||
| * | | | | gl_shader_cache: Hack shared memory size | ReinUsesLisp | 2019-11-23 | 1 | -2/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The current shared memory size seems to be smaller than what the game actually uses. This makes Nvidia's driver consistently blow up; in the case of FE3H it made it explode on Qt's SwapBuffers while SDL2 worked just fine. For now keep this hack since it's still progress over the previous hardcoded shared memory size. | ||||
| * | | | | gl_shader_decompiler: Normalize image bindings | ReinUsesLisp | 2019-11-23 | 3 | -33/+19 |
| | | | | | |||||
| * | | | | gl_shader_decompiler: Normalize cbuf bindings | ReinUsesLisp | 2019-11-23 | 2 | -10/+6 |
| | | | | | | | | | | | | | | | | | | | | | | | | | Stage and compute shaders were using a different binding counter. Normalize these. | ||||
| * | | | | gl_rasterizer: Add missing cbuf counter reset on compute | ReinUsesLisp | 2019-11-23 | 1 | -0/+2 |
| | | | | | |||||
| * | | | | gl_shader_cache: Remove dynamic BaseBinding specialization | ReinUsesLisp | 2019-11-23 | 16 | -192/+200 |
| | | | | | |||||
| * | | | | video_core: Unify ProgramType and ShaderStage into ShaderType | ReinUsesLisp | 2019-11-23 | 22 | -289/+262 |
| | | | | | |||||
| * | | | | gl_rasterizer: Bind graphics images to draw commands | ReinUsesLisp | 2019-11-23 | 4 | -33/+54 |
| | | | | | | | | | | | | | | | | | | | | | | | | | Images were not being bound to draw invocations because these would require a cache invalidation. | ||||
| * | | | | gl_shader_cache: Specialize local memory size for compute shaders | ReinUsesLisp | 2019-11-23 | 6 | -21/+32 |
| | | | | | | | | | | | | | | | | | | | | | | | | | Local memory size in compute shaders was stubbed with an arbitary size. This commit specializes local memory size from guest GPU parameters. | ||||
| * | | | | gl_shader_cache: Specialize shared memory size | ReinUsesLisp | 2019-11-23 | 5 | -29/+25 |
| | | | | | | | | | | | | | | | | | | | | | | | | | Shared memory was being declared with an undefined size. Specialize from guest GPU parameters the compute shader's shared memory size. | ||||
| * | | | | gl_shader_cache: Specialize shader workgroup | ReinUsesLisp | 2019-11-23 | 6 | -68/+74 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Drop the usage of ARB_compute_variable_group_size and specialize compute shaders instead. This permits compute to run on AMD and Intel proprietary drivers. | ||||
| * | | | | shader/texture: Handle TLDS texture type mismatches | ReinUsesLisp | 2019-11-23 | 1 | -1/+10 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Some games like "Fire Emblem: Three Houses" bind 2D textures to offsets used by instructions of 1D textures. To handle the discrepancy this commit uses the the texture type from the binding and modifies the emitted code IR to build a valid backend expression. E.g.: Bound texture is 2D and instruction is 1D, the emitted IR samples a 2D texture in the coordinate ivec2(X, 0). | ||||
| * | | | | shader/texture: Deduce texture buffers from locker | ReinUsesLisp | 2019-11-23 | 9 | -174/+107 |
| | |/ / | |/| | | | | | | | | | | | | | | Instead of specializing shaders to separate texture buffers from 1D textures, use the locker to deduce them while they are being decoded. | ||||
* | | | | Merge pull request #3105 from ReinUsesLisp/fix-stencil-reg | bunnei | 2019-11-24 | 1 | -3/+3 |
|\ \ \ \ | | | | | | | | | | | maxwell_3d: Fix stencil_back_func_mask offset | ||||
| * | | | | maxwell_3d: Fix stencil_back_func_mask offset | ReinUsesLisp | 2019-11-13 | 1 | -3/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | | stencil_back_func_mask and stencil_back_mask were misplaced. This commit addresses that issue. | ||||
* | | | | | Merge pull request #3145 from ReinUsesLisp/buffer-cache-init | bunnei | 2019-11-24 | 1 | -10/+10 |
|\ \ \ \ \ | |_|_|/ / |/| | | | | buffer_cache: Remove brace initialized for objects with default constructor | ||||
| * | | | | buffer_cache: Remove brace initialized for objects with default constructor | ReinUsesLisp | 2019-11-20 | 1 | -10/+10 |
| | |/ / | |/| | | |||||
* | | | | Merge pull request #3141 from ReinUsesLisp/gl-position | bunnei | 2019-11-23 | 1 | -0/+1 |
|\ \ \ \ | |/ / / |/| | | | gl_shader_gen: Apply default value to gl_Position | ||||
| * | | | gl_shader_gen: Apply default value to gl_Position | ReinUsesLisp | 2019-11-20 | 1 | -0/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | Nvidia has sane default output values for varyings, but the other vendors don't apply these. To properly emulate this we would have to analyze the shader header. For the time being, apply the same default Nvidia applies so we get the same behaviour on non-Nvidia drivers. | ||||
* | | | | shader/other: Reduce DEPBAR log severity | ReinUsesLisp | 2019-11-20 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | While DEPBAR is stubbed it doesn't change anything from our end. Shading languages handle what this instruction does implicitly. We are not getting anything out fo this log except noise. | ||||
* | | | | Merge pull request #3086 from ReinUsesLisp/format-lookups | bunnei | 2019-11-20 | 12 | -555/+442 |
|\ \ \ \ | |/ / / |/| | | | texture_cache: Use a flat table instead of switch for texture format lookups | ||||
| * | | | format_lookup_table: Address feedback | ReinUsesLisp | 2019-11-15 | 2 | -30/+24 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | format_lookup_table: Drop bitfields format_lookup_table: Use std::array for definition table format_lookup_table: Include <limits> instead of <numeric> | ||||
| * | | | texture_cache: Use a table instead of switch for texture formats | ReinUsesLisp | 2019-11-15 | 9 | -261/+290 |
| | | | | | | | | | | | | | | | | | | | | | | | | Use a large flat array to look up texture formats. This allows us to properly implement formats with different component types. It should also be faster. | ||||
| * | | | texture_cache: Drop abstracted ComponentType | ReinUsesLisp | 2019-11-14 | 8 | -294/+158 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Abstracted ComponentType was not being used in a meaningful way. This commit drops its usage. There is one place where it was being used to test compatibility between two cached surfaces, but this one is implied in the pixel format. Removing the component type test doesn't change the behaviour. | ||||
* | | | | Merge pull request #3047 from ReinUsesLisp/clip-control | bunnei | 2019-11-15 | 7 | -79/+42 |
|\ \ \ \ | |/ / / |/| | | | gl_rasterizer: Emulate viewport flipping with ARB_clip_control | ||||
| * | | | gl_rasterizer: Remove front facing hack | ReinUsesLisp | 2019-11-07 | 1 | -12/+0 |
| | | | | |||||
| * | | | gl_shader_decompiler: Fix typo "y_negate"->"y_direction" | ReinUsesLisp | 2019-11-07 | 1 | -1/+1 |
| | | | | |||||
| * | | | gl_shader_manager: Remove unused variable in SetFromRegs | ReinUsesLisp | 2019-11-07 | 1 | -1/+0 |
| | | | | |||||
| * | | | gl_rasterizer: Emulate viewport flipping with ARB_clip_control | ReinUsesLisp | 2019-11-07 | 7 | -76/+52 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | Emulates negative y viewports with ARB_clip_control. This allows us to more easily emulated pipelines with tessellation and/or geometry shader stages. It also avoids corrupting games with transform feedbacks and negative viewports (gl_Position.y was being modified). | ||||
* | | | | correct the implementation of RGBA16UI | greggameplayer | 2019-11-14 | 1 | -0/+2 |
| |_|/ |/| | | |||||
* | | | Merge pull request #3081 from ReinUsesLisp/fswzadd-shuffles | Fernando Sahmkow | 2019-11-14 | 8 | -125/+127 |
|\ \ \ | |_|/ |/| | | shader: Implement FSWZADD and reimplement SHFL | ||||
| * | | gl_shader_cache: Enable extensions only when available | ReinUsesLisp | 2019-11-08 | 1 | -6/+14 |
| | | | | | | | | | | | | Silence GLSL compilation warnings. | ||||
| * | | gl_shader_decompiler: Add safe fallbacks when ARB_shader_ballot is not available | ReinUsesLisp | 2019-11-08 | 3 | -5/+28 |
| | | | |||||
| * | | shader_ir/warp: Implement FSWZADD | ReinUsesLisp | 2019-11-08 | 5 | -0/+44 |
| | | | |||||
| * | | gl_shader_decompiler: Reimplement shuffles with platform agnostic intrinsics | ReinUsesLisp | 2019-11-08 | 5 | -122/+49 |
| | | | |||||
* | | | Merge pull request #3084 from ReinUsesLisp/cast-warnings | Rodrigo Locatti | 2019-11-13 | 10 | -53/+68 |
|\ \ \ | | | | | | | | | video_core: Treat implicit conversions as errors | ||||
| * | | | video_core: Enable sign conversion warnings | Rodrigo Locatti | 2019-11-11 | 1 | -1/+1 |
| | | | | | | | | | | | | Enable sign conversion warnings but don't treat them as errors. | ||||
| * | | | video_core: Treat implicit conversions as errors | ReinUsesLisp | 2019-11-08 | 1 | -0/+6 |
| | | | | |||||
| * | | | video_core: Silence implicit conversion warnings | ReinUsesLisp | 2019-11-08 | 9 | -53/+62 |
| | |/ | |/| | |||||
* | | | Merge pull request #3082 from ReinUsesLisp/fix-lockers | bunnei | 2019-11-09 | 1 | -2/+4 |
|\ \ \ | | | | | | | | | gl_shader_cache: Fix locker constructors | ||||
| * | | | gl_shader_cache: Fix locker constructors | ReinUsesLisp | 2019-11-08 | 1 | -2/+4 |
| | |/ | |/| | | | | | | | Properly pass engine when a shader is being constructed from memory. | ||||
* | | | Merge pull request #3080 from FernandoS27/glsl-fix | bunnei | 2019-11-08 | 1 | -1/+1 |
|\ \ \ | |/ / |/| | | GLSLDecompiler: Correct Texture Gather Offset. | ||||
| * | | GLSLDecompiler: Correct Texture Gather Offset. | Fernando Sahmkow | 2019-11-07 | 1 | -1/+1 |
| | | | | | | | | | | | | This commit corrects the argument ordering in textureGatherOffset. | ||||
* | | | Merge pull request #3032 from ReinUsesLisp/simplify-control-flow-brx | bunnei | 2019-11-07 | 1 | -103/+111 |
|\ \ \ | | | | | | | | | shader/control_flow: Abstract repeated code chunks in BRX tracking | ||||
| * | | | shader/control_flow: Specify constness on caller lambdas | Rodrigo Locatti | 2019-11-07 | 1 | -11/+12 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Update src/video_core/shader/control_flow.cpp Co-Authored-By: Mat M. <mathew1800@gmail.com> Update src/video_core/shader/control_flow.cpp Co-Authored-By: Mat M. <mathew1800@gmail.com> Update src/video_core/shader/control_flow.cpp Co-Authored-By: Mat M. <mathew1800@gmail.com> Update src/video_core/shader/control_flow.cpp Co-Authored-By: Mat M. <mathew1800@gmail.com> Update src/video_core/shader/control_flow.cpp Co-Authored-By: Mat M. <mathew1800@gmail.com> Update src/video_core/shader/control_flow.cpp Co-Authored-By: Mat M. <mathew1800@gmail.com> | ||||
| * | | | shader/control_flow: Use callable template instead of std::function | ReinUsesLisp | 2019-11-07 | 1 | -6/+5 |
| | | | | |||||
| * | | | shader/control_flow: Abstract repeated code chunks in BRX tracking | ReinUsesLisp | 2019-11-07 | 1 | -93/+101 |
| | | | | | | | | | | | | | | | | Remove copied and pasted for cycles into a common templated function. | ||||
| * | | | shader/control_flow: Silence Intellisense cast warnings | ReinUsesLisp | 2019-11-07 | 1 | -1/+1 |
| | | | | |||||
| * | | | shader/control_flow: Remove brace initializer in std containers | ReinUsesLisp | 2019-11-07 | 1 | -9/+9 |
| | | | | | | | | | | | | | | | | These containers have a default constructor. | ||||
* | | | | buffer_cache: Add missing includes (#3079) | Morph | 2019-11-07 | 1 | -0/+4 |
| | | | | | | | | | | | | | | | | `boost::make_iterator_range` is available when `boost/range/iterator_range.hpp` is included. Also include `boost/icl/interval_map.hpp` and `boost/icl/interval_set.hpp`. | ||||
* | | | | Merge pull request #3070 from ReinUsesLisp/shader-warnings | bunnei | 2019-11-07 | 7 | -51/+19 |
|\ \ \ \ | |_|_|/ |/| | | | shader_ir: Reduce severity of warnings | ||||
| * | | | shader/decode: Reduce severity of arithmetic rounding warnings | ReinUsesLisp | 2019-11-07 | 6 | -15/+17 |
| | | | | |||||
| * | | | shader/arithmetic: Reduce RRO stub severity | ReinUsesLisp | 2019-11-07 | 1 | -1/+2 |
| | | | | |||||
| * | | | shader/texture: Remove NODEP warnings | ReinUsesLisp | 2019-11-07 | 1 | -35/+0 |
| |/ / | | | | | | | | | | | | | These warnings don't offer meaningful information while decoding shaders. Remove them. | ||||
* | | | Merge pull request #3057 from ReinUsesLisp/buffer-sub-data | bunnei | 2019-11-06 | 6 | -11/+70 |
|\ \ \ | |/ / |/| | | gl_rasterizer: Upload constant buffers with glNamedBufferSubData | ||||
| * | | gl_rasterizer: Re-enable stream buffer memory due to global memory | ReinUsesLisp | 2019-11-02 | 1 | -14/+8 |
| | | | | | | | | | | | | | | | Global memory is still using the stream buffer when it shouldn't. As a temporary fix re-enable the stream buffer on compute. | ||||
| * | | gl_rasterizer: Upload constant buffers with glNamedBufferSubData | ReinUsesLisp | 2019-11-02 | 6 | -19/+84 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Nvidia's OpenGL driver maps gl(Named)BufferSubData with some requirements to a fast. This path has an extra memcpy but updates the buffer without orphaning or waiting for previous calls. It can be seen as a better model for "push constants" that can upload a whole UBO instead of 256 bytes. This path has some requirements established here: http://on-demand.gputechconf.com/gtc/2014/presentations/S4379-opengl-44-scene-rendering-techniques.pdf#page=24 Instead of using the stream buffer, this commits moves constant buffers uploads to calls of glNamedBufferSubData and from my testing it brings a performance improvement. This is disabled when the vendor is not Nvidia since it brings performance regressions. | ||||
* | | | Merge pull request #3039 from ReinUsesLisp/cleanup-samplers | Rodrigo Locatti | 2019-11-06 | 8 | -142/+116 |
|\ \ \ | | | | | | | | | shader/node: Unpack bindless texture encoding | ||||
| * | | | shader/node: Unpack bindless texture encoding | ReinUsesLisp | 2019-10-30 | 8 | -142/+116 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Bindless textures were using u64 to pack the buffer and offset from where they come from. Drop this in favor of separated entries in the struct. Remove the usage of std::set in favor of std::list (it's not std::vector to avoid reference invalidations) for samplers and images. | ||||
* | | | | Merge pull request #2914 from FernandoS27/fermi-fix | bunnei | 2019-11-06 | 1 | -3/+27 |
|\ \ \ \ | | | | | | | | | | | Fermi2D: limit blit area to only available area | ||||
| * | | | | Fermi2D: Use a different formula for delimiting blit areas. | Fernando Sahmkow | 2019-10-18 | 1 | -14/+28 |
| | | | | | |||||
| * | | | | Fermi2D: limit blit area to only available area | Fernando Sahmkow | 2019-10-17 | 1 | -4/+14 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Normaly OpenGL does not care if the areas exceed the texture regions but other backends such as Vulkan do care about the limits of this areas. This PR crops the areas of the blit in order that they don't surpass the limits of the textures. This should help Vulkan and faulty OpenGL drivers | ||||
* | | | | | common_func: Use std::array for INSERT_PADDING_* macros. | bunnei | 2019-11-04 | 7 | -108/+110 |
| |_|/ / |/| | | | | | | | | | | | - Zero initialization here is useful for determinism. | ||||
* | | | | Shader_IR: Fix regression on TLD4 | Fernando Sahmkow | 2019-10-31 | 2 | -5/+4 |
| |_|/ |/| | | | | | | | | | | | | | | Originally on the last commit I thought TLD4 acted the same as TLD4S and didn't have a mask. It actually does have a component mask. This commit corrects that. | ||||
* | | | Merge pull request #3050 from FernandoS27/fix-tld4 | Rodrigo Locatti | 2019-10-30 | 3 | -11/+55 |
|\ \ \ | | | | | | | | | shader_ir: Fix TLD4 and add bindless variant | ||||
| * | | | Shader_IR: Fix TLD4 and add Bindless Variant. | Fernando Sahmkow | 2019-10-30 | 3 | -11/+55 |
| | | | | | | | | | | | | | | | | | | | | | | | | This commit fixes an issue where not all 4 results of tld4 were being written, the color component was defaulted to red, among other things. It also implements the bindless variant. | ||||
* | | | | Merge pull request #3046 from ReinUsesLisp/clean-gl-state | bunnei | 2019-10-30 | 3 | -291/+156 |
|\ \ \ \ | | | | | | | | | | | gl_state: Miscellaneous clean up | ||||
| * | | | | gl_state: Use std::array::fill instead of std::fill | Rodrigo Locatti | 2019-10-30 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | Co-Authored-By: Mat M. <mathew1800@gmail.com> | ||||
| * | | | | gl_state: Move dirty checks to individual apply calls instead of Apply | ReinUsesLisp | 2019-10-30 | 2 | -66/+74 |
| | | | | | | | | | | | | | | | | | | | | | | | | | This requires removing constness from some methods, but for consistency it's removed in all methods. | ||||
| * | | | | gl_state: Remove ApplyDefaultState | ReinUsesLisp | 2019-10-30 | 3 | -17/+1 |
| | | | | | | | | | | | | | | | | | | | | OpenGL has defaults values we can trust. Remove these. | ||||
| * | | | | gl_state: Change SetDefaultViewports to use default constructor | ReinUsesLisp | 2019-10-30 | 1 | -13/+2 |
| | | | | | |||||
| * | | | | gl_state: Minor style changes | ReinUsesLisp | 2019-10-30 | 1 | -3/+5 |
| | | | | | |||||
| * | | | | gl_state: Remove unused Citra TextureUnits | ReinUsesLisp | 2019-10-30 | 1 | -23/+0 |
| | | | | | |||||
| * | | | | gl_state: Move initializers from constructor to class declaration | ReinUsesLisp | 2019-10-30 | 2 | -170/+75 |
| | |_|/ | |/| | | |||||
* | | | | Merge pull request #3035 from ReinUsesLisp/rasterizer-accelerated | bunnei | 2019-10-30 | 5 | -45/+98 |
|\ \ \ \ | |/ / / |/| | | | rasterizer_accelerated: Add intermediary for GPU rasterizers | ||||
| * | | | rasterizer_accelerated: Add intermediary for GPU rasterizers | ReinUsesLisp | 2019-10-27 | 5 | -45/+98 |
| | | | | | | | | | | | | | | | | | | | | | | | | Add an intermediary class that implements common functions across GPU accelerated rasterizers. This avoids code repetition on different backends. | ||||
* | | | | Merge pull request #3004 from ReinUsesLisp/maxwell3d-cleanup | Rodrigo Locatti | 2019-10-30 | 6 | -81/+20 |
|\ \ \ \ | |_|/ / |/| | | | maxwell_3d: Remove unused entries | ||||
| * | | | maxwell_3d/kepler_compute: Remove unused arguments in GetTexture | ReinUsesLisp | 2019-10-28 | 5 | -37/+20 |
| | | | | |||||
| * | | | video_core/textures: Remove unused index entry in FullTextureInfo | ReinUsesLisp | 2019-10-28 | 2 | -2/+0 |
| | | | | |||||
| * | | | maxwell_3d: Remove unused method GetStageTextures | ReinUsesLisp | 2019-10-28 | 2 | -42/+0 |
| | | | | |||||
* | | | | Merge pull request #3037 from FernandoS27/new-formats | Rodrigo Locatti | 2019-10-28 | 4 | -5/+22 |
|\ \ \ \ | |/ / / |/| | | | video_core: Implement texture format E5B9G9R9_SHAREDEXP. | ||||
| * | | | Video_Core: Implement texture format E5B9G9R9_SHAREDEXP. | Fernando Sahmkow | 2019-10-27 | 4 | -5/+22 |
| |/ / | | | | | | | | | | | | | This commit implements the E5B9G9R9 Texture format into the general system and OpenGL backend. | ||||
* | | | Merge pull request #3034 from ReinUsesLisp/w4244-maxwell3d | bunnei | 2019-10-27 | 2 | -24/+25 |
|\ \ \ | | | | | | | | | maxwell_3d: Silence implicit conversion warnings | ||||
| * | | | maxwell_3d: Silence implicit conversion warnings | ReinUsesLisp | 2019-10-27 | 2 | -24/+25 |
| |/ / | | | | | | | | | | While we are at it, unify types for dirty reg pointers. | ||||
* / / | astc: Silence implicit conversion warnings | ReinUsesLisp | 2019-10-27 | 1 | -7/+8 |
|/ / | |||||
* | | Merge pull request #2976 from FernandoS27/cache-fast-brx-rebased | Rodrigo Locatti | 2019-10-26 | 26 | -862/+1467 |
|\ \ | | | | | | | Implement Fast BRX, fix TXQ and addapt the Shader Cache for it | ||||
| * | | Shader_IR: Address Feedback. | Fernando Sahmkow | 2019-10-26 | 8 | -55/+65 |
| | | | |||||
| * | | Shader_IR: Clang format | Fernando Sahmkow | 2019-10-25 | 1 | -2/+1 |
| | | | |||||
| * | | gl_shader_cache: Implement locker variants invalidation | ReinUsesLisp | 2019-10-25 | 4 | -44/+104 |
| | | | |||||
| * | | gl_shader_disk_cache: Store and load fast BRX | ReinUsesLisp | 2019-10-25 | 6 | -50/+210 |
| | | | |||||
| * | | const_buffer_locker: Minor style changes | ReinUsesLisp | 2019-10-25 | 2 | -152/+76 |
| | | | |||||
| * | | gl_shader_decompiler: Move entries to a separate function | ReinUsesLisp | 2019-10-25 | 15 | -722/+420 |
| | | | |||||
| * | | Shader_IR: Implement Fast BRX and allow multi-branches in the CFG. | Fernando Sahmkow | 2019-10-25 | 1 | -1/+1 |
| | | | |||||
| * | | Shader_IR: Correct typo in Consistent method. | Fernando Sahmkow | 2019-10-25 | 2 | -2/+2 |
| | | | |||||
| * | | Shader_IR: allow lookup of texture samplers within the shader_ir for instructions that don't provide it | Fernando Sahmkow | 2019-10-25 | 9 | -46/+363 |
| | | | |||||
| * | | Shader_IR: Implement Fast BRX and allow multi-branches in the CFG. | Fernando Sahmkow | 2019-10-25 | 7 | -130/+258 |
| | | | |||||
| * | | Shader_Cache: setup connection of ConstBufferLocker | Fernando Sahmkow | 2019-10-25 | 10 | -43/+82 |
| | | | |||||
| * | | VideoCore: Unify const buffer accessing along engines and provide ConstBufferLocker class to shaders. | Fernando Sahmkow | 2019-10-25 | 10 | -11/+168 |
| | | | |||||
| * | | Shader_IR: Implement BRX tracking. | Fernando Sahmkow | 2019-10-25 | 1 | -0/+113 |
| | | | |||||
* | | | Merge pull request #3027 from lioncash/lookup | Rodrigo Locatti | 2019-10-26 | 1 | -53/+67 |
|\ \ \ | | | | | | | | | shader_ir: Use std::array with std::pair instead of std::unordered_map | ||||
| * | | | shader_ir: Use std::array with pair instead of unordered_map | Lioncash | 2019-10-24 | 1 | -53/+67 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Given the overall size of the maps are very small, we can use arrays of pairs here instead of always heap allocating a new map every time the functions are called. Given the small size of the maps, the difference in container lookups are negligible, especially given the entries are already sorted. | ||||
* | | | | Merge pull request #3013 from FernandoS27/tld4s-fix | Rodrigo Locatti | 2019-10-26 | 2 | -5/+5 |
|\ \ \ \ | |_|/ / |/| | | | Shader_Ir: Fix TLD4S from using a component mask. | ||||
| * | | | Shader_Ir: Fix TLD4S from using a component mask. | Fernando Sahmkow | 2019-10-22 | 2 | -5/+5 |
| | | | | | | | | | | | | | | | | | | | | | | | | TLD4S always outputs 4 values, the previous code checked a component mask and omitted those values that weren't part of it. This commit corrects that and makes sure all 4 values are set. | ||||
* | | | | Merge pull request #3028 from lioncash/constexpr | Rodrigo Locatti | 2019-10-24 | 1 | -13/+13 |
|\ \ \ \ | | | | | | | | | | | shader_bytecode: Make Matcher constexpr capable | ||||
| * | | | | shader_bytecode: Make Matcher constexpr capable | Lioncash | 2019-10-24 | 1 | -13/+13 |
| | |/ / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | Greatly shrinks the amount of generated code for GetDecodeTable(). Collapses an assembly output of 9000+ lines down to ~3621 with Clang, and 6513 down to ~2616 with GCC, given it's now allowed to construct all the entries as a sequence of constant data. | ||||
* / | | | video_core/shader: Resolve instances of variable shadowing | Lioncash | 2019-10-24 | 6 | -11/+12 |
|/ / / | | | | | | | | | | Silences a few -Wshadow warnings. | ||||
* | | | Merge pull request #2995 from ReinUsesLisp/ignore-gmem | Fernando Sahmkow | 2019-10-22 | 2 | -18/+26 |
|\ \ \ | | | | | | | | | shader_ir/memory: Ignore global memory when tracking fails | ||||
| * | | | shader_ir/memory: Ignore global memory when tracking fails | ReinUsesLisp | 2019-10-22 | 2 | -18/+26 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Ignore global memory operations instead of invoking undefined behaviour when constant buffer tracking fails and we are blasting through asserts, ignore the operation. In the case of LDG this means filling the destination registers with zeroes; for STG this means ignore the instruction as a whole. The default behaviour is still to abort execution on failure. | ||||
* | | | | Merge pull request #2983 from lioncash/fallthrough | Fernando Sahmkow | 2019-10-22 | 2 | -0/+6 |
|\ \ \ \ | |_|/ / |/| | | | gl_shader_decompiler/vk_shader_decompiler: Resolve implicit fallthrough cases | ||||
| * | | | vk_shader_decompiler: Resolve fallthrough within ExprDecompiler's ExprCondCode operator() | Lioncash | 2019-10-16 | 1 | -0/+3 |
| | | | | | | | | | | | | | | | | | | | | This would previously result in NeverExecute and UnusedIndex being treated as regular predicates. | ||||
| * | | | gl_shader_decompiler: Resolve fallthrough within ExprDecompiler's ExprCondCode operator() | Lioncash | 2019-10-16 | 1 | -0/+3 |
| | | | | | | | | | | | | | | | | | | | | This would previously result in NeverExecute and UnusedIndex being treated as regular predicates. | ||||
* | | | | maxwell_3d: Reduce FlushMMEInlineDraw logging to Trace | ReinUsesLisp | 2019-10-20 | 1 | -1/+1 |
| |/ / |/| | | |||||
* | | | Merge pull request #2994 from lioncash/fmt | Rodrigo Locatti | 2019-10-18 | 2 | -40/+51 |
|\ \ \ | | | | | | | | | video_core/shader/ast: Minor changes to ASTPrinter | ||||
| * | | | video_core/shader/ast: Make ShowCurrentState() and SanityCheck() const member functions | Lioncash | 2019-10-18 | 2 | -5/+5 |
| | | | | | | | | | | | | | | | | | | | | These can also trivially be made const member functions, with the addition of a few consts. | ||||
| * | | | video_core/shader/ast: Make ASTManager::Print a const member function | Lioncash | 2019-10-18 | 2 | -3/+3 |
| | | | | | | | | | | | | | | | | | | | | Given all visiting functions never modify the nodes, we can trivially make this a const member function. | ||||
| * | | | video_core/shader/ast: Make ExprPrinter members private | Lioncash | 2019-10-18 | 1 | -1/+2 |
| | | | | | | | | | | | | | | | | | | | | This member already has an accessor, so there's no need for it to be public. | ||||
| * | | | video_core/shader/ast: Make Indent() return a string_view | Lioncash | 2019-10-18 | 1 | -14/+24 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The returned string is simply a substring of our constexpr tabs string_view, so we can just use a string_view here as well, since the original string_view is guaranteed to always exist. Now the function is fully non-allocating. | ||||
| * | | | video_core/shader/ast: Make Indent() private | Lioncash | 2019-10-18 | 1 | -9/+9 |
| | | | | | | | | | | | | | | | | It's never used outside of this class, so we can narrow its scope down. | ||||
| * | | | video_core/shader/ast: Rename Ident() to Indent() | Lioncash | 2019-10-18 | 1 | -13/+13 |
| | | | | | | | | | | | | | | | | | | | | This can be confusing, given "ident" is generally used as a shorthand for "identifier". | ||||
| * | | | video_core/shader/ast: Make use of fmt where applicable | Lioncash | 2019-10-18 | 1 | -14/+14 |
| | | | | | | | | | | | | | | | | | | | | Makes a few strings nicer to read and also eliminates a bit of string churn with operator+. | ||||
* | | | | Merge pull request #2993 from lioncash/vulkan-expr | Rodrigo Locatti | 2019-10-18 | 1 | -21/+23 |
|\ \ \ \ | | | | | | | | | | | vk_shader_decompiler: Mark operator() function parameters as const references | ||||
| * | | | | vk_shader_decompiler: Mark operator() function parameters as const references | Lioncash | 2019-10-18 | 1 | -21/+23 |
| |/ / / | | | | | | | | | | | | | | | | | These parameters aren't actually modified in any way, so they can be made const references. | ||||
* | | | | Merge pull request #2966 from FernandoS27/astc-formats | Rodrigo Locatti | 2019-10-18 | 4 | -79/+211 |
|\ \ \ \ | | | | | | | | | | | Implement a series of ASTC formats and R4G4B4A4 format | ||||
| * | | | | Surfaces: Implement R4G4B4A4U format. | Fernando Sahmkow | 2019-10-09 | 4 | -24/+41 |
| | | | | | |||||
| * | | | | Surfaces: Implement ASTC 6x6 10x10 12x12 8x6 6x5 | Fernando Sahmkow | 2019-10-09 | 4 | -70/+185 |
| | | | | | |||||
* | | | | | Merge pull request #2979 from lioncash/macro | Rodrigo Locatti | 2019-10-18 | 2 | -72/+79 |
|\ \ \ \ \ | |_|/ / / |/| | | | | video_core/macro_interpreter: Make definitions of most private enums/unions hidden | ||||
| * | | | | video_core/macro_interpreter: Make definitions of most private enums/unions hidden | Lioncash | 2019-10-17 | 2 | -72/+79 |
| | |/ / | |/| | | | | | | | | | | | | | | | | | | This allows the implementation of these types to change without requiring a rebuild of everything that includes the macro interpreter header. | ||||
* | | | | Merge pull request #2980 from lioncash/warn | bunnei | 2019-10-17 | 3 | -5/+6 |
|\ \ \ \ | |_|_|/ |/| | | | maxwell_3d: Silence truncation warnings | ||||
| * | | | control_flow: Silence truncation warnings | Lioncash | 2019-10-16 | 2 | -4/+4 |
| | | | | | | | | | | | | | | | | | | | | | | | | This can be trivially fixed by making the input size a size_t. CFGRebuildState's constructor parameter is already a std::size_t, so this just makes the size type fully conform with it. | ||||
| * | | | maxwell_3d: Silence truncation warnings | Lioncash | 2019-10-15 | 1 | -1/+2 |
| |/ / | | | | | | | | | | | | | A trivial warning caused by not using size_t as the argument types instead of u32. | ||||
* | | | Merge pull request #2978 from lioncash/doxygen | Rodrigo Locatti | 2019-10-17 | 1 | -57/+78 |
|\ \ \ | | | | | | | | | video_core/texture_cache: Amend Doxygen references | ||||
| * | | | video_core/texture_cache: Amend Doxygen references | Lioncash | 2019-10-15 | 1 | -57/+78 |
| | | | | | | | | | | | | | | | | | | | | | | | | Amends the doxygen comments so that they properly resolve. While we're at it, we can correct some typos and fix up some of the comments' formatting in order to make them slightly nicer to read. | ||||
* | | | | Merge pull request #2982 from lioncash/surface | Rodrigo Locatti | 2019-10-17 | 1 | -2/+2 |
|\ \ \ \ | | | | | | | | | | | texture_cache: Avoid unnecessary surface copies within PickStrategy() and TryReconstructSurface() | ||||
| * | | | | texture_cache: Avoid unnecessary surface copies within PickStrategy() and TryReconstructSurface() | Lioncash | 2019-10-16 | 1 | -2/+2 |
| | |/ / | |/| | | | | | | | | | | | | | | | | | | We can take these by const reference and avoid making unnecessary copies, preventing some atomic reference count increments and decrements. | ||||
* | | | | Merge pull request #2912 from FernandoS27/async-fixes | bunnei | 2019-10-16 | 10 | -24/+38 |
|\ \ \ \ | | | | | | | | | | | General fixes to Async GPU | ||||
| * | | | | AsyncGpu: Address Feedback | Fernando Sahmkow | 2019-10-11 | 2 | -2/+2 |
| | | | | | |||||
| * | | | | GL_Renderer: Remove lefting snippet. | Fernando Sahmkow | 2019-10-05 | 1 | -2/+0 |
| | | | | | |||||
| * | | | | Gl_Rasterizer: Protect CPU Memory mapping from multiple threads. | Fernando Sahmkow | 2019-10-05 | 2 | -0/+4 |
| | | | | | |||||
| * | | | | Core: Wait for GPU to be idle before shutting down. | Fernando Sahmkow | 2019-10-05 | 6 | -0/+17 |
| | | | | | |||||
| * | | | | Nvdrv: Do framelimiting only in the CPU Thread | Fernando Sahmkow | 2019-10-05 | 1 | -3/+0 |
| | | | | | |||||
| * | | | | GPU_Async: Correct fences, display events and more. | Fernando Sahmkow | 2019-10-05 | 4 | -19/+17 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This commit uses guest fences on vSync event instead of an articial fake fence we had. It also corrects to keep signaling display events while loading the game as the OS is suppose to send buffers to vSync during that time. | ||||
* | | | | | Merge pull request #2984 from lioncash/fallthrough2 | Rodrigo Locatti | 2019-10-16 | 1 | -0/+1 |
|\ \ \ \ \ | | | | | | | | | | | | | video_core/surface: Add missing break in PixelFormatFromTextureFormat() | ||||
| * | | | | | video_core/surface: Add missing break in PixelFormatFromTextureFormat() | Lioncash | 2019-10-16 | 1 | -0/+1 |
| | |/ / / | |/| | | | | | | | | | | | | | Prevents fallthrough into the following case. | ||||
* | | | | | Merge pull request #2981 from lioncash/copy | Rodrigo Locatti | 2019-10-16 | 2 | -34/+32 |
|\ \ \ \ \ | | | | | | | | | | | | | gl_shader_decompiler: Minor cleanup-related changes | ||||
| * | | | | | gl_shader_decompiler: Make ExprDecompiler's GetResult() a const member function | Lioncash | 2019-10-16 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This is only ever used to read, but not write, the resulting string, so we can enforce this by making it a const member function. | ||||
| * | | | | | gl_shader_decompiler: Use a std::string_view with GetDeclarationWithSuffix() | Lioncash | 2019-10-16 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This allows the function to be completely non-allocating for inputs of all sizes (i.e. there's no heap cost for an input to convert to a std::string_view). | ||||
| * | | | | | gl_shader_decompiler: Fold flow_var constant into GetFlowVariable() | Lioncash | 2019-10-16 | 1 | -3/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This is only ever used within this function, so we can narrow it's scope down. | ||||
| * | | | | | gl_shader_decompiler: Mark ASTDecompiler/ExprDecompiler parameters as const references where applicable | Lioncash | 2019-10-16 | 1 | -21/+21 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | These member functions don't actually modify the input parameter, so we can make this explicit with the use of const. | ||||
| * | | | | | gl_shader_decompiler: Pass by reference to GenerateTextureArgument() | Lioncash | 2019-10-16 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | Avoids an unnecessary atomic reference count increment and decrement. | ||||
| * | | | | | gl_shader_decompiler: Use std::holds_alternative within GenerateTexture() | Lioncash | 2019-10-16 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This only ever queries if the type exists within the variant, but doesn't actually do anything with the return value. We can just use std::holds_alternative for this use case. | ||||
| * | | | | | shader/node: std::move Meta instance within OperationNode constructor | Lioncash | 2019-10-16 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | Allows usages of the constructor to avoid an unnecessary copy. | ||||
| * | | | | | gl_shader_decompiler: Avoid unnecessary copies of MetaImage | Lioncash | 2019-10-16 | 1 | -4/+4 |
| |/ / / / | | | | | | | | | | | | | | | | | | | | | | | | | | MetaImage contains a std::vector, so copying here could result in unnecessary reallocations. Given the operation lives throughout the entire scope, this is safe to do. | ||||
* | | | | | Merge pull request #2972 from lioncash/system | bunnei | 2019-10-15 | 1 | -1/+1 |
|\ \ \ \ \ | |/ / / / |/| | | | | {bcat, gpu, nvflinger}: Remove trivial usages of the global system accessor | ||||
| * | | | | video_core/gpu: Remove use of the global system accessor | Lioncash | 2019-10-15 | 1 | -1/+1 |
| | |/ / | |/| | | | | | | | | | | | | | | We can just make use of the reference member variable instead of accessing the global system instance. | ||||
* / | | | common: Rename binary_find.h to algorithm.h | Lioncash | 2019-10-15 | 2 | -3/+3 |
|/ / / | | | | | | | | | | | | | | | | Makes the header more general for other potential algorithms in the future. While we're at it, include a missing <functional> include to satisfy the use of std::less. | ||||
* | | | Merge pull request #2928 from ReinUsesLisp/dirty-depth-bounds | bunnei | 2019-10-09 | 2 | -1/+10 |
|\ \ \ | | | | | | | | | maxwell_3d: Add dirty flags for depth bounds values | ||||
| * | | | maxwell_3d: Add dirty flags for depth bounds values | ReinUsesLisp | 2019-10-05 | 2 | -1/+10 |
| |/ / | | | | | | | | | | | | | This is useful in Vulkan where we want to update depth bounds without caring if it's enabled or disabled through vkCmdSetDepthBounds. | ||||
* | | | Merge pull request #2927 from ReinUsesLisp/polygon-offset-units | bunnei | 2019-10-09 | 1 | -1/+3 |
|\ \ \ | |_|/ |/| | | gl_rasterizer: Fix polygon offset units | ||||
| * | | gl_rasterizer: Fix polygon offset units | ReinUsesLisp | 2019-10-01 | 1 | -1/+3 |
| |/ | | | | | | | | | | | For some reason hardware divides polygon offset units by two. This is visible since drivers multiply the application requested polygon offset by two. | ||||
* | | shader/half_set_predicate: Fix HSETP2 for constant buffers | ReinUsesLisp | 2019-10-07 | 1 | -0/+2 |
| | | | | | | | | | | HSETP2 when used with a constant buffer parses the second operand type as F32. This is not configurable. | ||||
* | | shader/half_set_predicate: Reduce DEBUG_ASSERT to LOG_DEBUG | ReinUsesLisp | 2019-10-07 | 1 | -1/+2 |
| | | |||||
* | | gl_shader_disk_cache: Properly ignore existing cache | ReinUsesLisp | 2019-10-06 | 2 | -16/+17 |
| | | | | | | | | | | Previously old entries where appended to the file even if the shader cache was ignored at boot. Address that issue. | ||||
* | | video_core/control_flow: Eliminate variable shadowing warnings | Lioncash | 2019-10-05 | 1 | -6/+6 |
| | | |||||
* | | video_core/control_flow: Eliminate pessimizing moves | Lioncash | 2019-10-05 | 1 | -5/+8 |
| | | | | | | | | These can inhibit the ability of a compiler to perform RVO. | ||||
* | | video_core/ast: Unindent most of IsFullyDecompiled() by one level | Lioncash | 2019-10-05 | 1 | -12/+12 |
| | | |||||
* | | video_core/ast: Make ShowCurrentState() take a string_view instead of std::string | Lioncash | 2019-10-05 | 2 | -2/+2 |
| | | | | | | | | Allows the function to be non-allocating in terms of the output string. | ||||
* | | video_core/ast: Eliminate variable shadowing warnings | Lioncash | 2019-10-05 | 1 | -3/+3 |
| | | |||||
* | | video_core/ast: Replace std::string with a constexpr std::string_view | Lioncash | 2019-10-05 | 1 | -3/+1 |
| | | | | | | | | Same behavior, but without the need to heap allocate | ||||
* | | video_core/ast: Default the move constructor and assignment operator | Lioncash | 2019-10-05 | 2 | -26/+2 |
| | | | | | | | | | | This is behaviorally equivalent and also fixes a bug where some members weren't being moved over. | ||||
* | | video_core/{ast, expr}: Organize forward declaration | Lioncash | 2019-10-05 | 2 | -10/+10 |
| | | | | | | | | Keeps them alphabetically sorted for readability. | ||||
* | | video_core/expr: Supply operator!= along with operator== | Lioncash | 2019-10-05 | 2 | -1/+32 |
| | | | | | | | | Provides logical symmetry to the interface. | ||||
* | | video_core/{ast, expr}: Use std::move where applicable | Lioncash | 2019-10-05 | 4 | -45/+47 |
| | | | | | | | | Avoids unnecessary atomic reference count increments and decrements. | ||||
* | | video_core/ast: Supply const accessors for data where applicable | Lioncash | 2019-10-05 | 2 | -37/+41 |
| | | | | | | | | | | Provides const equivalents of data accessors for use within const contexts. | ||||
* | | Merge pull request #2888 from FernandoS27/decompiler2 | David | 2019-10-05 | 15 | -160/+2287 |
|\ \ | | | | | | | Shader_IR: Implement a full control flow decompiler for the shader IR. | ||||
| * | | Shader_ir: Address feedback | Fernando Sahmkow | 2019-10-05 | 6 | -65/+24 |
| | | | |||||
| * | | Shader_Ir: Address Feedback and clang format. | Fernando Sahmkow | 2019-10-05 | 4 | -68/+68 |
| | | | |||||
| * | | vk_shader_decompiler: Correct Branches inside conditionals. | Fernando Sahmkow | 2019-10-05 | 1 | -1/+11 |
| | | | |||||
| * | | vk_shader_decompiler: Clean code and be const correct. | Fernando Sahmkow | 2019-10-05 | 2 | -8/+6 |
| | | | |||||
| * | | Shader_IR: clean up AST handling and add documentation. | Fernando Sahmkow | 2019-10-05 | 1 | -2/+6 |
| | | | |||||
| * | | Shader_IR: Correct OutwardMoves for Ifs | Fernando Sahmkow | 2019-10-05 | 1 | -22/+11 |
| | | | |||||
| * | | vk_shader_compiler: Don't enclose branches with if(true) to avoid crashing AMD | Fernando Sahmkow | 2019-10-05 | 1 | -16/+33 |
| | | | |||||
| * | | gl_shader_decompiler: Refactor and address feedback. | Fernando Sahmkow | 2019-10-05 | 1 | -17/+18 |
| | | | |||||
| * | | Shader_IR: corrections and clang-format | Fernando Sahmkow | 2019-10-05 | 2 | -70/+64 |
| | | | |||||
| * | | vk_shader_compiler: Correct SPIR-V AST Decompiling | Fernando Sahmkow | 2019-10-05 | 1 | -4/+11 |
| | | | |||||
| * | | Shader_IR: allow else derivation to be optional. | Fernando Sahmkow | 2019-10-05 | 7 | -10/+18 |
| | | | |||||
| * | | vk_shader_compiler: Implement the decompiler in SPIR-V | Fernando Sahmkow | 2019-10-05 | 3 | -23/+301 |
| | | | |||||
| * | | Shader_IR: mark labels as unused for partial decompile. | Fernando Sahmkow | 2019-10-05 | 2 | -3/+9 |
| | | | |||||
| * | | Shader_Ir: Refactor Decompilation process and allow multiple decompilation modes. | Fernando Sahmkow | 2019-10-05 | 13 | -82/+334 |
| | | | |||||
| * | | gl_shader_decompiler: Implement AST decompiling | Fernando Sahmkow | 2019-10-05 | 11 | -63/+358 |
| | | | |||||
| * | | shader_ir: Declare Manager and pass it to appropiate programs. | Fernando Sahmkow | 2019-10-05 | 7 | -104/+214 |
| | | | |||||
| * | | shader_ir: Corrections to outward movements and misc stuffs | Fernando Sahmkow | 2019-10-05 | 6 | -58/+306 |
| | | | |||||
| * | | shader_ir: Add basic goto elimination | Fernando Sahmkow | 2019-10-05 | 2 | -38/+484 |
| | | | |||||
| * | | shader_ir: Initial Decompile Setup | Fernando Sahmkow | 2019-10-05 | 6 | -5/+510 |
| |/ | |||||
* | | Texture_Cache: Blit Deduction corrections and simplifications. | Fernando Sahmkow | 2019-10-05 | 1 | -18/+20 |
| | | |||||
* | | TextureCache: Add the ability to deduce if two textures are depth on blit. | Fernando Sahmkow | 2019-10-05 | 1 | -2/+142 |
|/ | |||||
* | gl_shader_decompiler: Add tailing return for HUnpack2 | ReinUsesLisp | 2019-09-24 | 1 | -0/+2 |
| | |||||
* | gl_shader_decompiler: Fix clang build issues | ReinUsesLisp | 2019-09-24 | 1 | -26/+23 |
| | |||||
* | Merge pull request #2869 from ReinUsesLisp/suld | bunnei | 2019-09-24 | 11 | -229/+199 |
|\ | | | | | shader/image: Implement SULD and fix SUATOM | ||||
| * | gl_shader_decompiler: Use uint for images and fix SUATOM | ReinUsesLisp | 2019-09-21 | 7 | -188/+93 |
| | | | | | | | | | | | | In the process remove implementation of SUATOM.MIN and SUATOM.MAX as these require a distinction between U32 and S32. These have to be implemented with imageCompSwap loop. | ||||
| * | shader/image: Implement SULD and remove irrelevant code | ReinUsesLisp | 2019-09-21 | 10 | -47/+110 |
| | | | | | | | | | | * Implement SULD as float. * Remove conditional declaration of GL_ARB_shader_viewport_layer_array. | ||||
| * | shader_bytecode: Add SULD encoding | ReinUsesLisp | 2019-09-21 | 1 | -0/+2 |
| | | |||||
* | | Merge pull request #2870 from FernandoS27/multi-draw | David | 2019-09-22 | 9 | -91/+273 |
|\ \ | | | | | | | Implement a MME Draw commands Inliner and correct host instance drawing | ||||
| * | | Maxwell3D: Corrections and refactors to MME instance refactor | Fernando Sahmkow | 2019-09-22 | 4 | -44/+46 |
| | | | |||||
| * | | Rasterizer: Correct introduced bug where a conditional render wouldn't stop a draw call from executing | Fernando Sahmkow | 2019-09-20 | 1 | -10/+16 |
| | | | |||||
| * | | Rasterizer: Refactor and simplify DrawBatch Interface. | Fernando Sahmkow | 2019-09-19 | 4 | -35/+16 |
| | | | |||||
| * | | Rasterizer: Address Feedback and conscerns. | Fernando Sahmkow | 2019-09-19 | 1 | -11/+11 |
| | | | |||||
| * | | Rasterizer: Refactor draw calls, remove deadcode and clean up. | Fernando Sahmkow | 2019-09-19 | 3 | -106/+68 |
| | | | |||||
| * | | VideoCore: Corrections to the MME Inliner and removal of hacky instance management. | Fernando Sahmkow | 2019-09-19 | 6 | -31/+81 |
| | | | |||||
| * | | Video Core: initial Implementation of InstanceDraw Packaging | Fernando Sahmkow | 2019-09-19 | 7 | -11/+192 |
| | | | |||||
* | | | Merge pull request #2891 from FearlessTobi/rod-tex | Fernando Sahmkow | 2019-09-22 | 8 | -24/+39 |
|\ \ \ | | | | | | | | | video_core: Implement RGBX16F and lower Surface Copy log severity | ||||
| * | | | Fix clang-format | FearlessTobi | 2019-09-22 | 2 | -2/+2 |
| | | | | |||||
| * | | | fermi_2d: Lower surface copy log severity to DEBUG | FearlessTobi | 2019-09-22 | 1 | -1/+1 |
| | | | | |||||
| * | | | video_core: Implement RGBX16F PixelFormat | FearlessTobi | 2019-09-22 | 7 | -22/+37 |
| | | | | |||||
* | | | | Merge pull request #2867 from ReinUsesLisp/configure-framebuffers-clean | David | 2019-09-22 | 4 | -121/+33 |
|\ \ \ \ | |/ / / |/| | | | gl_rasterizer: Remove unused code paths from ConfigureFramebuffers | ||||
| * | | | gl_rasterizer: Remove unused code paths from ConfigureFramebuffers | ReinUsesLisp | 2019-09-17 | 4 | -121/+33 |
| | | | | |||||
* | | | | Merge pull request #2878 from FernandoS27/icmp | Rodrigo Locatti | 2019-09-21 | 2 | -0/+42 |
|\ \ \ \ | | | | | | | | | | | shader_ir: Implement ICMP | ||||
| * | | | | Shader_IR: ICMP corrections and fixes | Fernando Sahmkow | 2019-09-21 | 2 | -6/+11 |
| | | | | | |||||
| * | | | | Shader_IR: Implement ICMP. | Fernando Sahmkow | 2019-09-20 | 2 | -0/+37 |
| | |/ / | |/| | | |||||
* | | | | Merge pull request #2868 from ReinUsesLisp/fix-mipmaps | David | 2019-09-21 | 1 | -2/+2 |
|\ \ \ \ | | | | | | | | | | | maxwell_to_gl: Fix mipmap filtering | ||||
| * | | | | maxwell_to_gl: Fix mipmap filtering | ReinUsesLisp | 2019-09-17 | 1 | -2/+2 |
| | |/ / | |/| | | | | | | | | | | | | | | OpenGL texture filters follow GL_<texture_filter>_MIPMAP_<mipmap_filter> but we were using them in the opposite way. | ||||
* | | | | Mark DrawArrays as LOG_TRACE | David Marcec | 2019-09-21 | 1 | -1/+1 |
| |_|/ |/| | | | | | | | | There's no reason to clog logs with DrawArray. | ||||
* | | | Merge pull request #2846 from ReinUsesLisp/fixup-viewport-index | bunnei | 2019-09-20 | 1 | -10/+14 |
|\ \ \ | | | | | | | | | gl_shader_decompiler: Avoid writing output attribute when unimplemented | ||||
| * | | | gl_shader_decompiler: Avoid writing output attribute when unimplemented | ReinUsesLisp | 2019-09-06 | 1 | -10/+14 |
| | | | | |||||
* | | | | Merge pull request #2855 from ReinUsesLisp/shfl | bunnei | 2019-09-20 | 6 | -9/+182 |
|\ \ \ \ | |_|_|/ |/| | | | shader_ir/warp: Implement SHFL for Nvidia devices | ||||
| * | | | shader_ir/warp: Implement SHFL | ReinUsesLisp | 2019-09-17 | 6 | -9/+182 |
| | |/ | |/| | |||||
* | | | Merge pull request #2784 from ReinUsesLisp/smem | bunnei | 2019-09-18 | 5 | -21/+81 |
|\ \ \ | |/ / |/| | | shader_ir: Implement shared memory | ||||
| * | | gl_shader_decompiler: Implement shared memory | ReinUsesLisp | 2019-09-05 | 1 | -0/+23 |
| | | | |||||
| * | | shader_ir: Implement LD_S | ReinUsesLisp | 2019-09-05 | 1 | -10/+13 |
| | | | | | | | | | | | | Loads from shared memory. | ||||
| * | | shader_ir: Implement ST_S | ReinUsesLisp | 2019-09-05 | 4 | -11/+45 |
| | | | | | | | | | | | | | | | This instruction writes to a memory buffer shared with threads within the same work group. It is known as "shared" memory in GLSL. | ||||
* | | | Merge pull request #2851 from ReinUsesLisp/srgb | Fernando Sahmkow | 2019-09-15 | 6 | -30/+9 |
|\ \ \ | | | | | | | | | renderer_opengl: Fix sRGB blits | ||||
| * | | | renderer_opengl: Fix rebase mistake | ReinUsesLisp | 2019-09-11 | 1 | -1/+1 |
| | | | | |||||
| * | | | gl_rasterizer: Correct sRGB Fix regression | Fernando Sahmkow | 2019-09-11 | 1 | -0/+12 |
| | | | | |||||
| * | | | renderer_opengl: Fix sRGB blits | ReinUsesLisp | 2019-09-11 | 6 | -43/+10 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Removes the sRGB hack of tracking if a frame used an sRGB rendertarget to apply at least once to blit the final texture as sRGB. Instead of doing this apply sRGB if the presented image has sRGB. Also enable sRGB by default on Maxwell3D registers as some games seem to assume this. | ||||
* | | | | Merge pull request #2824 from ReinUsesLisp/mme | Fernando Sahmkow | 2019-09-15 | 3 | -4/+20 |
|\ \ \ \ | | | | | | | | | | | Revert "Revert #2466" and stub FirmwareCall 4 | ||||
| * | | | | maxwell_3d: Update firmware 4 call stub commentary | Rodrigo Locatti | 2019-09-15 | 1 | -1/+2 |
| | | | | | |||||
| * | | | | Revert "Revert #2466" and stub FirmwareCall 4 | ReinUsesLisp | 2019-09-04 | 3 | -4/+19 |
| | | | | | |||||
* | | | | | Merge pull request #2857 from ReinUsesLisp/surface-srgb | Fernando Sahmkow | 2019-09-14 | 2 | -0/+22 |
|\ \ \ \ \ | | | | | | | | | | | | | video_core/surface: Add function to detect sRGB surfaces | ||||
| * | | | | | video_core/surface: Add function to detect sRGB surfaces | ReinUsesLisp | 2019-09-13 | 2 | -0/+22 |
| | |/ / / | |/| | | | | | | | | | | | | | | | | | | This is required for proper conversion to RGBA8_UNORM or RGBA8_SRGB surfaces when a backend can target both native and converted ASTC. | ||||
* | | | | | Merge pull request #2858 from ReinUsesLisp/vk-device | Fernando Sahmkow | 2019-09-14 | 3 | -111/+258 |
|\ \ \ \ \ | | | | | | | | | | | | | vk_device: Add miscellaneous features and minor style changes | ||||
| * | | | | | vk_device: Add miscellaneous features and minor style changes | ReinUsesLisp | 2019-09-13 | 3 | -111/+258 |
| |/ / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * Increase minimum Vulkan requirements * Require VK_EXT_vertex_attribute_divisor * Require depthClamp, samplerAnisotropy and largePoints features * Search and expose VK_KHR_uniform_buffer_standard_layout * Search and expose VK_EXT_index_type_uint8 * Search and expose native float16 arithmetics * Track current driver with VK_KHR_driver_properties * Query and expose SSBO alignment * Query more image formats * Improve logging overall * Minor style changes * Minor rephrasing of commentaries | ||||
* / / / / | shader/image: Implement SUATOM and fix SUST | ReinUsesLisp | 2019-09-11 | 7 | -69/+329 |
|/ / / / | |||||
* | | | | Merge pull request #2823 from ReinUsesLisp/shr-clamp | bunnei | 2019-09-10 | 2 | -6/+17 |
|\ \ \ \ | | | | | | | | | | | shader/shift: Implement SHR wrapped and clamped variants | ||||
| * | | | | shader/shift: Implement SHR wrapped and clamped variants | ReinUsesLisp | 2019-09-04 | 2 | -6/+17 |
| |/ / / | | | | | | | | | | | | | | | | | | | | | Nvidia defaults to wrapped shifts, but this is undefined behaviour on OpenGL's spec. Explicitly mask/clamp according to what the guest shader requires. | ||||
* | | | | Merge pull request #2810 from ReinUsesLisp/mme-opt | bunnei | 2019-09-10 | 4 | -12/+24 |
|\ \ \ \ | | | | | | | | | | | maxwell_3d: Avoid moving macro_params | ||||
| * | | | | maxwell_3d: Avoid moving macro_params | ReinUsesLisp | 2019-09-04 | 4 | -12/+24 |
| |/ / / | |||||
* | | | | gl_shader_decompiler: Keep track of written images and mark them as modified | ReinUsesLisp | 2019-09-06 | 7 | -62/+92 |
| | | | | |||||
* | | | | texture_cache: Minor changes | ReinUsesLisp | 2019-09-06 | 4 | -19/+17 |
| | | | | |||||
* | | | | gl_rasterizer: Apply textures and images state | ReinUsesLisp | 2019-09-06 | 1 | -0/+2 |
| | | | | |||||
* | | | | gl_rasterizer: Add samplers to compute dispatches | ReinUsesLisp | 2019-09-06 | 2 | -3/+36 |
| | | | | |||||
* | | | | gl_rasterizer: Minor code changes | ReinUsesLisp | 2019-09-06 | 2 | -20/+31 |
| | | | | |||||
* | | | | gl_state: Split textures and samplers into two arrays | ReinUsesLisp | 2019-09-06 | 4 | -91/+39 |
| | | | | |||||
* | | | | gl_rasterizer: Implement image bindings | ReinUsesLisp | 2019-09-06 | 5 | -32/+106 |
| | | | | |||||
* | | | | gl_state: Add support for glBindImageTextures | ReinUsesLisp | 2019-09-06 | 2 | -0/+24 |
| | | | | |||||
* | | | | texture_cache: Pass TIC to texture cache | ReinUsesLisp | 2019-09-06 | 4 | -27/+25 |
| | | | | |||||
* | | | | kepler_compute: Implement texture queries | ReinUsesLisp | 2019-09-06 | 5 | -5/+99 |
| | | | | |||||
* | | | | gl_rasterizer: Split SetupTextures | ReinUsesLisp | 2019-09-06 | 2 | -22/+38 |
| |_|/ |/| | | |||||
* | | | Merge pull request #2804 from ReinUsesLisp/remove-gs-special | Fernando Sahmkow | 2019-09-05 | 2 | -80/+9 |
|\ \ \ | | | | | | | | | gl_shader_cache: Remove special casing for geometry shaders | ||||
| * | | | gl_shader_cache: Remove special casing for geometry shaders | ReinUsesLisp | 2019-09-04 | 2 | -80/+9 |
| |/ / | | | | | | | | | | | | | Now that ProgramVariants holds the primitive topology we no longer need to keep track of individual geometry shaders topologies. | ||||
* | | | Merge pull request #2833 from ReinUsesLisp/fix-stencil | bunnei | 2019-09-05 | 1 | -11/+11 |
|\ \ \ | |_|/ |/| | | gl_rasterizer: Fix stencil testing | ||||
| * | | gl_rasterizer: Fix stencil testing | ReinUsesLisp | 2019-09-04 | 1 | -11/+11 |
| |/ | | | | | | | | | | | * Fix stencil dirty flags tracking when stencil is disabled * Attach stencil on clears (previously it only attached depth) * Attach stencil on drawing regardless of stencil testing being enabled | ||||
* | | Merge pull request #2802 from ReinUsesLisp/hsetp2-pred | David | 2019-09-05 | 1 | -10/+9 |
|\ \ | | | | | | | half_set_predicate: Fix HSETP2 predicate assignments | ||||
| * | | half_set_predicate: Fix predicate assignments | ReinUsesLisp | 2019-09-04 | 1 | -10/+9 |
| |/ | |||||
* | | gl_shader_decompiler: Fixup slow path | ReinUsesLisp | 2019-09-04 | 1 | -1/+1 |
| | | |||||
* | | gl_device: Disable precise in fragment shaders on bugged drivers | ReinUsesLisp | 2019-09-04 | 3 | -15/+43 |
| | | |||||
* | | gl_shader_decompiler: Fixup AMD's slow path type | ReinUsesLisp | 2019-09-04 | 1 | -1/+1 |
| | | |||||
* | | gl_shader_decompiler: Rework GLSL decompiler type system | ReinUsesLisp | 2019-09-04 | 1 | -416/+505 |
|/ | | | | | | | | | | | | | | | GLSL decompiler type system was broken. We converted all return values to float except for some cases where returning we couldn't and implicitly broke the rule of returning floats (e.g. for bools or bool pairs). Instead of doing this introduce class Expression that knows what type a return value has and when a consumer wants to use the string it asks for it with a required type, emitting a runtime error if types are incompatible. This has the disadvantage that there's more C++ code, but we can emit better GLSL code that's easier to read. | ||||
* | Merge pull request #2793 from ReinUsesLisp/bgr565 | bunnei | 2019-09-04 | 14 | -203/+70 |
|\ | | | | | renderer_opengl: Implement RGB565 framebuffer format | ||||
| * | renderer_opengl: Implement RGB565 framebuffer format | ReinUsesLisp | 2019-08-21 | 3 | -3/+9 |
| | | |||||
| * | renderer_opengl: Use block linear swizzling for CPU framebuffers | ReinUsesLisp | 2019-08-21 | 3 | -150/+33 |
| | | |||||
| * | renderer_opengl: Use VideoCore pixel format | ReinUsesLisp | 2019-08-21 | 3 | -23/+11 |
| | | |||||
| * | gpu: Change optional<reference_wrapper<T>> to T* for FramebufferConfig | ReinUsesLisp | 2019-08-21 | 10 | -31/+21 |
| | | |||||
* | | Merge pull request #2812 from ReinUsesLisp/f2i-selector | bunnei | 2019-09-04 | 2 | -7/+23 |
|\ \ | | | | | | | shader_ir/conversion: Implement F2I and F2F F16 selector | ||||
| * | | shader_ir/conversion: Split int and float selector and implement F2F H1 | ReinUsesLisp | 2019-08-28 | 2 | -19/+24 |
| | | | |||||
| * | | shader_ir/conversion: Implement F2I F16 Ra.H1 | ReinUsesLisp | 2019-08-28 | 2 | -6/+17 |
| | | | |||||
* | | | Merge pull request #2811 from ReinUsesLisp/fsetp-fix | bunnei | 2019-09-04 | 2 | -4/+6 |
|\ \ \ | | | | | | | | | float_set_predicate: Add missing negation bit for the second operand | ||||
| * | | | float_set_predicate: Add missing negation bit for the second operand | ReinUsesLisp | 2019-08-28 | 2 | -4/+6 |
| |/ / | |||||
* | | | Merge pull request #2826 from ReinUsesLisp/macro-binding | bunnei | 2019-09-04 | 2 | -10/+4 |
|\ \ \ | | | | | | | | | maxwell_3d: Fix macro binding cursor | ||||
| * | | | maxwell_3d: Fix macro binding cursor | ReinUsesLisp | 2019-09-01 | 2 | -10/+4 |
| | | | | |||||
* | | | | Merge pull request #2765 from FernandoS27/dma-fix | bunnei | 2019-09-01 | 3 | -23/+36 |
|\ \ \ \ | |/ / / |/| | | | MaxwellDMA: Fixes, corrections and relaxations. | ||||
| * | | | MaxwellDMA: Fixes, corrections and relaxations. | Fernando Sahmkow | 2019-07-26 | 3 | -23/+36 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | This commit fixes offsets on Linear -> Tiled copies, corrects z pos fortiled->linear copies, corrects bytes_per_pixel calculation in tiled -> linear copies and relaxes some limitations set by latest dma fixes refactors. | ||||
* | | | | video_core: Silent miscellaneous warnings (#2820) | Rodrigo Locatti | 2019-08-30 | 23 | -48/+22 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * texture_cache/surface_params: Remove unused local variable * rasterizer_interface: Add missing documentation commentary * maxwell_dma: Remove unused rasterizer reference * video_core/gpu: Sort member declaration order to silent -Wreorder warning * fermi_2d: Remove unused MemoryManager reference * video_core: Silent unused variable warnings * buffer_cache: Silent -Wreorder warnings * kepler_memory: Remove unused MemoryManager reference * gl_texture_cache: Add missing override * buffer_cache: Add missing include * shader/decode: Remove unused variables | ||||
* | | | | gl_buffer_cache: Add missing include | ReinUsesLisp | 2019-08-30 | 1 | -0/+1 |
| | | | | | | | | | | | | | | | | RasterizerInterface was considered an incomplete object by clang. | ||||
* | | | | Merge pull request #2742 from ReinUsesLisp/fix-texture-buffers | bunnei | 2019-08-29 | 4 | -4/+12 |
|\ \ \ \ | | | | | | | | | | | gl_texture_cache: Miscellaneous texture buffer fixes | ||||
| * | | | | gl_shader_decompiler: Rename bufferImage to imageBuffer | ReinUsesLisp | 2019-07-18 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | The online OpenGL documentation is wrong. The type definition is imageBuffer. | ||||
| * | | | | gl_shader_cache: Fix newline on buffer preprocessor definitions | ReinUsesLisp | 2019-07-18 | 1 | -2/+6 |
| | | | | | |||||
| * | | | | textures: Fix texture buffer size calculation | ReinUsesLisp | 2019-07-18 | 1 | -1/+1 |
| | | | | | |||||
| * | | | | gl_texture_cache: Do not set texture parameters to buffers | ReinUsesLisp | 2019-07-18 | 1 | -0/+3 |
| | | | | | |||||
| * | | | | gl_texture_cache: Add missing break in CreateTexture | ReinUsesLisp | 2019-07-18 | 1 | -0/+1 |
| | | | | | |||||
* | | | | | Merge pull request #2783 from FernandoS27/new-buffer-cache | bunnei | 2019-08-29 | 9 | -330/+684 |
|\ \ \ \ \ | | | | | | | | | | | | | Implement a New LLE Buffer Cache | ||||
| * | | | | | Buffer Cache: Adress Feedback. | Fernando Sahmkow | 2019-08-21 | 2 | -7/+6 |
| | | | | | | |||||
| * | | | | | Buffer_Cache: Implement flushing. | Fernando Sahmkow | 2019-08-21 | 2 | -1/+30 |
| | | | | | | |||||
| * | | | | | Buffer_Cache: Implement barriers. | Fernando Sahmkow | 2019-08-21 | 1 | -0/+4 |
| | | | | | | |||||
| * | | | | | Buffer_Cache: Optimize and track written areas. | Fernando Sahmkow | 2019-08-21 | 2 | -12/+104 |
| | | | | | | |||||
| * | | | | | BufferCache: Rework mapping caching. | Fernando Sahmkow | 2019-08-21 | 2 | -49/+76 |
| | | | | | | |||||
| * | | | | | Buffer_Cache: Fixes and optimizations. | Fernando Sahmkow | 2019-08-21 | 2 | -68/+38 |
| | | | | | | |||||
| * | | | | | Video_Core: Implement a new Buffer Cache | Fernando Sahmkow | 2019-08-21 | 9 | -327/+560 |
| | |_|/ / | |/| | | | |||||
* | | | | | Merge pull request #2758 from ReinUsesLisp/packed-tid | bunnei | 2019-08-29 | 3 | -0/+15 |
|\ \ \ \ \ | | | | | | | | | | | | | shader/decode: Implement S2R Tic | ||||
| * | | | | | shader/decode: Implement S2R Tic | ReinUsesLisp | 2019-07-22 | 3 | -0/+15 |
| | | | | | | |||||
* | | | | | | shader_ir: Implement VOTE | ReinUsesLisp | 2019-08-21 | 11 | -1/+162 |
| |/ / / / |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Implement VOTE using Nvidia's intrinsics. Documentation about these can be found here https://developer.nvidia.com/reading-between-threads-shader-intrinsics Instead of using portable ARB instructions I opted to use Nvidia intrinsics because these are the closest we have to how Tegra X1 hardware renders. To stub VOTE on non-Nvidia drivers (including nouveau) this commit simulates a GPU with a warp size of one, returning what is meaningful for the instruction being emulated: * anyThreadNV(value) -> value * allThreadsNV(value) -> value * allThreadsEqualNV(value) -> true ballotARB, also known as "uint64_t(activeThreadsNV())", emits VOTE.ANY Rd, PT, PT; on nouveau's compiler. This doesn't match exactly to Nvidia's code VOTE.ALL Rd, PT, PT; Which is emulated with activeThreadsNV() by this commit. In theory this shouldn't really matter since .ANY, .ALL and .EQ affect the predicates (set to PT on those cases) and not the registers. | ||||
* | | | | | Merge pull request #2769 from FernandoS27/commands-flush | bunnei | 2019-08-21 | 6 | -0/+15 |
|\ \ \ \ \ | | | | | | | | | | | | | GPU: Flush commands on every dma pusher step. | ||||
| * | | | | | GPU: Flush commands on every dma pusher step. | Fernando Sahmkow | 2019-07-26 | 6 | -0/+15 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This commit ensures that the host gpu is constantly fed with commands to work with, while the guest gpu keeps producing the rest of the commands. This reduces syncing time between host and guest gpu. | ||||
* | | | | | | Merge pull request #2777 from ReinUsesLisp/hsetp2-fe3h-fix | bunnei | 2019-08-21 | 1 | -1/+1 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | half_set_predicate: Fix HSETP2_C constant buffer offset | ||||
| * | | | | | | half_set_predicate: Fix HSETP2_C constant buffer offset | ReinUsesLisp | 2019-08-04 | 1 | -1/+1 |
| |/ / / / / | |||||
* | | | | | | Merge pull request #2753 from FernandoS27/float-convert | bunnei | 2019-08-21 | 5 | -18/+75 |
|\ \ \ \ \ \ | |_|_|_|_|/ |/| | | | | | Shader_Ir: Implement F16 Variants of F2F, F2I, I2F. | ||||
| * | | | | | Shader_Ir: Implement F16 Variants of F2F, F2I, I2F. | Fernando Sahmkow | 2019-07-20 | 5 | -18/+75 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This commit takes care of implementing the F16 Variants of the conversion instructions and makes sure conversions are done. | ||||
* | | | | | | Merge pull request #2778 from ReinUsesLisp/nop | bunnei | 2019-08-18 | 2 | -0/+13 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | shader_ir: Implement NOP | ||||
| * | | | | | | shader_ir: Implement NOP | ReinUsesLisp | 2019-08-04 | 2 | -0/+13 |
| | |/ / / / | |/| | | | | |||||
* | | | | | | Merge pull request #2768 from ReinUsesLisp/hsetp2-fix | bunnei | 2019-08-18 | 1 | -3/+3 |
|\ \ \ \ \ \ | |/ / / / / |/| | | | | | decode/half_set_predicate: Fix predicates | ||||
| * | | | | | decode/half_set_predicate: Fix predicates | ReinUsesLisp | 2019-07-26 | 1 | -3/+3 |
| | | | | | | |||||
* | | | | | | Merge pull request #2592 from FernandoS27/sync1 | bunnei | 2019-07-26 | 9 | -59/+105 |
|\ \ \ \ \ \ | |/ / / / / |/| | | | | | Implement GPU Synchronization Mechanisms & Correct NVFlinger | ||||
| * | | | | | NVServices: Styling, define constructors as explicit and corrections | Fernando Sahmkow | 2019-07-05 | 4 | -24/+24 |
| | | | | | | |||||
| * | | | | | NVFlinger: Correct GCC compile error | Fernando Sahmkow | 2019-07-05 | 2 | -6/+6 |
| | | | | | | |||||
| * | | | | | NVServices: Make NVEvents Automatic according to documentation. | Fernando Sahmkow | 2019-07-05 | 2 | -3/+6 |
| | | | | | | |||||
| * | | | | | GPU: Correct Interrupts to interrupt on syncpt/value instead of event, mirroring hardware | Fernando Sahmkow | 2019-07-05 | 5 | -29/+23 |
| | | | | | | |||||
| * | | | | | gpu_asynch: Simplify synchronization to a simpler consumer->producer scheme. | Fernando Sahmkow | 2019-07-05 | 2 | -47/+3 |
| | | | | | | |||||
| * | | | | | nv_host_ctrl: Make Sync GPU variant always return synced result. | Fernando Sahmkow | 2019-07-05 | 4 | -5/+11 |
| | | | | | | |||||
| * | | | | | Async GPU: do invalidate as synced operation | Fernando Sahmkow | 2019-07-05 | 1 | -6/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | Async GPU: Always invalidate synced. | ||||
| * | | | | | Gpu: use an std mutex instead of a spin_lock to guard syncpoints | Fernando Sahmkow | 2019-07-05 | 2 | -6/+6 |
| | | | | | | |||||
| * | | | | | Gpu: Mark areas as protected. | Fernando Sahmkow | 2019-07-05 | 2 | -0/+13 |
| | | | | | | |||||
| * | | | | | nv_services: Stub CtrlEventSignal | Fernando Sahmkow | 2019-07-05 | 2 | -1/+14 |
| | | | | | | |||||
| * | | | | | Gpu: Implement Hardware Interrupt Manager and manage GPU interrupts | Fernando Sahmkow | 2019-07-05 | 5 | -4/+21 |
| | | | | | | |||||
| * | | | | | video_core: Implement GPU side Syncpoints | Fernando Sahmkow | 2019-07-05 | 3 | -2/+51 |
| | | | | | | |||||
* | | | | | | Merge pull request #2739 from lioncash/cflow | bunnei | 2019-07-25 | 4 | -33/+53 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | video_core/control_flow: Minor changes/warning cleanup | ||||
| * | | | | | | video_core/control_flow: Provide operator!= for types with operator== | Lioncash | 2019-07-19 | 1 | -4/+21 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | Provides operational symmetry for the respective structures. | ||||
| * | | | | | | video_core/control_flow: Prevent sign conversion in TryGetBlock() | Lioncash | 2019-07-19 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The return value is a u32, not an s32, so this would result in an implicit signedness conversion. | ||||
| * | | | | | | video_core/control_flow: Remove unnecessary BlockStack copy constructor | Lioncash | 2019-07-19 | 1 | -2/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This is the default behavior of the copy constructor, so it doesn't need to be specified. While we're at it we can make the other non-default constructor explicit. | ||||
| * | | | | | | video_core/control_flow: Use std::move where applicable | Lioncash | 2019-07-19 | 1 | -10/+15 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | Results in less work being done where avoidable. | ||||
| * | | | | | | video_core/control_flow: Use the prefix variant of operator++ for iterators | Lioncash | 2019-07-19 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Same thing, but potentially allows a standard library implementation to pick a more efficient codepath. | ||||
| * | | | | | | video_core/control_flow: Use empty() member function for checking emptiness | Lioncash | 2019-07-19 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | It's what it's there for. | ||||
| * | | | | | | video_core: Resolve -Wreorder warnings | Lioncash | 2019-07-19 | 2 | -4/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Ensures that the constructor members are always initialized in the order that they're declared in. | ||||
| * | | | | | | video_core/control_flow: Make program_size for ScanFlow() a std::size_t | Lioncash | 2019-07-19 | 2 | -5/+4 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Prevents a truncation warning from occurring with MSVC. Also the internal data structures already treat it as a size_t, so this is just a discrepancy in the interface. | ||||
| * | | | | | | video_core/control_flow: Place all internally linked types/functions within an anonymous namespace | Lioncash | 2019-07-19 | 1 | -1/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Previously, quite a few functions were being linked with external linkage. | ||||
| * | | | | | | video_core/shader/decode: Prevent sign-conversion warnings | Lioncash | 2019-07-19 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | Makes it explicit that the conversions here are intentional. | ||||
* | | | | | | | Merge pull request #2737 from FernandoS27/track-fix | bunnei | 2019-07-25 | 1 | -2/+2 |
|\ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | Shader_Ir: Correct tracking to track from right to left | ||||
| * | | | | | | | Shader_Ir: Correct tracking to track from right to left | Fernando Sahmkow | 2019-07-16 | 1 | -2/+2 |
| | | | | | | | | |||||
* | | | | | | | | Merge pull request #2743 from FernandoS27/surpress-assert | bunnei | 2019-07-25 | 9 | -29/+36 |
|\ \ \ \ \ \ \ \ | |_|_|_|_|_|_|/ |/| | | | | | | | Downgrade and suppress a series of GPU asserts and debug messages. | ||||
| * | | | | | | | Shader_Ir: Change Debug Asserts for Log Warnings | Fernando Sahmkow | 2019-07-20 | 3 | -10/+17 |
| | | | | | | | | |||||
| * | | | | | | | Shader_Ir: correct clang format | Fernando Sahmkow | 2019-07-18 | 1 | -2/+2 |
| | | | | | | | | |||||
| * | | | | | | | GPU: Add missing puller methods. | Fernando Sahmkow | 2019-07-18 | 2 | -14/+15 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This adds some missing puller methods. We don't assert them as these are nop operations for us. | ||||
| * | | | | | | | MaxwellDMA/KeplerCopy: Downgrade DMA log message to Trace. | Fernando Sahmkow | 2019-07-18 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This log was just to know which games used DMA. It's no longer important. | ||||
| * | | | | | | | Gl_Texture_Cache: Remove assert on component type in GetFormatTuple | Fernando Sahmkow | 2019-07-18 | 1 | -1/+0 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Textures can have different components types in different orders. This assert was completely inprecise and the effectiveness of such is better handled by case and within the texture cache. | ||||
| * | | | | | | | Shader_Ir: Downgrade precision and rounding asserts to debug asserts. | Fernando Sahmkow | 2019-07-18 | 5 | -10/+10 |
| | |_|_|_|_|/ | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This commit reduces the sevirity of asserts for FP precision and rounding as this are well known and have little to no consequences in gpu's accuracy. | ||||
* | | | | | | | Merge pull request #2704 from FernandoS27/conditional | bunnei | 2019-07-24 | 3 | -2/+100 |
|\ \ \ \ \ \ \ | |_|_|_|_|_|/ |/| | | | | | | maxwell3d: Implement Conditional Rendering | ||||
| * | | | | | | maxwell3d: Implement Conditional Rendering | Fernando Sahmkow | 2019-07-17 | 3 | -2/+100 |
| |/ / / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Conditional Rendering takes care of conditionaly clearing or drawing depending on a set of queries. This PR implements the query checks to stablish if things can be rendered or not. | ||||
* | | | | | | Merge pull request #2734 from ReinUsesLisp/compute-shaders | bunnei | 2019-07-22 | 15 | -140/+357 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | gl_rasterizer: Implement compute shaders | ||||
| * | | | | | | gl_shader_cache: Fix clang-format issues | ReinUsesLisp | 2019-07-16 | 2 | -4/+2 |
| | | | | | | | |||||
| * | | | | | | gl_shader_decompiler: Stub local memory size | ReinUsesLisp | 2019-07-15 | 1 | -8/+14 |
| | | | | | | | |||||
| * | | | | | | gl_shader_cache: Address review commentaries | ReinUsesLisp | 2019-07-15 | 4 | -13/+12 |
| | | | | | | | |||||
| * | | | | | | gl_shader_cache: Address CI issues | ReinUsesLisp | 2019-07-15 | 2 | -3/+3 |
| | | | | | | | |||||
| * | | | | | | gl_rasterizer: Implement compute shaders | ReinUsesLisp | 2019-07-15 | 15 | -136/+350 |
| | | | | | | | |||||
* | | | | | | | Merge pull request #2735 from FernandoS27/pipeline-rework | bunnei | 2019-07-21 | 14 | -116/+528 |
|\ \ \ \ \ \ \ | |_|_|_|_|_|/ |/| | | | | | | Rework Dirty Flags in GPU Pipeline, Optimize CBData and Redo Clearing mechanism | ||||
| * | | | | | | Maxwell3D: Reorganize and address feedback | Fernando Sahmkow | 2019-07-20 | 3 | -20/+33 |
| | | | | | | | |||||
| * | | | | | | GL_State: Feedback and fixes | Fernando Sahmkow | 2019-07-17 | 4 | -14/+27 |
| | | | | | | | |||||
| * | | | | | | Maxwell3D: Address Feedback | Fernando Sahmkow | 2019-07-17 | 5 | -17/+13 |
| | | | | | | | |||||
| * | | | | | | Texture_Cache: Rebase Fixes | Fernando Sahmkow | 2019-07-17 | 1 | -6/+0 |
| | | | | | | | |||||
| * | | | | | | GL_Rasterizer: Corrections to Clearing. | Fernando Sahmkow | 2019-07-17 | 4 | -12/+28 |
| | | | | | | | |||||
| * | | | | | | Maxwell3D: Correct marking dirtiness on CB upload | Fernando Sahmkow | 2019-07-17 | 1 | -0/+1 |
| | | | | | | | |||||
| * | | | | | | GL_Rasterizer: Rework RenderTarget/DepthBuffer clearing | Fernando Sahmkow | 2019-07-17 | 3 | -7/+63 |
| | | | | | | | |||||
| * | | | | | | Maxwell3D: Implement State Dirty Flags. | Fernando Sahmkow | 2019-07-17 | 6 | -44/+199 |
| | | | | | | | |||||
| * | | | | | | Maxwell3D: Rework CBData Upload | Fernando Sahmkow | 2019-07-17 | 2 | -8/+45 |
| | | | | | | | |||||
| * | | | | | | Maxwell3D: Rework the dirty system to be more consistant and scaleable | Fernando Sahmkow | 2019-07-17 | 10 | -80/+211 |
| | |/ / / / | |/| | | | | |||||
* | | | | | | shader/half_set_predicate: Fix HSETP2 implementation | ReinUsesLisp | 2019-07-20 | 4 | -44/+23 |
| | | | | | | |||||
* | | | | | | shader/half_set_predicate: Implement missing HSETP2 variants | ReinUsesLisp | 2019-07-20 | 2 | -19/+49 |
| |_|_|/ / |/| | | | | |||||
* | | | | | Merge pull request #2738 from lioncash/shader-ir | bunnei | 2019-07-18 | 8 | -99/+103 |
|\ \ \ \ \ | |/ / / / |/| | | | | shader-ir: Minor cleanup-related changes | ||||
| * | | | | shader_ir: std::move Node instance where applicable | Lioncash | 2019-07-17 | 4 | -60/+67 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | These are std::shared_ptr instances underneath the hood, which means copying them isn't as cheap as a regular pointer. Particularly so on weakly-ordered systems. This avoids atomic reference count increments and decrements where they aren't necessary for the core set of operations. | ||||
| * | | | | shader_ir: Rename Get/SetTemporal to Get/SetTemporary | Lioncash | 2019-07-17 | 5 | -36/+36 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This is more accurate in terms of describing what the functions are actually doing. Temporal relates to time, not the setting of a temporary itself. | ||||
| * | | | | shader_ir: Remove unused includes | Lioncash | 2019-07-17 | 1 | -3/+0 |
| | |/ / | |/| | | | | | | | | | | Removes unnecessary header dependencies. | ||||
* | | | | Merge pull request #2740 from lioncash/bra | Fernando Sahmkow | 2019-07-17 | 1 | -1/+1 |
|\ \ \ \ | |/ / / |/| | | | shader/decode/other: Correct branch indirect argument within BRA handling | ||||
| * | | | shader/decode/other: Correct branch indirect argument within BRA handling | Lioncash | 2019-07-16 | 1 | -1/+1 |
| |/ / | | | | | | | | | | | | | This appears to have been a copy/paste error introduced within 8a6fc529a968e007f01464abadd32f9b5eb0a26c | ||||
* | | | Merge pull request #2565 from ReinUsesLisp/track-indirect | Fernando Sahmkow | 2019-07-16 | 6 | -35/+36 |
|\ \ \ | |/ / |/| | | shader/track: Track indirect buffers | ||||
| * | | shader: Allow tracking of indirect buffers without variable offset | ReinUsesLisp | 2019-07-15 | 6 | -35/+36 |
| | | | | | | | | | | | | | | | | | | While changing this code, simplify tracking code to allow returning the base address node, this way callers don't have to manually rebuild it on each invocation. | ||||
* | | | Merge pull request #2695 from ReinUsesLisp/layer-viewport | Fernando Sahmkow | 2019-07-15 | 10 | -40/+137 |
|\ \ \ | | | | | | | | | gl_shader_decompiler: Implement gl_ViewportIndex and gl_Layer in vertex shaders | ||||
| * | | | gl_shader_decompiler: Fix gl_PointSize redeclaration | ReinUsesLisp | 2019-07-11 | 1 | -1/+1 |
| | | | | |||||
| * | | | gl_shader_decompiler: Fix conditional usage of GL_ARB_shader_viewport_layer_array | ReinUsesLisp | 2019-07-11 | 1 | -2/+3 |
| | | | | |||||
| * | | | gl_shader_decompiler: Implement gl_ViewportIndex and gl_Layer in vertex shaders | ReinUsesLisp | 2019-07-08 | 10 | -40/+136 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This commit implements gl_ViewportIndex and gl_Layer in vertex and geometry shaders. In the case it's used in a vertex shader, it requires ARB_shader_viewport_layer_array. This extension is available on AMD and Nvidia devices (mesa and proprietary drivers), but not available on Intel on any platform. At the moment of writing this description I don't know if this is a hardware limitation or a driver limitation. In the case that ARB_shader_viewport_layer_array is not available, writes to these registers on a vertex shader are ignored, with the appropriate logging. | ||||
* | | | | Merge pull request #2705 from FernandoS27/tex-cache-fixes | bunnei | 2019-07-15 | 7 | -22/+58 |
|\ \ \ \ | |_|/ / |/| | | | GPU: Fixes to Texture Cache and Include Microprofiles for GL State/BufferCopy/Macro Interpreter | ||||
| * | | | Texture_Cache: Address Feedback | Fernando Sahmkow | 2019-07-14 | 3 | -13/+17 |
| | | | | |||||
| * | | | Texture_Cache: Remove some unprecise fallback case and clang format | Fernando Sahmkow | 2019-07-14 | 2 | -13/+5 |
| | | | | |||||
| * | | | Texture_Cache: Force Framebuffer reset if an active render target is unregistered. | Fernando Sahmkow | 2019-07-14 | 3 | -10/+36 |
| | | | | |||||
| * | | | GPU: Add a microprofile for macro interpreter | Fernando Sahmkow | 2019-07-14 | 2 | -1/+6 |
| | | | | |||||
| * | | | GL_State: Add a microprofile timer to OpenGL state. | Fernando Sahmkow | 2019-07-14 | 1 | -0/+4 |
| | | | | |||||
| * | | | Gl_Texture_Cache: Measure Buffer Copy Times | Fernando Sahmkow | 2019-07-14 | 1 | -0/+2 |
| | | | | |||||
| * | | | Texture_Cache: Correct Linear Structural Match. | Fernando Sahmkow | 2019-07-14 | 1 | -3/+6 |
| | | | | |||||
* | | | | Merge pull request #2675 from ReinUsesLisp/opengl-buffer-cache | bunnei | 2019-07-15 | 16 | -407/+537 |
|\ \ \ \ | |/ / / |/| | | | buffer_cache: Implement a generic buffer cache and its OpenGL backend | ||||
| * | | | buffer_cache: Avoid [[nodiscard]] to make clang-format happy | ReinUsesLisp | 2019-07-06 | 1 | -5/+4 |
| | | | | |||||
| * | | | buffer_cache: Try to fix MinGW build | ReinUsesLisp | 2019-07-06 | 1 | -1/+1 |
| | | | | |||||
| * | | | gl_rasterizer: Fix nullptr dereference on disabled buffers | ReinUsesLisp | 2019-07-06 | 3 | -5/+5 |
| | | | | |||||
| * | | | gl_rasterizer: Minor style changes | ReinUsesLisp | 2019-07-06 | 4 | -32/+22 |
| | | | | |||||
| * | | | gl_rasterizer: Fix vertex and index data invalidations | ReinUsesLisp | 2019-07-06 | 4 | -8/+67 |
| | | | | |||||
| * | | | gl_buffer_cache: Implement with generic buffer cache | ReinUsesLisp | 2019-07-06 | 8 | -291/+92 |
| | | | | |||||
| * | | | buffer_cache: Implement a generic buffer cache | ReinUsesLisp | 2019-07-06 | 2 | -0/+301 |
| | | | | | | | | | | | | | | | | | | | | | | | | Implements a templated class with a similar approach to our current generic texture cache. It is designed to be compatible with Vulkan and OpenGL, | ||||
| * | | | gl_buffer_cache: Remove global system getters | ReinUsesLisp | 2019-07-06 | 3 | -9/+14 |
| | | | | |||||
| * | | | gl_device: Query SSBO alignment | ReinUsesLisp | 2019-07-06 | 2 | -0/+6 |
| | | | | |||||
| * | | | gl_buffer_cache: Implement flushing | ReinUsesLisp | 2019-07-06 | 2 | -2/+11 |
| | | | | |||||
| * | | | gl_rasterizer: Drop gl_global_cache in favor of gl_buffer_cache | ReinUsesLisp | 2019-07-06 | 7 | -206/+35 |
| | | | | |||||
| * | | | gl_buffer_cache: Rework to support internalized buffers | ReinUsesLisp | 2019-07-06 | 3 | -65/+174 |
| | | | | |||||
| * | | | gl_buffer_cache: Store in CachedBufferEntry the used buffer handle | ReinUsesLisp | 2019-07-06 | 2 | -23/+30 |
| | | | | |||||
| * | | | gl_buffer_cache: Return used buffer from Upload function | ReinUsesLisp | 2019-07-06 | 4 | -36/+35 |
| | | | | |||||
| * | | | gl_rasterizer: Add some commentaries | ReinUsesLisp | 2019-07-06 | 1 | -0/+5 |
| | | | | |||||
| * | | | gl_rasterizer: Make DrawParameters rasterizer instance const | ReinUsesLisp | 2019-07-06 | 1 | -1/+1 |
| | | | | |||||
| * | | | gl_rasterizer: Move index buffer uploading to its own method | ReinUsesLisp | 2019-07-06 | 2 | -7/+18 |
| | |/ | |/| | |||||
* | | | Merge pull request #2690 from SciresM/physmem_fixes | Fernando Sahmkow | 2019-07-14 | 3 | -4/+30 |
|\ \ \ | | | | | | | | | Implement MapPhysicalMemory/UnmapPhysicalMemory | ||||
| * | | | prefer system reference over global accessor | Michael Scire | 2019-07-09 | 3 | -9/+13 |
| | | | | |||||
| * | | | Prevent merging of device mapped memory blocks. | Michael Scire | 2019-07-09 | 1 | -1/+23 |
| |/ / | | | | | | | | | | | | | | | | | | | This sets the DeviceMapped attribute for GPU-mapped memory blocks, and prevents merging device mapped blocks. This prevents memory mapped from the gpu from having its backing address changed by block coalesce. | ||||
* | | | Merge pull request #2692 from ReinUsesLisp/tlds-f16 | Fernando Sahmkow | 2019-07-14 | 2 | -2/+9 |
|\ \ \ | | | | | | | | | shader/texture: Add F16 support for TLDS | ||||
| * | | | shader/texture: Add F16 support for TLDS | ReinUsesLisp | 2019-07-07 | 2 | -2/+9 |
| |/ / | |||||
* | | | Merge pull request #2609 from FernandoS27/new-scan | bunnei | 2019-07-11 | 14 | -122/+772 |
|\ \ \ | | | | | | | | | Implement a New Shader Scanner, Decompile Flow Stack and implement BRX BRA.CC | ||||
| * | | | shader_ir: Add comments on missing instruction. | Fernando Sahmkow | 2019-07-09 | 2 | -2/+9 |
| | | | | | | | | | | | | | | | | Also shows Nvidia's address space on comments. | ||||
| * | | | shader_ir: limit explorastion to best known program size. | Fernando Sahmkow | 2019-07-09 | 1 | -1/+1 |
| | | | | |||||
| * | | | control_flow: Correct block breaking algorithm. | Fernando Sahmkow | 2019-07-09 | 1 | -17/+17 |
| | | | | |||||
| * | | | control_flow: Assert shaders bigger than limit. | Fernando Sahmkow | 2019-07-09 | 1 | -0/+2 |
| | | | | |||||
| * | | | control_flow: Address feedback. | Fernando Sahmkow | 2019-07-09 | 1 | -89/+37 |
| | | | | |||||
| * | | | shader_ir: Correct parsing of scheduling instructions and correct sizing | Fernando Sahmkow | 2019-07-09 | 2 | -13/+30 |
| | | | | |||||
| * | | | shader_ir: Correct max sizing | Fernando Sahmkow | 2019-07-09 | 2 | -2/+2 |
| | | | | |||||
| * | | | shader_ir: Remove unnecessary constructors and use optional for ScanFlow result | Fernando Sahmkow | 2019-07-09 | 3 | -28/+17 |
| | | | | |||||
| * | | | shader_ir: Corrections, documenting and asserting control_flow | Fernando Sahmkow | 2019-07-09 | 3 | -52/+54 |
| | | | | |||||
| * | | | shader_ir: Unify blocks in decompiled shaders. | Fernando Sahmkow | 2019-07-09 | 7 | -58/+85 |
| | | | | |||||
| * | | | shader_ir: Decompile Flow Stack | Fernando Sahmkow | 2019-07-09 | 4 | -11/+206 |
| | | | | |||||
| * | | | shader_ir: propagate shader size to the IR | Fernando Sahmkow | 2019-07-09 | 6 | -17/+28 |
| | | | | |||||
| * | | | shader_ir: Implement BRX & BRA.CC | Fernando Sahmkow | 2019-07-09 | 6 | -4/+76 |
| | | | | |||||
| * | | | shader_ir: Remove the old scanner. | Fernando Sahmkow | 2019-07-09 | 2 | -77/+0 |
| | | | | |||||
| * | | | shader_ir: Implement a new shader scanner | Fernando Sahmkow | 2019-07-09 | 4 | -16/+473 |
| | |/ | |/| | |||||
* | | | Merge pull request #2697 from lioncash/doc | bunnei | 2019-07-10 | 1 | -7/+9 |
|\ \ \ | | | | | | | | | gl_rasterizer: Amend documentation comment for ConfigureFramebuffers() | ||||
| * | | | gl_rasterizer: Amend documentation comment for ConfigureFramebuffers() | Lioncash | 2019-07-09 | 1 | -7/+9 |
| | |/ | |/| | | | | | | | | | | | | | | | | | | | must_reconfigure isn't a parameter for this function any more, so it can be replaced with current_state. While we're at it, we can make the parameters of the declaration match the same name as the ones in the definition. | ||||
* | | | Merge pull request #2686 from ReinUsesLisp/vk-scheduler | bunnei | 2019-07-10 | 6 | -50/+60 |
|\ \ \ | | | | | | | | | vk_scheduler: Drop execution context in favor of views | ||||
| * | | | vk_scheduler: Drop execution context in favor of views | ReinUsesLisp | 2019-07-07 | 6 | -50/+60 |
| |/ / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Instead of passing by copy an execution context through out the whole Vulkan call hierarchy, use a command buffer view and fence view approach. This internally dereferences the command buffer or fence forcing the user to be unable to use an outdated version of it on normal usage. It is still possible to keep store an outdated if it is casted to VKFence& or vk::CommandBuffer. While changing this file, add an extra parameter for Flush and Finish to allow releasing the fence from this calls. | ||||
* | | | Merge pull request #2691 from lioncash/override | bunnei | 2019-07-10 | 2 | -7/+4 |
|\ \ \ | |_|/ |/| | | video_core: Add missing override specifiers | ||||
| * | | vk_sampler_cache: Remove unused includes | Lioncash | 2019-07-07 | 1 | -3/+0 |
| | | | | | | | | | | | | These are no longer used within this header, so they can be removed. | ||||
| * | | video_core: Add missing override specifiers | Lioncash | 2019-07-07 | 2 | -4/+4 |
| |/ | |||||
* / | Delete decode_integer_set.cpp | Tobias | 2019-07-07 | 1 | -0/+0 |
|/ | |||||
* | Merge pull request #2601 from FernandoS27/texture_cache | Zach Hilman | 2019-07-05 | 55 | -3269/+4114 |
|\ | | | | | Implement a new Texture Cache | ||||
| * | texture_cache: Address Feedback | Fernando Sahmkow | 2019-07-05 | 4 | -12/+13 |
| | | |||||
| * | texture_cache: Correct Texture Buffer Uploading | Fernando Sahmkow | 2019-07-05 | 3 | -2/+18 |
| | | |||||
| * | texture_cache: Pack sibling queries inside a method | ReinUsesLisp | 2019-06-30 | 1 | -6/+8 |
| | | |||||
| * | texture_cache: Use std::vector reservation for sampled_textures | ReinUsesLisp | 2019-06-30 | 1 | -17/+10 |
| | | |||||
| * | texture_cache: Style changes | ReinUsesLisp | 2019-06-30 | 3 | -17/+13 |
| | | |||||
| * | texture_cache: Use std::array for siblings_table | ReinUsesLisp | 2019-06-29 | 1 | -10/+13 |
| | | |||||
| * | texture_cache: Address feedback | ReinUsesLisp | 2019-06-29 | 4 | -30/+13 |
| | | |||||
| * | texture_cache: Correct variable naming. | Fernando Sahmkow | 2019-06-26 | 1 | -3/+3 |
| | | |||||
| * | gl_texture_cache: Correct asserts | Fernando Sahmkow | 2019-06-26 | 2 | -2/+2 |
| | | |||||
| * | texture_cache: Corrections, documentation and asserts | Fernando Sahmkow | 2019-06-26 | 1 | -42/+42 |
| | | |||||
| * | surface_params: Corrections, asserts and documentation. | Fernando Sahmkow | 2019-06-26 | 2 | -43/+58 |
| | | |||||
| * | copy_params: use constexpr for constructor | Fernando Sahmkow | 2019-06-25 | 1 | -3/+4 |
| | | |||||
| * | gl_texture_cache: Corrections and fixes | Fernando Sahmkow | 2019-06-25 | 2 | -13/+9 |
| | | |||||
| * | gl_resource_manager: Correct MakeStreamCopy | Fernando Sahmkow | 2019-06-25 | 2 | -3/+2 |
| | | |||||
| * | texture_cache: Query MemoryManager from the system | Fernando Sahmkow | 2019-06-25 | 5 | -20/+7 |
| | | |||||
| * | texture_cache: Include "core/core.h" | ReinUsesLisp | 2019-06-24 | 1 | -4/+1 |
| | | |||||
| * | gl_texture_cache: Explicitly add indirect include | ReinUsesLisp | 2019-06-24 | 1 | -0/+1 |
| | | |||||
| * | texture_cache/surface_view: Address feedback | ReinUsesLisp | 2019-06-24 | 1 | -1/+0 |
| | | |||||
| * | texture_cache/surface_base: Address feedback | ReinUsesLisp | 2019-06-24 | 2 | -2/+10 |
| | | |||||
| * | video_core/surface: Address feedback | ReinUsesLisp | 2019-06-24 | 1 | -2/+2 |
| | | |||||
| * | decode/texture: Address feedback | ReinUsesLisp | 2019-06-24 | 1 | -0/+1 |
| | | |||||
| * | renderer_opengl/utils: Remove unused includes and unused forward declaration | ReinUsesLisp | 2019-06-24 | 1 | -4/+0 |
| | | |||||
| * | gl_texture_cache: Address some feedback | ReinUsesLisp | 2019-06-24 | 1 | -2/+4 |
| | | |||||
| * | gl_shader_disk_cache: Address feedback | ReinUsesLisp | 2019-06-24 | 2 | -4/+8 |
| | | |||||
| * | gl_shader_decompiler: Address feedback | ReinUsesLisp | 2019-06-24 | 1 | -11/+12 |
| | | |||||
| * | shader_bytecode: Include missing <array> | ReinUsesLisp | 2019-06-24 | 1 | -0/+1 |
| | | |||||
| * | texture_cache: Style and Corrections | Fernando Sahmkow | 2019-06-21 | 7 | -71/+75 |
| | | |||||
| * | shader_cache: Correct versioning and size calculation. | Fernando Sahmkow | 2019-06-21 | 2 | -2/+7 |
| | | |||||
| * | texture_cache: Eliminate linear textures fallthrough | Fernando Sahmkow | 2019-06-21 | 1 | -4/+0 |
| | | |||||
| * | texture_cache: Correct format R16U as sibling | Fernando Sahmkow | 2019-06-21 | 2 | -1/+2 |
| | | |||||
| * | texture_cache: Implement texception detection and texture barriers. | Fernando Sahmkow | 2019-06-21 | 2 | -7/+40 |
| | | |||||
| * | texture_cache: Corrections to buffers and shadow formats use. | Fernando Sahmkow | 2019-06-21 | 1 | -10/+34 |
| | | |||||
| * | texture_cache: Implement Irregular Views in surfaces | Fernando Sahmkow | 2019-06-21 | 2 | -4/+24 |
| | | |||||
| * | surface: Correct format S8Z24 | Fernando Sahmkow | 2019-06-21 | 4 | -9/+5 |
| | | |||||
| * | texture_cache: Initialize all siblings to invalid pixel format. | Fernando Sahmkow | 2019-06-21 | 1 | -6/+15 |
| | | |||||
| * | gl_texture_cache: Use Stream Buffers instead of Persistant for Buffer Copies. | Fernando Sahmkow | 2019-06-21 | 3 | -5/+4 |
| | | |||||
| * | gl_texture_cache: Correct Image Blit | Fernando Sahmkow | 2019-06-21 | 1 | -1/+1 |
| | | |||||
| * | decoders: correct block calculation | Fernando Sahmkow | 2019-06-21 | 7 | -29/+41 |
| | | |||||
| * | texture_cache: Use siblings textures on Rebuild and fix possible error on blitting | Fernando Sahmkow | 2019-06-21 | 2 | -11/+24 |
| | | |||||
| * | texture_cache: Remove old rasterizer cache | Fernando Sahmkow | 2019-06-21 | 2 | -1956/+0 |
| | | |||||
| * | texture_cache: Implement siblings texture formats. | Fernando Sahmkow | 2019-06-21 | 2 | -12/+31 |
| | | |||||
| * | fermi2d: Correct Origin Mode | Fernando Sahmkow | 2019-06-21 | 1 | -5/+10 |
| | | |||||
| * | texture_cache: correct texture buffer on surface params | Fernando Sahmkow | 2019-06-21 | 1 | -4/+11 |
| | | |||||
| * | texture_cache: eliminate accelerated depth->color/color->depth copies due to driver instability. | Fernando Sahmkow | 2019-06-21 | 4 | -22/+6 |
| | | |||||
| * | texture_cache: correct mutex locks | Fernando Sahmkow | 2019-06-21 | 1 | -4/+4 |
| | | |||||
| * | shader_ir: Fix image copy rebase issues | Fernando Sahmkow | 2019-06-21 | 1 | -2/+7 |
| | | |||||
| * | texture_cache: Don't Image Copy if component types differ | Fernando Sahmkow | 2019-06-21 | 1 | -1/+2 |
| | | |||||
| * | texture_cache: move some large methods to cpp files | Fernando Sahmkow | 2019-06-21 | 4 | -139/+135 |
| | | |||||
| * | texture_cache: Optimize GetSurface and use references on functions that don't change a surface. | Fernando Sahmkow | 2019-06-21 | 3 | -12/+12 |
| | | |||||
| * | texture_cache: Implement Buffer Copy and detect Turing GPUs Image Copies | Fernando Sahmkow | 2019-06-21 | 8 | -12/+148 |
| | | |||||
| * | texture_cache uncompress-compress is untopological. | Fernando Sahmkow | 2019-06-21 | 5 | -19/+53 |
| | | | | | | | | | | | | This makes conflicts between non compress and compress textures to be auto recycled. It also limits the amount of mipmaps a texture can have if it goes above it's limit. | ||||
| * | texture_cache: Correct copying between compressed and uncompressed formats | Fernando Sahmkow | 2019-06-21 | 3 | -10/+27 |
| | | |||||
| * | texture_cache: Only load on recycle with accurate GPU. | Fernando Sahmkow | 2019-06-21 | 1 | -2/+3 |
| | | | | | | | | | | Testing so far has proven this to be quite safe as texture memory read added a 2-5ms load to the current cache. | ||||
| * | Fix rebase errors | Fernando Sahmkow | 2019-06-21 | 3 | -3/+13 |
| | | |||||
| * | texture_cache: Handle uncontinuous surfaces. | Fernando Sahmkow | 2019-06-21 | 4 | -21/+82 |
| | | |||||
| * | texture_cache: return null surface on invalid address | Fernando Sahmkow | 2019-06-21 | 1 | -0/+12 |
| | | |||||
| * | texture_cache: Add checks for texture buffers. | Fernando Sahmkow | 2019-06-21 | 1 | -2/+16 |
| | | |||||
| * | texture_cache: Fermi2D reform and implement View Mirage | Fernando Sahmkow | 2019-06-21 | 11 | -77/+125 |
| | | | | | | | | | | This also does some fixes on compressed textures reinterpret and on the Fermi2D engine in general. | ||||
| * | gl_shader_decompiler: Implement image binding settings | ReinUsesLisp | 2019-06-21 | 5 | -24/+52 |
| | | |||||
| * | shader: Implement bindless images | ReinUsesLisp | 2019-06-21 | 3 | -2/+40 |
| | | |||||
| * | shader: Decode SUST and implement backing image functionality | ReinUsesLisp | 2019-06-21 | 8 | -3/+282 |
| | | |||||
| * | gl_rasterizer: Track texture buffer usage | ReinUsesLisp | 2019-06-21 | 6 | -74/+119 |
| | | |||||
| * | video_core: Make ARB_buffer_storage a required extension | ReinUsesLisp | 2019-06-21 | 3 | -8/+5 |
| | | |||||
| * | gl_rasterizer_cache: Use texture buffers to emulate texture buffers | ReinUsesLisp | 2019-06-21 | 5 | -11/+35 |
| | | |||||
| * | maxwell_3d: Partially implement texture buffers as 1D textures | ReinUsesLisp | 2019-06-21 | 4 | -10/+24 |
| | | |||||
| * | gl_shader_decompiler: Allow 1D textures to be texture buffers | ReinUsesLisp | 2019-06-21 | 1 | -4/+38 |
| | | |||||
| * | shader: Implement texture buffers | ReinUsesLisp | 2019-06-21 | 3 | -0/+62 |
| | | |||||
| * | texture_cache: loose TryReconstructSurface when accurate GPU is not on. | Fernando Sahmkow | 2019-06-21 | 3 | -4/+20 |
| | | | | | | | | Also corrects some asserts. | ||||
| * | texture_cache: Document the most important methods. | Fernando Sahmkow | 2019-06-21 | 1 | -8/+87 |
| | | |||||
| * | texture_cache: Try to Reconstruct Surface on bigger than overlap. | Fernando Sahmkow | 2019-06-21 | 1 | -4/+11 |
| | | | | | | | | | | This fixes clouds in SMO Cap Kingdom and lens on Cloud Kingdom. Also moved accurate_gpu setting check to Pick Strategy | ||||
| * | texture_cache: Implement Guard mechanism | Fernando Sahmkow | 2019-06-21 | 2 | -1/+12 |
| | | |||||
| * | texture_cache: General Fixes | Fernando Sahmkow | 2019-06-21 | 8 | -47/+170 |
| | | | | | | | | | | | | | | Fixed ASTC mipmaps loading Fixed alignment on openGL upload/download Fixed Block Height Calculation Removed unalign_height | ||||
| * | surface_params: Ensure pitch is always written to avoid surface leaks | ReinUsesLisp | 2019-06-21 | 1 | -0/+2 |
| | | |||||
| * | gl_framebuffer_cache: Use a hashed struct to cache framebuffers | ReinUsesLisp | 2019-06-21 | 6 | -62/+148 |
| | | |||||
| * | texture_cache return invalid buffer on deactivated color_mask | Fernando Sahmkow | 2019-06-21 | 2 | -2/+9 |
| | | |||||
| * | engine_upload: Addapt to new Texture Cache | Fernando Sahmkow | 2019-06-21 | 2 | -5/+5 |
| | | |||||
| * | surface_params: Optimize CreateForTexture | ReinUsesLisp | 2019-06-21 | 2 | -72/+76 |
| | | | | | | | | | | Instead of using Common::AlignUp, use Common::AlignBits to align the texture compression factor. | ||||
| * | gl_texture_cache: Make main views be proxy textures instead of a full view. | Fernando Sahmkow | 2019-06-21 | 2 | -11/+25 |
| | | |||||
| * | texture_cache: Add ASync Protections | Fernando Sahmkow | 2019-06-21 | 1 | -0/+10 |
| | | |||||
| * | Remove Framebuffer reconfiguration and restrict rendertarget protection | Fernando Sahmkow | 2019-06-21 | 4 | -39/+27 |
| | | |||||
| * | texture_cache: Implement GPU Dirty Flags | Fernando Sahmkow | 2019-06-21 | 1 | -15/+22 |
| | | |||||
| * | texture_cache: Optimize GetMipBlockHeight and GetMipBlockDepth | Fernando Sahmkow | 2019-06-21 | 1 | -13/+6 |
| | | |||||
| * | texture_cache: Implement L1_Inner_cache | Fernando Sahmkow | 2019-06-21 | 1 | -13/+30 |
| | | |||||
| * | video_core: Use un-shifted block sizes to avoid integer divisions | ReinUsesLisp | 2019-06-21 | 9 | -60/+73 |
| | | | | | | | | | | | | | | | | | | | | | | | | Instead of storing all block width, height and depths in their shifted form: block_width = 1U << block_shift; Store them like they are provided by the emulated hardware (their block_shift form). This way we can avoid doing the costly Common::AlignUp operation to align texture sizes and drop CPU integer divisions with bitwise logic (defined in Common::AlignBits). | ||||
| * | texture_cache: Change internal cache from lists to vectors | ReinUsesLisp | 2019-06-21 | 1 | -6/+7 |
| | | |||||
| * | Reduce amount of size calculations. | Fernando Sahmkow | 2019-06-21 | 7 | -88/+86 |
| | | |||||
| * | texture_cache: Correct premature texceptions | Fernando Sahmkow | 2019-06-21 | 4 | -14/+51 |
| | | | | | | | | | | | | | | | | | | | | | | | | Due to our current infrastructure, it is possible for a mipmap to be set on as a render target before a texception of that mipmap's superset be set afterwards. This is problematic as we rely on texture views to set up texceptions and protecting render targets targets for 3D texture rendering. One simple solution is to configure framebuffers after texture setup but this brings other problems. This solution, forces a reconfiguration of the framebuffers after such event happens. | ||||
| * | texture_cache: Implement guest flushing | Fernando Sahmkow | 2019-06-21 | 3 | -10/+29 |
| | | |||||
| * | Fixes to mipmap's process and reconstruct process | Fernando Sahmkow | 2019-06-21 | 2 | -3/+3 |
| | | |||||
| * | surface_base: Add parenthesis to EmplaceOverview's predicate | ReinUsesLisp | 2019-06-21 | 1 | -3/+2 |
| | | |||||
| * | Texture Cache: Implement Blitting and Fermi Copies | Fernando Sahmkow | 2019-06-21 | 7 | -100/+93 |
| | | |||||
| * | surface_view: Add constructor for ViewParams | ReinUsesLisp | 2019-06-21 | 3 | -39/+23 |
| | | |||||
| * | surface_base: Split BreakDown into layered and non-layered variants | ReinUsesLisp | 2019-06-21 | 1 | -45/+48 |
| | | |||||
| * | surface_base: Silence truncation warnings and minor renames and reordering | ReinUsesLisp | 2019-06-21 | 2 | -32/+37 |
| | | |||||
| * | copy_params: Use constructor instead of C-like initialization | ReinUsesLisp | 2019-06-21 | 3 | -47/+39 |
| | | |||||
| * | Correct Mipmaps View method in Texture Cache | Fernando Sahmkow | 2019-06-21 | 3 | -32/+29 |
| | | |||||
| * | Change texture_cache chaching from GPUAddr to CacheAddr | Fernando Sahmkow | 2019-06-21 | 7 | -101/+60 |
| | | | | | | | | | | This also reverses the changes to make invalidation and flushing through the GPU address. | ||||
| * | Corrections to Structural Matching | Fernando Sahmkow | 2019-06-21 | 2 | -24/+53 |
| | | | | | | | | | | The texture will now be reconstructed if the width only matches on GoB alignment. | ||||
| * | Implement Texture Cache V2 | Fernando Sahmkow | 2019-06-21 | 6 | -381/+568 |
| | | |||||
| * | Correct Surface Base and Views for new Texture Cache | Fernando Sahmkow | 2019-06-21 | 7 | -380/+466 |
| | | |||||
| * | Add OGLTextureView | Fernando Sahmkow | 2019-06-21 | 2 | -0/+43 |
| | | |||||
| * | Deglobalize Memory Manager on texture cahe and Implement Invalidation and Flushing using GPUVAddr | Fernando Sahmkow | 2019-06-21 | 4 | -1/+20 |
| | | |||||
| * | texture_cache: Remove execution context copies from the texture cache | ReinUsesLisp | 2019-06-21 | 7 | -168/+59 |
| | | | | | | | | | | This is done to simplify the OpenGL implementation, it is needed for Vulkan. | ||||
| * | gl_texture_cache: Implement fermi copies | ReinUsesLisp | 2019-06-21 | 5 | -2/+105 |
| | | |||||
| * | texture_cache: Split texture cache into different files | ReinUsesLisp | 2019-06-21 | 12 | -876/+965 |
| | | |||||
| * | texture_cache: Move staging buffer into a generic implementation | ReinUsesLisp | 2019-06-21 | 4 | -181/+211 |
| | | |||||
| * | texture_cache: Flush 3D textures in the order they are drawn | ReinUsesLisp | 2019-06-21 | 5 | -19/+44 |
| | | |||||
| * | gl_texture_cache: Minor changes | ReinUsesLisp | 2019-06-21 | 5 | -140/+185 |
| | | |||||
| * | gl_texture_cache: Add copy from multiple overlaps into a single surface | ReinUsesLisp | 2019-06-21 | 3 | -6/+84 |
| | | |||||
| * | gl_texture_cache: Attach surface textures instead of views | ReinUsesLisp | 2019-06-21 | 3 | -20/+32 |
| | | |||||
| * | gl_texture_cache: Add fast copy path | ReinUsesLisp | 2019-06-21 | 4 | -7/+60 |
| | | |||||
| * | gl_texture_cache: Initial implementation | ReinUsesLisp | 2019-06-21 | 9 | -47/+809 |
| | | |||||
* | | gl_shader_cache: Make CachedShader constructor private | Zach Hilman | 2019-07-04 | 2 | -5/+5 |
| | | | | | | | | Fixes missing review comments introduced. | ||||
* | | Merge pull request #2563 from ReinUsesLisp/shader-initializers | Zach Hilman | 2019-07-04 | 2 | -52/+53 |
|\ \ | | | | | | | gl_shader_cache: Use static constructors for CachedShader initialization | ||||
| * | | gl_shader_cache: Use static constructors for CachedShader initialization | ReinUsesLisp | 2019-06-08 | 2 | -52/+53 |
| | | | |||||
* | | | rasterizer_cache: Protect inherited caches from submission level | Fernando Sahmkow | 2019-07-01 | 3 | -1/+5 |
| | | | |||||
* | | | Merge pull request #2579 from ReinUsesLisp/fix-aoffi-test | bunnei | 2019-06-21 | 1 | -1/+2 |
|\ \ \ | |_|/ |/| | | gl_device: Fix TestVariableAoffi test | ||||
| * | | gl_device: Fix TestVariableAoffi test | ReinUsesLisp | 2019-06-12 | 1 | -1/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This test is intended to be invalid GLSL, but it was being invalid in two points instead of one. The intention is to use a non-immediate parameter in a textureOffset like function. The problem is that this shader was being compiled as a separable shader object and the text was writting to gl_Position without a redeclaration, being invalid GLSL. Address that issue by using a user-defined output attribute. | ||||
* | | | Merge pull request #2591 from lioncash/record | bunnei | 2019-06-20 | 1 | -1/+0 |
|\ \ \ | | | | | | | | | core: Remove unused CiTrace source files | ||||
| * | | | core: Remove unused CiTrace source files | Lioncash | 2019-06-18 | 1 | -1/+0 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | These source files have been unused for the entire lifecycle of the project. They're a hold-over from Citra and only add to the build time of the project, so they can be removed. There's also likely no way this would ever work in yuzu in its current form without revamping quite a bit of it, given how different the GPU on the Switch is compared to the 3DS. | ||||
* | | | | Merge pull request #2562 from ReinUsesLisp/split-cbuf-upload | bunnei | 2019-06-18 | 6 | -56/+69 |
|\ \ \ \ | | | | | | | | | | | video_core/engines: Move ConstBufferInfo out of Maxwell3D | ||||
| * | | | | gl_rasterizer: Remove unused parameters in descriptor uploads | ReinUsesLisp | 2019-06-08 | 2 | -8/+6 |
| | | | | | |||||
| * | | | | video_core/engines: Move ConstBufferInfo out of Maxwell3D | ReinUsesLisp | 2019-06-08 | 6 | -49/+64 |
| | |_|/ | |/| | | |||||
* | | | | Merge pull request #2538 from ReinUsesLisp/ssy-pbk | Zach Hilman | 2019-06-16 | 4 | -27/+78 |
|\ \ \ \ | |_|_|/ |/| | | | shader: Split SSY and PBK stack | ||||
| * | | | shader: Split SSY and PBK stack | ReinUsesLisp | 2019-06-07 | 4 | -27/+78 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Hardware testing revealed that SSY and PBK push to a different stack, allowing code like this: SSY label1; PBK label2; SYNC; label1: PBK; label2: EXIT; | ||||
* | | | | Merge pull request #2572 from FernandoS27/gpu-mem | bunnei | 2019-06-12 | 1 | -2/+2 |
|\ \ \ \ | |_|_|/ |/| | | | GPUVM: Correct GPU VM virtual address space | ||||
| * | | | GPUVM: Correct GPU VM virtual address space | Fernando Sahmkow | 2019-06-09 | 1 | -2/+2 |
| | |/ | |/| | |||||
* | | | kepler_compute: Use std::array for cbuf info | ReinUsesLisp | 2019-06-08 | 1 | -2/+3 |
| | | | |||||
* | | | kepler_compute: Fix block_dim_x encoding | ReinUsesLisp | 2019-06-08 | 1 | -1/+1 |
|/ / | |||||
* | | Merge pull request #2514 from ReinUsesLisp/opengl-compat | Zach Hilman | 2019-06-07 | 16 | -223/+42 |
|\ \ | |/ |/| | video_core: Drop OpenGL core in favor of OpenGL compatibility | ||||
| * | gl_buffer_cache: Remove unused ReserveMemory method | ReinUsesLisp | 2019-05-30 | 2 | -13/+0 |
| | | |||||
| * | maxwell_to_gl: Use GL_CLAMP to emulate Clamp wrap mode | ReinUsesLisp | 2019-05-30 | 3 | -7/+4 |
| | | |||||
| * | gl_rasterizer: Move alpha testing to the OpenGL pipeline | ReinUsesLisp | 2019-05-30 | 8 | -71/+33 |
| | | | | | | | | Removes the alpha testing code from each fragment shader invocation. | ||||
| * | gl_rasterizer: Use GL_QUADS to emulate quads rendering | ReinUsesLisp | 2019-05-30 | 6 | -132/+5 |
| | | |||||
* | | shader/node: Minor changes | ReinUsesLisp | 2019-06-07 | 1 | -50/+54 |
| | | | | | | | | | | | | | | Reflect std::shared_ptr nature of Node on initializers and remove constant members in nodes. Add some commentaries. | ||||
* | | shader: Move Node declarations out of the shader IR header | ReinUsesLisp | 2019-06-07 | 4 | -493/+518 |
| | | | | | | | | | | | | Analysis passes do not have a good reason to depend on shader_ir.h to work on top of nodes. This splits node-related declarations to their own file and leaves the IR in shader_ir.h | ||||
* | | shader: Use shared_ptr to store nodes and move initialization to file | ReinUsesLisp | 2019-06-06 | 35 | -248/+296 |
| | | | | | | | | | | | | | | | | | | Instead of having a vector of unique_ptr stored in a vector and returning star pointers to this, use shared_ptr. While changing initialization code, move it to a separate file when possible. This is a first step to allow code analysis and node generation beyond the ShaderIR class. | ||||
* | | Merge pull request #2520 from ReinUsesLisp/vulkan-refresh | bunnei | 2019-06-06 | 4 | -88/+218 |
|\ \ | | | | | | | vk_device,vk_shader_decompiler: Miscellaneous changes | ||||
| * | | vk_device: Let formats array type be deduced | ReinUsesLisp | 2019-05-26 | 1 | -33/+33 |
| | | | |||||
| * | | vk_shader_decompiler: Misc fixes | ReinUsesLisp | 2019-05-26 | 2 | -45/+67 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Fix missing OpSelectionMerge instruction. This caused devices loses on most hardware, Intel didn't care. Fix [-1;1] -> [0;1] depth conversions. Conditionally use VK_EXT_scalar_block_layout. This allows us to use non-std140 layouts on UBOs. Update external Vulkan headers. | ||||
| * | | vk_device: Enable features when available and misc changes | ReinUsesLisp | 2019-05-26 | 2 | -43/+151 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Keeps track of native ASTC support, VK_EXT_scalar_block_layout availability and SSBO range. Check for independentBlend and vertexPipelineStorageAndAtomics as a required feature. Always enable it. Use vk::to_string format to log Vulkan enums. Style changes. | ||||
* | | | Merge pull request #2540 from ReinUsesLisp/remove-guest-position | bunnei | 2019-06-06 | 2 | -36/+21 |
|\ \ \ | | | | | | | | | gl_shader_decompiler: Remove guest "position" varying | ||||
| * | | | gl_shader_decompiler: Remove guest "position" varying | ReinUsesLisp | 2019-06-03 | 2 | -36/+21 |
| | |/ | |/| | | | | | | | | | | | | | | | | | | | "position" was being written but not read anywhere besides geometry shaders, where it had the same value as gl_Position. This commit replaces "position" with gl_Position, reducing the complexity of our code and the emitted GLSL code. | ||||
* | | | Merge pull request #2512 from ReinUsesLisp/comp-indexing | bunnei | 2019-06-06 | 3 | -3/+80 |
|\ \ \ | | | | | | | | | gl_shader_decompiler: Pessimize uniform buffer access on AMD's prorpietary driver | ||||
| * | | | gl_shader_decompiler: Use an if based cbuf indexing for broken drivers | ReinUsesLisp | 2019-05-24 | 1 | -3/+20 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The following code is broken on AMD's proprietary GLSL compiler: ```glsl uint idx = ...; vec4 values = ...; float some_value = values[idx & 3]; ``` It index the wrong components, to fix this the following pessimized code is emitted when that bug is present: ```glsl uint idx = ...; vec4 values = ...; float some_value; if ((idx & 3) == 0) some_value = values.x; if ((idx & 3) == 1) some_value = values.y; if ((idx & 3) == 2) some_value = values.z; if ((idx & 3) == 3) some_value = values.w; ``` | ||||
| * | | | gl_device: Add test to detect broken component indexing | ReinUsesLisp | 2019-05-24 | 2 | -0/+60 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Component indexing on AMD's proprietary driver is broken. This commit adds a test to detect when we are on a driver that can't successfully manage component indexing. It dispatches a dummy draw with just one vertex shader that writes to an indexed SSBO from the GPU with data sent through uniforms, it then reads that data from the CPU and compares the expected output. | ||||
* | | | | Merge pull request #2545 from lioncash/timing | Zach Hilman | 2019-06-05 | 1 | -1/+1 |
|\ \ \ \ | | | | | | | | | | | core/core_timing_util: Use std::chrono types for specifying time units | ||||
| * | | | | core/core_timing_util: Use std::chrono types for specifying time units | Lioncash | 2019-06-05 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | Makes the interface more type-safe and consistent in terms of return values. | ||||
* | | | | | Merge pull request #2534 from ReinUsesLisp/shader-cleanup | Zach Hilman | 2019-06-05 | 2 | -31/+36 |
|\ \ \ \ \ | |/ / / / |/| | | | | gl_shader_cache: Minor style changes | ||||
| * | | | | gl_shader_cache: Store a system class and drop global accessors | ReinUsesLisp | 2019-05-30 | 2 | -7/+9 |
| | | | | | |||||
| * | | | | gl_shader_cache: Add commentaries explaining the intention in shaders creation | ReinUsesLisp | 2019-05-30 | 1 | -0/+2 |
| | | | | | |||||
| * | | | | gl_shader_cache: Flip if condition in GetStageProgram to reduce indentation | ReinUsesLisp | 2019-05-30 | 1 | -25/+26 |
| | |/ / | |/| | | |||||
* / | | | shader_bytecode: Mark EXIT as flow instruction | Fernando Sahmkow | 2019-06-04 | 1 | -1/+1 |
|/ / / | |||||
* | | | Merge pull request #2446 from ReinUsesLisp/tid | bunnei | 2019-05-29 | 4 | -22/+76 |
|\ \ \ | | | | | | | | | shader: Implement S2R Tid{XYZ} and CtaId{XYZ} | ||||
| * | | | shader: Implement S2R Tid{XYZ} and CtaId{XYZ} | ReinUsesLisp | 2019-05-20 | 4 | -15/+69 |
| | | | | |||||
| * | | | gl_shader_decompiler: Make GetSwizzle constexpr | ReinUsesLisp | 2019-05-20 | 1 | -7/+7 |
| | | | | |||||
* | | | | gl_device: Add commentary to AOFFI unit test source code | ReinUsesLisp | 2019-05-27 | 1 | -0/+1 |
| | | | | | | | | | | | | | | | | | | | | The intention behind this commit is to hint someone inspecting an apitrace dump to ignore this ill-formed GLSL code. | ||||
* | | | | gl_shader_gen: Always declare extensions after the version declaration | ReinUsesLisp | 2019-05-27 | 2 | -7/+5 |
| |_|/ |/| | | | | | | | | | | | This addresses a bug on geometry shaders where code was being written before all #extension declarations were done. Ref to #2523 | ||||
* | | | renderer_opengl/utils: Use a std::string_view with LabelGLObject() | Lioncash | 2019-05-25 | 2 | -10/+10 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Uses a std::string_view instead of a std::string, given the pointed to string isn't modified and is only used in a formatting operation. This is nice because a few usages directly supply a string literal to the function, allowing these usages to otherwise not heap allocate, unlike the std::string overloads. While we're at it, we can combine the address formatting into a single formatting call. | ||||
* | | | Merge pull request #2358 from ReinUsesLisp/parallel-shader | bunnei | 2019-05-25 | 8 | -53/+113 |
|\ \ \ | | | | | | | | | gl_shader_cache: Use shared contexts to build shaders in parallel at boot | ||||
| * | | | gl_shader_cache: Fix clang strict standard build issues | ReinUsesLisp | 2019-05-21 | 3 | -9/+13 |
| | | | | |||||
| * | | | gl_shader_cache: Use shared contexts to build shaders in parallel | ReinUsesLisp | 2019-05-21 | 6 | -47/+103 |
| | | | | |||||
* | | | | Merge pull request #2485 from ReinUsesLisp/generic-memory | bunnei | 2019-05-25 | 3 | -35/+73 |
|\ \ \ \ | |_|_|/ |/| | | | shader/memory: Implement generic memory stores and loads (ST and LD) | ||||
| * | | | shader/memory: Implement ST (generic memory) | ReinUsesLisp | 2019-05-21 | 2 | -21/+36 |
| | | | | |||||
| * | | | shader/memory: Implement LD (generic memory) | ReinUsesLisp | 2019-05-21 | 3 | -15/+38 |
| |/ / | |||||
* | | | shader/shader_ir: Make Comment() take a std::string by value | Lioncash | 2019-05-23 | 2 | -3/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This allows for forming comment nodes without making unnecessary copies of the std::string instance. e.g. previously: Comment(fmt::format("Base address is c[0x{:x}][0x{:x}]", cbuf->GetIndex(), cbuf_offset)); Would result in a copy of the string being created, as CommentNode() takes a std::string by value (a const ref passed to a value parameter results in a copy). Now, only one instance of the string is ever moved around. (fmt::format returns a std::string, and since it's returned from a function by value, this is a prvalue (which can be treated like an rvalue), so it's moved into Comment's string parameter), we then move it into the CommentNode constructor, which then moves the string into its member variable). | ||||
* | | | shader/decode/*: Add missing newline to files lacking them | Lioncash | 2019-05-23 | 18 | -18/+18 |
| | | | | | | | | | | | | Keeps the shader code file endings consistent. | ||||
* | | | shader/decode/*: Eliminate indirect inclusions | Lioncash | 2019-05-23 | 6 | -1/+5 |
| | | | | | | | | | | | | | | | | | | | | | Amends cases where we were using things that were indirectly being satisfied through other headers. This way, if those headers change and eliminate dependencies on other headers in the future, we don't have cascading compilation errors. | ||||
* | | | shader/decode/memory: Remove left in debug pragma | Lioncash | 2019-05-22 | 1 | -2/+0 |
| | | | |||||
* | | | renderer_opengl/gl_shader_decompiler: Remove redundant name specification in format string | Lioncash | 2019-05-21 | 1 | -1/+1 |
|/ / | | | | | | | This accidentally slipped through a rebase. | ||||
* | | Merge pull request #2494 from lioncash/shader-text | bunnei | 2019-05-21 | 1 | -181/+195 |
|\ \ | |/ |/| | gl_shader_decompiler: Add AddLine() overloads with single function that forwards to libfmt | ||||
| * | gl_shader_decompiler: Tidy up minor remaining cases of unnecessary std::string concatenation | Lioncash | 2019-05-20 | 1 | -21/+20 |
| | | |||||
| * | gl_shader_decompiler: Replace individual overloads with the fmt-based one | Lioncash | 2019-05-20 | 1 | -28/+16 |
| | | | | | | | | | | | | | | | | | | | | Gets rid of the need to special-case brace handling depending on the overload used, and makes it consistent across the board with how fmt handles them. Strings with compile-time deducible strings are directly forwarded to std::string's constructor, so we don't need to worry about the performance difference here, as it'll be identical. | ||||
| * | gl_shader_decompiler: Utilize fmt overload of AddLine() where applicable | Lioncash | 2019-05-20 | 1 | -136/+152 |
| | | |||||
| * | gl_shader_decompiler: Add AddLine() overload that forwards to fmt | Lioncash | 2019-05-19 | 1 | -0/+11 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | In a lot of places throughout the decompiler, string concatenation via operator+ is used quite heavily. This is usually fine, when not heavily used, but when used extensively, can be a problem. operator+ creates an entirely new heap allocated temporary string and given we perform expressions like: std::string thing = a + b + c + d; this ends up with a lot of unnecessary temporary strings being created and discarded, which kind of thrashes the heap more than we need to. Given we utilize fmt in some AddLine calls, we can make this a part of the ShaderWriter's API. We can make an overload that simply acts as a passthrough to fmt. This way, whenever things need to be appended to a string, the operation can be done via a single string formatting operation instead of discarding numerous temporary strings. This also has the benefit of making the strings themselves look nicer and makes it easier to spot errors in them. | ||||
* | | Revert #2466 | Fernando Sahmkow | 2019-05-19 | 1 | -1/+3 |
|/ | | | | | | This reverts a tested behavior on delay slots not exiting if the exit flag is set. Currently new tests are required in order to ensure this behavior. | ||||
* | Merge pull request #2441 from ReinUsesLisp/al2p | bunnei | 2019-05-19 | 10 | -157/+310 |
|\ | | | | | shader: Implement AL2P and ALD.PHYS | ||||
| * | shader_ir/other: Implement IPA.IDX | ReinUsesLisp | 2019-05-03 | 2 | -5/+9 |
| | | |||||
| * | gl_shader_decompiler: Skip physical unused attributes | ReinUsesLisp | 2019-05-03 | 1 | -18/+27 |
| | | |||||
| * | shader_ir/memory: Assert on non-32 bits ALD.PHYS | ReinUsesLisp | 2019-05-03 | 1 | -0/+3 |
| | | |||||
| * | shader: Add physical attributes commentaries | ReinUsesLisp | 2019-05-03 | 4 | -4/+8 |
| | | |||||
| * | gl_shader_decompiler: Implement GLSL physical attributes | ReinUsesLisp | 2019-05-03 | 2 | -66/+101 |
| | | |||||
| * | shader_ir/memory: Implement physical input attributes | ReinUsesLisp | 2019-05-03 | 4 | -6/+32 |
| | | |||||
| * | gl_shader_decompiler: Abstract generic attribute operations | ReinUsesLisp | 2019-05-03 | 1 | -29/+26 |
| | | |||||
| * | gl_shader_decompiler: Declare all possible varyings on physical attribute usage | ReinUsesLisp | 2019-05-03 | 4 | -27/+88 |
| | | |||||
| * | shader: Remove unused AbufNode Ipa mode | ReinUsesLisp | 2019-05-03 | 6 | -35/+14 |
| | | |||||
| * | shader_ir/memory: Emit AL2P IR | ReinUsesLisp | 2019-05-03 | 2 | -0/+22 |
| | | |||||
| * | shader_bytecode: Add AL2P decoding | ReinUsesLisp | 2019-05-03 | 1 | -2/+15 |
| | | |||||
* | | Merge pull request #2491 from FernandoS27/dma-fix | Hexagon12 | 2019-05-19 | 1 | -0/+7 |
|\ \ | | | | | | | Dma_pusher: ASSERT on empty command_list | ||||
| * | | Dma_pusher: ASSERT on empty command_list | Fernando Sahmkow | 2019-05-19 | 1 | -0/+7 |
| | | | | | | | | | | | | | | | This is a measure to avoid crashes on command list reading as an empty command_list is considered a NOP. | ||||
* | | | Merge pull request #2452 from FernandoS27/raster-cache-fix | Hexagon12 | 2019-05-19 | 1 | -1/+2 |
|\ \ \ | | | | | | | | | Correct possible error on Rasterizer Caches | ||||
| * | | | Correct possible error on Rasterizer Caches | Fernando Sahmkow | 2019-05-07 | 1 | -1/+2 |
| | | | | | | | | | | | | | | | | | | | | There was a weird bug that could happen if the object died directly and the cache address wasn't stored. | ||||
* | | | | Merge pull request #2497 from lioncash/shader-ir | Hexagon12 | 2019-05-19 | 3 | -32/+28 |
|\ \ \ \ | | | | | | | | | | | shader/shader_ir: Minor changes | ||||
| * | | | | shader/shader_ir: Remove unnecessary inline specifiers | Lioncash | 2019-05-19 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | constexpr internally links by default, so the inline specifier is unnecessary. | ||||
| * | | | | shader/shader_ir: Simplify constructors for OperationNode | Lioncash | 2019-05-19 | 1 | -15/+6 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Many of these constructors don't even need to be templated. The only ones that need to be templated are the ones that actually make use of the parameter pack. Even then, since std::vector accepts an initializer list, we can supply the parameter pack directly to it instead of creating our own copy of the list, then copying it again into the std::vector. | ||||
| * | | | | shader/shader_ir: Remove unnecessary template parameter packs from Operation() overloads where applicable | Lioncash | 2019-05-19 | 1 | -2/+0 |
| | | | | | | | | | | | | | | | | | | | | | | | | | These overloads don't actually make use of the parameter pack, so they can be turned into regular non-template function overloads. | ||||
| * | | | | shader/shader_ir: Mark tracking functions as const member functions | Lioncash | 2019-05-19 | 2 | -8/+11 |
| | | | | | | | | | | | | | | | | | | | | | | | | | These don't actually modify instance state, so they can be marked as const member functions | ||||
| * | | | | shader/shader_ir: Place implementations of constructor and destructor in cpp file | Lioncash | 2019-05-19 | 2 | -5/+9 |
| | |/ / | |/| | | | | | | | | | | | | | | | | | | Given the class contains quite a lot of non-trivial types, place the constructor and destructor within the cpp file to avoid inlining construction and destruction code everywhere the class is used. | ||||
* | | | | Merge pull request #2495 from lioncash/cache | Hexagon12 | 2019-05-19 | 2 | -34/+48 |
|\ \ \ \ | | | | | | | | | | | gl_shader_disk_cache: Minor cleanup | ||||
| * | | | | gl_shader_disk_cache: in-class initialize virtual file offset of ShaderDiskCacheOpenGL | Lioncash | 2019-05-19 | 2 | -5/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Given the offset is assigned a fixed value in the constructor, we can just assign it directly and get rid of the need to write the name of the variable again in the constructor initializer list. | ||||
| * | | | | gl_shader_disk_cache: Default ShaderDiskCacheOpenGL's destructor in the cpp file | Lioncash | 2019-05-19 | 2 | -0/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Given the disk shader cache contains non-trivial types, we should default it in the cpp file in order to prevent inlining of the complex destruction logic. | ||||
| * | | | | gl_shader_disk_cache: Make hash specializations noexcept | Lioncash | 2019-05-19 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The standard library expects hash specializations that don't throw exceptions. Make this explicit in the type to allow selection of better code paths if possible in implementations. | ||||
| * | | | | gl_shader_disk_cache: Remove redundant code string construction in LoadDecompiledEntry() | Lioncash | 2019-05-19 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | We don't need to load the code into a vector and then construct a string over the data. We can just create a string with the necessary size ahead of time, and read the data directly into it, getting rid of an unnecessary heap allocation. | ||||
| * | | | | gl_shader_disk_cache: Make variable non-const in decompiled entry case | Lioncash | 2019-05-19 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | std::move does nothing when applied to a const variable. Resources can't be moved if the object is immutable. With this change, we don't end up making several unnecessary heap allocations and copies. | ||||
| * | | | | gl_shader_disk_cache: Special-case boolean handling | Lioncash | 2019-05-19 | 2 | -24/+37 |
| |/ / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Booleans don't have a guaranteed size, but we still want to have them integrate into the disk cache system without needing to actually use a different type. We can do this by supplying non-template overloads for the bool type. Non-template overloads always have precedence during function resolution, so this is safe to provide. This gets rid of the need to smatter ternary conditionals, as well as the need to use u8 types to store the value in. | ||||
* | | | | Merge pull request #2467 from lioncash/move | Hexagon12 | 2019-05-19 | 1 | -6/+0 |
|\ \ \ \ | | | | | | | | | | | video_core/gpu_thread: Remove redundant copy constructor for CommandDataContainer | ||||
| * | | | | video_core/gpu_thread: Remove redundant copy constructor for CommandDataContainer | Lioncash | 2019-05-14 | 1 | -6/+0 |
| |/ / / | | | | | | | | | | | | | | | | | | | | | | | | | std::move within a copy constructor (on a data member that isn't mutable) will always result in a copy. Because of that, the behavior of this copy constructor is identical to the one that would be generated automatically by the compiler, so we can remove it. | ||||
* | | | | Merge pull request #2466 from yuzu-emu/mme-exit-delay-slot | Hexagon12 | 2019-05-19 | 1 | -3/+3 |
|\ \ \ \ | | | | | | | | | | | GPU/MMEInterpreter: Ignore the 'exit' flag when it's executed inside a delay slot. | ||||
| * | | | | GPU/MMEInterpreter: Ignore the 'exit' flag when it's executed inside a delay slot. | Sebastian Valle | 2019-05-12 | 1 | -3/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | | It seems instructions marked with the 'exit' flag will not cause an exit when executed within a delay slot. This was hwtested by fincs. | ||||
* | | | | | Merge pull request #2468 from lioncash/deduction | Hexagon12 | 2019-05-19 | 2 | -2/+2 |
|\ \ \ \ \ | | | | | | | | | | | | | yuzu: Remove explicit types from locks where applicable | ||||
| * | | | | | yuzu: Remove explicit types from locks where applicable | Lioncash | 2019-05-14 | 2 | -2/+2 |
| | |/ / / | |/| | | | | | | | | | | | | | | | | | | With C++17's deduction guides, the type doesn't need to be explicitly specified within locking primitives anymore. | ||||
* | | | | | Merge pull request #2472 from FernandoS27/tic | Hexagon12 | 2019-05-19 | 1 | -1/+1 |
|\ \ \ \ \ | | | | | | | | | | | | | maxwell_3d: reduce severity of different component formats assert. | ||||
| * | | | | | maxwell_3d: reduce sevirity of different component formats assert. | Fernando Sahmkow | 2019-05-14 | 1 | -1/+1 |
| |/ / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This was reduced due to happening on most games and at such constant rate that it affected performance heavily for the end user. In general, we are well aware of the assert and an implementation is already planned. | ||||
* | | | | | Merge pull request #2469 from lioncash/copyable | Hexagon12 | 2019-05-19 | 1 | -0/+2 |
|\ \ \ \ \ | | | | | | | | | | | | | video_core/engines/maxwell_3d: Add is_trivially_copyable_v check for Regs | ||||
| * | | | | | video_core/engines/maxwell_3d: Add is_trivially_copyable_v check for Regs | Lioncash | 2019-05-14 | 1 | -0/+2 |
| |/ / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | std::memset is used to clear the entire register structure, which requires that the Regs struct be trivially copyable (otherwise undefined behavior is invoked). This prevents the case where a non-trivial type is potentially added to the struct. | ||||
* | | | | | Merge pull request #2470 from lioncash/ranged-for | Sebastian Valle | 2019-05-19 | 1 | -18/+18 |
|\ \ \ \ \ | | | | | | | | | | | | | video_core/engines/maxwell_3d: Simplify for loops into ranged for loops within InitializeRegisterDefaults() | ||||
| * | | | | | video_core/engines/maxwell3d: Get rid of three magic values in CallMethod() | Lioncash | 2019-05-14 | 1 | -3/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | We can use the named constant instead of using 32 directly. | ||||
| * | | | | | video_core/engines/maxwell_3d: Simplify for loops into ranged for loops within InitializeRegisterDefaults() | Lioncash | 2019-05-14 | 1 | -15/+15 |
| |/ / / / | | | | | | | | | | | | | | | | | | | | | | | | | | Lessens the amount of code that needs to be read, and gets rid of the need to introduce an indexing variable. Instead, we just operate on the objects directly. | ||||
* | | | | | Merge pull request #2480 from ReinUsesLisp/fix-quads | Hexagon12 | 2019-05-19 | 1 | -2/+2 |
|\ \ \ \ \ | | | | | | | | | | | | | gl_rasterizer: Pass the right number of array quad vertices count | ||||
| * | | | | | gl_rasterizer: Pass the right number of array quad vertices count | ReinUsesLisp | 2019-05-17 | 1 | -2/+2 |
| |/ / / / | |||||
* | | | | | Merge pull request #2483 from ReinUsesLisp/fix-point-size | Hexagon12 | 2019-05-19 | 1 | -1/+3 |
|\ \ \ \ \ | | | | | | | | | | | | | gl_rasterizer: Limit OpenGL point size to a minimum of 1 | ||||
| * | | | | | gl_rasterizer: Limit OpenGL point size to a minimum of 1 | ReinUsesLisp | 2019-05-18 | 1 | -1/+3 |
| |/ / / / | |||||
* | | | | | Merge pull request #2471 from lioncash/engine-upload | Sebastian Valle | 2019-05-19 | 2 | -6/+8 |
|\ \ \ \ \ | | | | | | | | | | | | | video_core/engines/engine_upload: Minor tidying | ||||
| * | | | | | video_core/engines/engine_upload: Amend constructor initializer list order | Lioncash | 2019-05-14 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | Silences a -Wreorder warning. | ||||
| * | | | | | video_core/engines/engine_upload: Default destructor in the cpp file | Lioncash | 2019-05-14 | 2 | -1/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Avoids inlining destruction logic where applicable, and also makes forward declarations not cause unexpected compilation errors depending on where the State class is used. | ||||
| * | | | | | video_core/engines/engine_upload: Remove unnecessary const on parameters in function declarations | Lioncash | 2019-05-14 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | These only apply in the definition of the function. They can be omitted from the declaration. | ||||
| * | | | | | video_core/engines/engine_upload: Remove unnecessary includes | Lioncash | 2019-05-14 | 2 | -2/+2 |
| |/ / / / | |||||
* | | | | | Merge pull request #2484 from ReinUsesLisp/triangle-fan | Sebastian Valle | 2019-05-19 | 1 | -0/+2 |
|\ \ \ \ \ | | | | | | | | | | | | | maxwell_to_gl: Add TriangleFan primitive topology | ||||
| * | | | | | maxwell_to_gl: Add TriangleFan primitive topology | ReinUsesLisp | 2019-05-18 | 1 | -0/+2 |
| |/ / / / | |||||
* / / / / | gl_shader_gen: std::move objects where applicable | Lioncash | 2019-05-19 | 1 | -7/+7 |
|/ / / / | | | | | | | | | | | | | | | | | | | | | Avoids performing copies into the pair being returned. Instead, we can just move the resources into the pair, avoiding the need to make copies of both the std::string and ShaderEntries struct. | ||||
* | | | | Merge pull request #2462 from lioncash/video-mm | Mat M | 2019-05-14 | 2 | -17/+20 |
|\ \ \ \ | | | | | | | | | | | video_core/memory_manager: Minor tidying | ||||
| * | | | | video_core/memory_manager: Mark IsBlockContinuous() as a const member function | Lioncash | 2019-05-10 | 2 | -4/+4 |
| | | | | | | | | | | | | | | | | | | | | | | | | | Corrects the typo in its name and marks the function as a const member function, given it doesn't actually modify memory manager state. | ||||
| * | | | | video_core/memory_manager: Mark the constructor as explicit | Lioncash | 2019-05-10 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | Prevents implicit converting constructions of the memory manager. | ||||
| * | | | | video_core/memory_manager: Default the destructor within the cpp file | Lioncash | 2019-05-10 | 2 | -0/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Makes the class less surprising when it comes to forward declaring the type, and also prevents inlining the destruction code of the class, given it contains non-trivial types. | ||||
| * | | | | video_core/memory_manager: Amend doxygen comments | Lioncash | 2019-05-10 | 1 | -7/+7 |
| | | | | | | | | | | | | | | | | | | | | Corrects references to non-existent parameters and corrects typos. | ||||
| * | | | | video_core/memory_manager: Remove superfluous const from function declarations | Lioncash | 2019-05-10 | 1 | -7/+7 |
| |/ / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | These are able to be omitted from the declaration of functions, since they don't do anything at the type system level. The definitions of the functions can retain the use of const though, since they make the variables immutable in the implementation of the function where they're used. | ||||
* | | | | Merge pull request #2461 from lioncash/unused-var | Mat M | 2019-05-14 | 7 | -22/+4 |
|\ \ \ \ | | | | | | | | | | | video_core: Remove a few unused variables and functions | ||||
| * | | | | video_core/renderer_opengl/gl_shader_cache: Correct member initialization order | Lioncash | 2019-05-10 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | Silences a -Wreorder warning. | ||||
| * | | | | video_core/shader/decode/texture: Remove unused variable from GetTld4Code() | Lioncash | 2019-05-10 | 1 | -1/+0 |
| | | | | | |||||
| * | | | | renderer_vulkan/vk_shader_decompiler: Remove unused variable from DeclareInternalFlags() | Lioncash | 2019-05-10 | 1 | -1/+0 |
| | | | | | |||||
| * | | | | video_core/renderer_opengl/gl_shader_decompiler: Remove unused Composite() function | Lioncash | 2019-05-10 | 1 | -11/+0 |
| | | | | | | | | | | | | | | | | | | | | This isn't used at all, so it can be removed. | ||||
| * | | | | video_core/renderer_opengl/gl_rasterizer_cache: Remove unused variable in UploadGLMipmapTexture() | Lioncash | 2019-05-10 | 1 | -1/+0 |
| | | | | | | | | | | | | | | | | | | | | This variable is unused entirely, so it can be removed. | ||||
| * | | | | video_core/gpu_thread: Remove unused local variable | Lioncash | 2019-05-10 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Instead of retrieving the data from the std::variant instance, we can just check if the variant contains that type of data. This is essentially the same behavior, only it returns a bool indicating whether or not the type in the variant is currently active, instead of actually retrieving the data. | ||||
| * | | | | video_core/textures/astc: Remove unused variables | Lioncash | 2019-05-10 | 1 | -6/+2 |
| |/ / / | | | | | | | | | | | | | Silences a few compilation warnings. | ||||
* | | | | Merge pull request #2413 from FernandoS27/opt-gpu | Rodrigo Locatti | 2019-05-14 | 7 | -33/+54 |
|\ \ \ \ | |/ / / |/| | | | Rasterizer Cache: refactor flushing & optimize memory usage of surfaces | ||||
| * | | | Rasterizer Cache: Use a temporal storage for Surfaces loading/flushing. | Fernando Sahmkow | 2019-04-21 | 4 | -18/+30 |
| | | | | | | | | | | | | | | | | | | | | This PR should heavily reduce memory usage since temporal buffers are no longer stored per Surface but instead managed by the Rasterizer Cache. | ||||
| * | | | RasterizerCache Redesign: Flush | Fernando Sahmkow | 2019-04-20 | 6 | -17/+26 |
| | | | | | | | | | | | | | | | | | | | | | | | | flushing is now responsability of children caches instead of the cache object. This change will allow the specific cache to pass extra parameters on flushing and will allow more flexibility. | ||||
* | | | | Merge pull request #2442 from FernandoS27/astc-fix | bunnei | 2019-05-09 | 1 | -1/+3 |
|\ \ \ \ | | | | | | | | | | | Fix Layered ASTC Textures | ||||
| * | | | | Fix Layered ASTC Textures | Fernando Sahmkow | 2019-05-01 | 1 | -1/+3 |
| | |_|/ | |/| | | | | | | | | | | By adding the missing layer offset in ASTC compression. | ||||
* | | | | Merge pull request #2443 from ReinUsesLisp/skip-repeated-variants | bunnei | 2019-05-09 | 1 | -1/+4 |
|\ \ \ \ | | | | | | | | | | | gl_shader_disk_cache: Skip stored shader variants instead of asserting | ||||
| * | | | | gl_shader_disk_cache: Skip stored shader variants instead of asserting | ReinUsesLisp | 2019-05-01 | 1 | -1/+4 |
| |/ / / | | | | | | | | | | | | | | | | | | | | | | | | | Instead of asserting on already stored shader variants, silently skip them. This shouldn't be happening but when a shader is invalidated and it is not stored in the shader cache, this assert would hit and save that shader anyways when the asserts are disabled. | ||||
* | | | | Merge pull request #2429 from FernandoS27/compute | bunnei | 2019-05-09 | 13 | -142/+483 |
|\ \ \ \ | |_|_|/ |/| | | | Corrections and Implementation on GPU Engines | ||||
| * | | | Refactors and name corrections. | Fernando Sahmkow | 2019-05-01 | 6 | -35/+35 |
| | | | | |||||
| * | | | Fixes and Corrections to DMA Engine | Fernando Sahmkow | 2019-04-23 | 2 | -37/+57 |
| | | | | |||||
| * | | | Add Swizzle Parameters to the DMA engine | Fernando Sahmkow | 2019-04-23 | 2 | -2/+27 |
| | | | | |||||
| * | | | Add Documentation Headers to all the GPU Engines | Fernando Sahmkow | 2019-04-23 | 5 | -0/+29 |
| | | | | |||||
| * | | | Corrections and styling | Fernando Sahmkow | 2019-04-23 | 5 | -6/+9 |
| | | | | |||||
| * | | | Implement Maxwell3D Data Upload | Fernando Sahmkow | 2019-04-23 | 2 | -3/+32 |
| | | | | |||||
| * | | | Introduce skeleton of the GPU Compute Engine. | Fernando Sahmkow | 2019-04-23 | 3 | -8/+202 |
| | | | | |||||
| * | | | Revamp Kepler Memory to use a subegine to manage uploads | Fernando Sahmkow | 2019-04-23 | 6 | -93/+134 |
| | | | | |||||
* | | | | shader/decode/texture: Remove unused variable | Lioncash | 2019-05-04 | 1 | -1/+0 |
| | | | | | | | | | | | | | | | | This isn't used anywhere, so we can get rid of it. | ||||
* | | | | gl_rasterizer: Silence unused variable warning | Lioncash | 2019-05-04 | 1 | -2/+2 |
| |/ / |/| | | | | | | | | Makes use of src, so it's not considered unused. | ||||
* | | | Merge pull request #2100 from FreddyFunk/disk-cache-precompiled-file | bunnei | 2019-05-01 | 3 | -133/+170 |
|\ \ \ | | | | | | | | | gl_shader_disk_cache: Improve precompiled shader cache generation speed and size | ||||
| * | | | Re added new lines at the end of files | FreddyFunk | 2019-04-23 | 2 | -2/+2 |
| | | | | |||||
| * | | | gl_shader_disk_cache: Compress precompiled shader cache file with Zstandard | unknown | 2019-04-23 | 1 | -6/+10 |
| | | | | |||||
| * | | | gl_shader_disk_cache: Use VectorVfsFile for the virtual precompiled shader cache file | unknown | 2019-04-23 | 3 | -101/+168 |
| | | | | |||||
| * | | | gl_shader_disk_cache: Remove per shader compression | unknown | 2019-04-23 | 2 | -45/+11 |
| | | | | |||||
* | | | | Merge pull request #2435 from ReinUsesLisp/misc-vc | bunnei | 2019-04-29 | 2 | -3/+4 |
|\ \ \ \ | | | | | | | | | | | shader_ir: Miscellaneous fixes | ||||
| * | | | | shader_ir: Move Sampler index entry in operand< to sort declarations | ReinUsesLisp | 2019-04-26 | 1 | -2/+2 |
| | | | | | |||||
| * | | | | shader_ir: Add missing entry to Sampler operand< comparison | ReinUsesLisp | 2019-04-26 | 1 | -2/+3 |
| | | | | | |||||
| * | | | | shader_ir/texture: Fix sampler const buffer key shift | ReinUsesLisp | 2019-04-26 | 1 | -1/+1 |
| |/ / / | |||||
* | | | | Merge pull request #2322 from ReinUsesLisp/wswitch | bunnei | 2019-04-29 | 10 | -77/+106 |
|\ \ \ \ | | | | | | | | | | | video_core: Silent -Wswitch warnings | ||||
| * | | | | video_core: Silent -Wswitch warnings | ReinUsesLisp | 2019-04-18 | 10 | -77/+106 |
| | | | | | |||||
* | | | | | Merge pull request #2423 from FernandoS27/half-correct | bunnei | 2019-04-29 | 2 | -15/+16 |
|\ \ \ \ \ | |_|/ / / |/| | | | | Corrections on Half Float operations: HADD2 HMUL2 and HFMA2 | ||||
| * | | | | Corrections Half Float operations on const buffers and implement saturation. | Fernando Sahmkow | 2019-04-21 | 2 | -15/+16 |
| | | | | | |||||
* | | | | | Merge pull request #2422 from ReinUsesLisp/fixup-samplers | Hexagon12 | 2019-04-23 | 1 | -3/+5 |
|\ \ \ \ \ | | | | | | | | | | | | | gl_state: Fix samplers memory corruption | ||||
| * | | | | | gl_state: Fix samplers memory corruption | ReinUsesLisp | 2019-04-19 | 1 | -3/+5 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | It was possible for "samplers" to be read without being written. This addresses that. | ||||
* | | | | | | Merge pull request #2425 from FernandoS27/y-direction | Hexagon12 | 2019-04-23 | 1 | -0/+3 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | Fix flipping on some games by applying Y direction register | ||||
| * | | | | | | Apply Position Y Direction | Fernando Sahmkow | 2019-04-20 | 1 | -0/+3 |
| | |_|_|_|/ | |/| | | | | |||||
* | | | | | | Merge pull request #2383 from ReinUsesLisp/aoffi-test | bunnei | 2019-04-23 | 11 | -75/+163 |
|\ \ \ \ \ \ | |_|_|_|_|/ |/| | | | | | gl_shader_decompiler: Disable variable AOFFI on unsupported devices | ||||
| * | | | | | gl_shader_decompiler: Use variable AOFFI on supported hardware | ReinUsesLisp | 2019-04-14 | 10 | -71/+102 |
| | | | | | | |||||
| * | | | | | gl_device: Implement interface and add uniform offset alignment | ReinUsesLisp | 2019-04-10 | 5 | -13/+70 |
| | | | | | | |||||
* | | | | | | Merge pull request #2403 from FernandoS27/compressed-linear | bunnei | 2019-04-22 | 1 | -2/+5 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | Support compressed formats on linear textures. | ||||
| * | | | | | | Support compressed formats on linear textures. | Fernando Sahmkow | 2019-04-15 | 1 | -2/+5 |
| | |_|_|_|/ | |/| | | | | |||||
* | | | | | | Merge pull request #2411 from FernandoS27/unsafe-gpu | bunnei | 2019-04-22 | 5 | -15/+99 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | GPU Manager: Implement ReadBlockUnsafe and WriteBlockUnsafe | ||||
| * | | | | | | make ReadBlockunsafe and WriteBlockunsafe, ignore invalid pages. | Fernando Sahmkow | 2019-04-20 | 1 | -4/+12 |
| | | | | | | | |||||
| * | | | | | | Implement IsBlockContinous | Fernando Sahmkow | 2019-04-17 | 2 | -2/+13 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This detects when a GPU Memory Block is not continous within host cpu memory. | ||||
| * | | | | | | Use ReadBlockUnsafe for fetyching DMA CommandLists | Fernando Sahmkow | 2019-04-16 | 2 | -4/+2 |
| | | | | | | | |||||
| * | | | | | | Document unsafe versions and add BlockCopyUnsafe | Fernando Sahmkow | 2019-04-16 | 3 | -16/+45 |
| | | | | | | | |||||
| * | | | | | | Use ReadBlockUnsafe for Shader Cache | Fernando Sahmkow | 2019-04-16 | 1 | -5/+7 |
| | | | | | | | |||||
| * | | | | | | Use ReadBlockUnsafe on TIC and TSC reading | Fernando Sahmkow | 2019-04-16 | 2 | -2/+4 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Use ReadBlockUnsafe on TIC and TSC reading as memory is never flushed from host GPU there. | ||||
| * | | | | | | GPU MemoryManager: Implement ReadBlockUnsafe and WriteBlockUnsafe | Fernando Sahmkow | 2019-04-16 | 2 | -0/+34 |
| |/ / / / / | |||||
* | | | | | | Merge pull request #2400 from FernandoS27/corret-kepler-mem | bunnei | 2019-04-22 | 4 | -17/+81 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | Implement Kepler Memory on both Linear and BlockLinear. | ||||
| * | | | | | | Apply Const correctness to SwizzleKepler and replace u32 for size_t on iterators. | Fernando Sahmkow | 2019-04-16 | 2 | -9/+12 |
| | | | | | | | |||||
| * | | | | | | Use WriteBlock and ReadBlock. | Fernando Sahmkow | 2019-04-16 | 1 | -10/+6 |
| | | | | | | | |||||
| * | | | | | | Implement Block Linear copies in Kepler Memory. | Fernando Sahmkow | 2019-04-16 | 3 | -5/+38 |
| | | | | | | | |||||
| * | | | | | | Correct Kepler Memory on Linear Pushes. | Fernando Sahmkow | 2019-04-15 | 2 | -16/+48 |
| |/ / / / / | |||||
* | | | | | | Merge pull request #2407 from FernandoS27/f2f | bunnei | 2019-04-20 | 2 | -23/+73 |
|\ \ \ \ \ \ | |_|_|_|_|/ |/| | | | | | Do some corrections in conversion shader instructions. | ||||
| * | | | | | Do some corrections in conversion shader instructions. | Fernando Sahmkow | 2019-04-16 | 2 | -23/+73 |
| |/ / / / | | | | | | | | | | | | | | | | | | | | | | | | | | Corrects encodings for I2F, F2F, I2I and F2I Implements Immediate variants of all four conversion types. Add assertions to unimplemented stuffs. | ||||
* | | | | | Merge pull request #2409 from ReinUsesLisp/half-floats | bunnei | 2019-04-20 | 9 | -136/+181 |
|\ \ \ \ \ | |_|_|/ / |/| | | | | shader_ir/decode: Miscellaneous fixes to half-float decompilation | ||||
| * | | | | vk_shader_decompiler: Add missing operations | ReinUsesLisp | 2019-04-16 | 1 | -0/+7 |
| | | | | | |||||
| * | | | | shader_ir/decode: Fix half float pre-operations and remove MetaHalfArithmetic | ReinUsesLisp | 2019-04-16 | 9 | -85/+72 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Operations done before the main half float operation (like HAdd) were managing a packed value instead of the unpacked one. Adding an unpacked operation allows us to drop the per-operand MetaHalfArithmetic entry, simplifying the code overall. | ||||
| * | | | | gl_shader_decompiler: Fix MrgH0 decompilation | ReinUsesLisp | 2019-04-16 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | GLSL decompilation for HMergeH0 was wrong. This addresses that issue. | ||||
| * | | | | shader_ir/decode: Implement half float saturation | ReinUsesLisp | 2019-04-16 | 5 | -8/+31 |
| | | | | | |||||
| * | | | | shader_ir/decode: Reduce severity of unimplemented half-float FTZ | ReinUsesLisp | 2019-04-16 | 3 | -3/+9 |
| | | | | | |||||
| * | | | | renderer_opengl: Implement half float NaN comparisons | ReinUsesLisp | 2019-04-16 | 3 | -36/+59 |
| | | | | | |||||
| * | | | | shader_ir: Avoid using static on heap-allocated objects | ReinUsesLisp | 2019-04-16 | 1 | -5/+4 |
| |/ / / | | | | | | | | | | | | | | | | | Using static here might be faster at runtime, but it adds a heap allocation called before main. | ||||
* | | | | Merge pull request #2374 from lioncash/pagetable | bunnei | 2019-04-20 | 9 | -14/+47 |
|\ \ \ \ | |_|_|/ |/| | | | core: Reorganize boot order | ||||
| * | | | video_core/gpu: Create threads separately from initialization | Lioncash | 2019-04-12 | 9 | -14/+47 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Like with CPU emulation, we generally don't want to fire off the threads immediately after the relevant classes are initialized, we want to do this after all necessary data is done loading first. This splits the thread creation into its own interface member function to allow controlling when these threads in particular get created. | ||||
* | | | | Merge pull request #2318 from ReinUsesLisp/sampler-cache | bunnei | 2019-04-18 | 9 | -181/+183 |
|\ \ \ \ | | | | | | | | | | | gl_sampler_cache: Port sampler cache to OpenGL | ||||
| * | | | | gl_sampler_cache: Port sampler cache to OpenGL | ReinUsesLisp | 2019-04-02 | 5 | -123/+82 |
| | | | | | |||||
| * | | | | video_core: Abstract vk_sampler_cache into a templated class | ReinUsesLisp | 2019-04-02 | 5 | -58/+101 |
| | | | | | |||||
* | | | | | Merge pull request #2348 from FernandoS27/guest-bindless | bunnei | 2019-04-18 | 8 | -44/+217 |
|\ \ \ \ \ | | | | | | | | | | | | | Implement Bindless Textures on Shader Decompiler and GL backend | ||||
| * | | | | | Adapt Bindless to work with AOFFI | Fernando Sahmkow | 2019-04-08 | 1 | -7/+18 |
| | | | | | | |||||
| * | | | | | Move ConstBufferAccessor to Maxwell3d, correct mistakes and clang format. | Fernando Sahmkow | 2019-04-08 | 9 | -44/+25 |
| | | | | | | |||||
| * | | | | | Fix bad rebase | Fernando Sahmkow | 2019-04-08 | 1 | -2/+1 |
| | | | | | | |||||
| * | | | | | Fix TMML | Fernando Sahmkow | 2019-04-08 | 1 | -5/+7 |
| | | | | | | |||||
| * | | | | | Simplify ConstBufferAccessor | Fernando Sahmkow | 2019-04-08 | 5 | -53/+22 |
| | | | | | | |||||
| * | | | | | Refactor GetTextureCode and GetTexCode to use an optional instead of optional parameters | Fernando Sahmkow | 2019-04-08 | 2 | -34/+33 |
| | | | | | | |||||
| * | | | | | Implement TXQ_B | Fernando Sahmkow | 2019-04-08 | 2 | -2/+10 |
| | | | | | | |||||
| * | | | | | Implement TMML_B | Fernando Sahmkow | 2019-04-08 | 1 | -5/+10 |
| | | | | | | |||||
| * | | | | | Corrections to TEX_B | Fernando Sahmkow | 2019-04-08 | 2 | -4/+37 |
| | | | | | | |||||
| * | | | | | Fixes to Const Buffer Accessor and Formatting | Fernando Sahmkow | 2019-04-08 | 3 | -10/+10 |
| | | | | | | |||||
| * | | | | | Implement Bindless Handling on SetupTexture | Fernando Sahmkow | 2019-04-08 | 4 | -18/+34 |
| | | | | | | |||||
| * | | | | | Unify both sampler types. | Fernando Sahmkow | 2019-04-08 | 4 | -22/+48 |
| | | | | | | |||||
| * | | | | | Implement Bindless Samplers and TEX_B in the IR. | Fernando Sahmkow | 2019-04-08 | 4 | -16/+77 |
| | | | | | | |||||
| * | | | | | Implement Const Buffer Accessor | Fernando Sahmkow | 2019-04-08 | 5 | -2/+65 |
| | | | | | | |||||
* | | | | | | Merge pull request #2315 from ReinUsesLisp/severity-decompiler | bunnei | 2019-04-17 | 2 | -5/+20 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | shader_ir/decode: Reduce the severity of common assertions | ||||
| * | | | | | | shader_ir/memory: Reduce severity of LD_L cache management and log it | ReinUsesLisp | 2019-04-03 | 2 | -2/+9 |
| | | | | | | | |||||
| * | | | | | | shader_ir/memory: Reduce severity of ST_L cache management and log it | ReinUsesLisp | 2019-04-03 | 2 | -3/+11 |
| | | | | | | | |||||
* | | | | | | | Merge pull request #2384 from ReinUsesLisp/gl-state-clear | bunnei | 2019-04-17 | 1 | -4/+4 |
|\ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | gl_rasterizer: Apply just the needed state on Clear | ||||
| * | | | | | | | gl_rasterizer: Apply just the needed state on Clear | ReinUsesLisp | 2019-04-10 | 1 | -4/+4 |
| | |_|_|_|_|/ | |/| | | | | | |||||
* | | | | | | | Merge pull request #2092 from ReinUsesLisp/stg | bunnei | 2019-04-17 | 11 | -89/+186 |
|\ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | shader/memory: Implement STG and global memory flushing | ||||
| * | | | | | | | shader_ir: Implement STG, keep track of global memory usage and flush | ReinUsesLisp | 2019-04-14 | 11 | -89/+186 |
| | |_|_|_|_|/ | |/| | | | | | |||||
* / | | | | | | Correct Pitch in Fermi2D | Fernando Sahmkow | 2019-04-15 | 1 | -4/+1 |
|/ / / / / / | |||||
* | | | | | | Merge pull request #2373 from FernandoS27/z32 | bunnei | 2019-04-14 | 2 | -2/+19 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | Set Pixel Format to Z32 if its R32F and depth compare enabled, and Implement format ZF32_X24S8 | ||||
| * | | | | | | Implement Texture Format ZF32_X24S8. | Fernando Sahmkow | 2019-04-09 | 1 | -0/+2 |
| | | | | | | | |||||
| * | | | | | | Correct depth compare with color formats for R32F | Fernando Sahmkow | 2019-04-09 | 1 | -2/+17 |
| | |_|/ / / | |/| | | | | |||||
* | | | | | | Merge pull request #2386 from ReinUsesLisp/shader-manager | bunnei | 2019-04-14 | 2 | -34/+61 |
|\ \ \ \ \ \ | |_|_|_|_|/ |/| | | | | | gl_shader_manager: Move code to source file and minor clean up | ||||
| * | | | | | gl_shader_manager: Move code to source file and minor clean up | ReinUsesLisp | 2019-04-11 | 2 | -34/+61 |
| | |/ / / | |/| | | | |||||
* | | | | | Merge pull request #2235 from ReinUsesLisp/spirv-decompiler | bunnei | 2019-04-12 | 3 | -1/+1465 |
|\ \ \ \ \ | | | | | | | | | | | | | vk_shader_decompiler: Implement a SPIR-V decompiler | ||||
| * | | | | | vk_shader_decompiler: Implement flow primitives | ReinUsesLisp | 2019-04-10 | 1 | -5/+82 |
| | | | | | | |||||
| * | | | | | vk_shader_decompiler: Implement most common texture primitives | ReinUsesLisp | 2019-04-10 | 1 | -8/+65 |
| | | | | | | |||||
| * | | | | | vk_shader_decompiler: Implement texture decompilation helper functions | ReinUsesLisp | 2019-04-10 | 1 | -0/+32 |
| | | | | | | |||||
| * | | | | | vk_shader_decompiler: Implement Assign and LogicalAssign | ReinUsesLisp | 2019-04-10 | 1 | -2/+64 |
| | | | | | | |||||
| * | | | | | vk_shader_decompiler: Implement non-OperationCode visits | ReinUsesLisp | 2019-04-10 | 1 | -7/+129 |
| | | | | | | |||||
| * | | | | | vk_shader_decompiler: Implement OperationCode decompilation interface | ReinUsesLisp | 2019-04-10 | 1 | -1/+411 |
| | | | | | | |||||
| * | | | | | vk_shader_decompiler: Implement Visit | ReinUsesLisp | 2019-04-10 | 1 | -1/+50 |
| | | | | | | |||||
| * | | | | | vk_shader_decompiler: Implement labels tree and flow | ReinUsesLisp | 2019-04-10 | 1 | -0/+71 |
| | | | | | | |||||
| * | | | | | vk_shader_decompiler: Implement declarations | ReinUsesLisp | 2019-04-10 | 1 | -3/+457 |
| | | | | | | |||||
| * | | | | | vk_shader_decompiler: Declare and stub interface for a SPIR-V decompiler | ReinUsesLisp | 2019-04-10 | 3 | -0/+127 |
| | | | | | | |||||
| * | | | | | video_core: Add sirit as optional dependency with Vulkan | ReinUsesLisp | 2019-04-10 | 1 | -1/+4 |
| |/ / / / | | | | | | | | | | | | | | | | sirit is a runtime assembler for SPIR-V | ||||
* | | | | | gl_rasterizer_cache: Relax restrictions on FastCopySurface and FastLayeredCopySurface | Fernando Sahmkow | 2019-04-11 | 1 | -4/+10 |
| | | | | | |||||
* | | | | | Merge pull request #2278 from ReinUsesLisp/vc-texture-cache | bunnei | 2019-04-11 | 3 | -0/+974 |
|\ \ \ \ \ | | | | | | | | | | | | | video_core: Implement API agnostic view based texture cache | ||||
| * | | | | | video_core: Implement API agnostic view based texture cache | ReinUsesLisp | 2019-03-22 | 3 | -0/+974 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Implements an API agnostic texture view based texture cache. Classes defined here are intended to be inherited by the API implementation and used in API-specific code. This implementation exposes protected virtual functions to be called from the implementer. Before executing any surface copies methods (defined in API-specific code) it tries to detect if the overlapping surface is a superset and if it is, it creates a view. Views are references of a subset of a surface, it can be a superset view (the same as referencing the whole texture). Current code manages 1D, 1D array, 2D, 2D array, cube maps and cube map arrays with layer and mipmap level views. Texture 3D slices views are not implemented. If the view attempt fails, the fast path is invoked with the overlapping textures (defined in the implementer). If that one fails (returning nullptr) it will flush and reload the texture. | ||||
* | | | | | | Merge pull request #2372 from FernandoS27/fermi-fix | bunnei | 2019-04-11 | 1 | -0/+4 |
|\ \ \ \ \ \ | |_|/ / / / |/| | | | | | Correct Fermi Copy on Linear Textures. | ||||
| * | | | | | Correct Fermi Copy on Linear Textures. | Fernando Sahmkow | 2019-04-09 | 1 | -0/+4 |
| | |/ / / | |/| | | | |||||
* | | | | | Merge pull request #2345 from ReinUsesLisp/multibind | bunnei | 2019-04-10 | 4 | -30/+69 |
|\ \ \ \ \ | | | | | | | | | | | | | gl_rasterizer: Use ARB_multi_bind to update buffers with a single call per drawcall | ||||
| * | | | | | renderer_opengl/utils: Skip empty binds | ReinUsesLisp | 2019-04-06 | 1 | -0/+3 |
| | | | | | | |||||
| * | | | | | gl_rasterizer: Use ARB_multi_bind to update SSBOs | ReinUsesLisp | 2019-04-06 | 2 | -9/+9 |
| | | | | | | |||||
| * | | | | | gl_rasterizer: Use ARB_multi_bind to update UBOs across stages | ReinUsesLisp | 2019-04-06 | 4 | -22/+58 |
| | | | | | | |||||
* | | | | | | Merge pull request #2375 from FernandoS27/fix-ldc | bunnei | 2019-04-10 | 1 | -2/+1 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | Remove unnecessary bounding in LD_C | ||||
| * | | | | | | Remove bounding in LD_C | Fernando Sahmkow | 2019-04-10 | 1 | -2/+1 |
| | |/ / / / | |/| | | | | |||||
* | | | | | | Merge pull request #2354 from lioncash/header | bunnei | 2019-04-10 | 8 | -3/+10 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | video_core/texures/texture: Remove unnecessary includes | ||||
| * | | | | | | video_core/textures/convert: Replace include with a forward declaration | Lioncash | 2019-04-06 | 2 | -1/+5 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | Avoids dragging in a direct dependency in a header. | ||||
| * | | | | | | video_core/texures/texture: Remove unnecessary includes | Lioncash | 2019-04-06 | 6 | -2/+5 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Nothing in this header relies on common_funcs or the memory manager. This gets rid of reliance on indirect inclusions in the OpenGL caches. | ||||
* | | | | | | | Merge pull request #2366 from FernandoS27/xmad-fix | bunnei | 2019-04-10 | 2 | -9/+33 |
|\ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | Correct XMAD mode, psl and high_b on different encodings. | ||||
| * | | | | | | | Correct XMAD mode, psl and high_b on different encodings. | Fernando Sahmkow | 2019-04-08 | 2 | -9/+33 |
| | |/ / / / / | |/| | | | | | |||||
* | | | | | | | Merge pull request #2369 from FernandoS27/mip-align | bunnei | 2019-04-09 | 2 | -4/+12 |
|\ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | gl_backend: Align Pixel Storage | ||||
| * | | | | | | | gl_backend: Align Pixel Storage | Fernando Sahmkow | 2019-04-08 | 2 | -4/+12 |
| |/ / / / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This commit makes sure GL reads on the correct pack size for the respective texture buffer. | ||||
* / / / / / / | Correct LOP_IMN encoding | Fernando Sahmkow | 2019-04-08 | 1 | -1/+1 |
|/ / / / / / | |||||
* | | | | | | Merge pull request #2300 from FernandoS27/null-shader | bunnei | 2019-04-07 | 1 | -0/+4 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | shader_cache: Permit a Null Shader in case of a bad host_ptr. | ||||
| * | | | | | | Permit a Null Shader in case of a bad host_ptr. | Fernando Sahmkow | 2019-04-07 | 1 | -0/+4 |
| | | | | | | | |||||
* | | | | | | | Merge pull request #2355 from ReinUsesLisp/sync-point | bunnei | 2019-04-07 | 1 | -2/+2 |
|\ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | maxwell_3d: Reduce severity of ProcessSyncPoint | ||||
| * | | | | | | | maxwell_3d: Reduce severity of ProcessSyncPoint | ReinUsesLisp | 2019-04-06 | 1 | -2/+2 |
| | |/ / / / / | |/| | | | | | |||||
* | | | | | | | Merge pull request #2306 from ReinUsesLisp/aoffi | bunnei | 2019-04-07 | 4 | -71/+205 |
|\ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | shader_ir: Implement AOFFI for TEX and TLD4 | ||||
| * | | | | | | | gl_shader_decompiler: Hide local definitions inside an anonymous namespace | ReinUsesLisp | 2019-03-31 | 1 | -6/+8 |
| | | | | | | | | |||||
| * | | | | | | | shader_ir/decode: Silent implicit sign conversion warning | Mat M | 2019-03-31 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | Co-Authored-By: ReinUsesLisp <reinuseslisp@airmail.cc> | ||||
| * | | | | | | | gl_shader_decompiler: Add AOFFI backing implementation | ReinUsesLisp | 2019-03-30 | 1 | -38/+85 |
| | | | | | | | | |||||
| * | | | | | | | shader_ir/decode: Implement AOFFI for TEX and TLD4 | ReinUsesLisp | 2019-03-30 | 2 | -27/+94 |
| | | | | | | | | |||||
| * | | | | | | | shader_ir: Implement immediate register tracking | ReinUsesLisp | 2019-03-30 | 2 | -1/+19 |
| | | | | | | | | |||||
* | | | | | | | | Merge pull request #2321 from ReinUsesLisp/gl-state-rework | bunnei | 2019-04-07 | 3 | -339/+324 |
|\ \ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | | | gl_state: Rework to enable individual applies | ||||
| * | | | | | | | | gl_state: Rework to enable individual applies | ReinUsesLisp | 2019-04-04 | 3 | -339/+324 |
| | |_|_|_|_|/ / | |/| | | | | | | |||||
* | | | | | | | | Merge pull request #2098 from FreddyFunk/disk-cache-zstd | bunnei | 2019-04-07 | 1 | -6/+6 |
|\ \ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | | | gl_shader_disk_cache: Use Zstandard for compression | ||||
| * | | | | | | | | common/zstd_compression: simplify decompression interface | unknown | 2019-03-29 | 1 | -3/+2 |
| | | | | | | | | | |||||
| * | | | | | | | | gl_shader_disk_cache: Fixup clang format | unknown | 2019-03-29 | 1 | -2/+3 |
| | | | | | | | | | |||||
| * | | | | | | | | gl_shader_disk_cache: Use Zstandard for compression | unknown | 2019-03-29 | 1 | -6/+6 |
| | | | | | | | | | |||||
* | | | | | | | | | Merge pull request #2352 from bunnei/mem-manager-fixes | bunnei | 2019-04-07 | 3 | -12/+84 |
|\ \ \ \ \ \ \ \ \ | |_|_|_|/ / / / / |/| | | | | | | | | memory_manager: Improved implementation of read/write/copy block. | ||||
| * | | | | | | | | memory_manager: Improved implementation of read/write/copy block. | bunnei | 2019-04-06 | 3 | -12/+84 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Fixes graphical issues with Chocobo's Mystery Dungeon EVERY BUDDY! - Fixes a crash with Mario Tennis Aces | ||||
* | | | | | | | | | Merge pull request #2317 from FernandoS27/sync | bunnei | 2019-04-06 | 2 | -1/+27 |
|\ \ \ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | | | | | Implement SyncPoint Register in the GPU. | ||||
| * | | | | | | | | | Implement SyncPoint Register in the GPU. | Fernando Sahmkow | 2019-04-06 | 2 | -1/+27 |
| | |_|_|/ / / / / | |/| | | | | | | | |||||
* | | | | | | | | | Merge pull request #2346 from lioncash/header | bunnei | 2019-04-06 | 10 | -22/+39 |
|\ \ \ \ \ \ \ \ \ | |_|/ / / / / / / |/| | | | | | | | | video_core/engines: Remove unnecessary inclusions where applicable | ||||
| * | | | | | | | | video_core/engines: Make memory manager members private | Lioncash | 2019-04-06 | 9 | -13/+14 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | These aren't used externally by anything, so they can be made private data members. | ||||
| * | | | | | | | | video_core/engines: Remove unnecessary inclusions where applicable | Lioncash | 2019-04-06 | 10 | -9/+25 |
| | |_|_|_|/ / / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Replaces header inclusions with forward declarations where applicable and also removes unused headers within the cpp file. This reduces a few more dependencies on core/memory.h | ||||
* | | | | | | | | Merge pull request #2350 from lioncash/vmem | bunnei | 2019-04-06 | 2 | -22/+38 |
|\ \ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | | | video_core/memory_manager: Mark a few member functions with the const qualifier | ||||
| * | | | | | | | | video_core/memory_manager: Make Read() a const qualified member function | Lioncash | 2019-04-06 | 2 | -6/+6 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Given this doesn't actually alter internal state, this can be made a const member function. | ||||
| * | | | | | | | | video_core/memory_manager: Make ReadBlock() a const qualifier member function | Lioncash | 2019-04-06 | 2 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Now, since we have a const qualified variant of GetPointer(), we can put it to use in ReadBlock() to retrieve the source pointer that is passed into memcpy. Now block reading may be done from a const context. | ||||
| * | | | | | | | | video_core/memory_manager: Add a const qualified variant of GetPointer() | Lioncash | 2019-04-06 | 2 | -2/+17 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Allows retrieving read-only pointers from a const context externally. | ||||
| * | | | | | | | | video_core/memory_manager: Make FindFreeRegion() a const member function | Lioncash | 2019-04-06 | 2 | -10/+11 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This doesn't modify internal state, so it can be made a const member function. | ||||
| * | | | | | | | | video_core/memory_manager: Make GpuToCpuAddress() a const member function | Lioncash | 2019-04-06 | 2 | -3/+3 |
| |/ / / / / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This doesn't modify any internal state, so it can be made a const member function to allow its use in const contexts. | ||||
* | | | | | | | | Merge pull request #2347 from lioncash/trunc | bunnei | 2019-04-06 | 1 | -1/+1 |
|\ \ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | | | video_core/gpu_thread: Silence truncation warning in ThreadManager's constructor | ||||
| * | | | | | | | | video_core/gpu_thread: Silence truncation warning in ThreadManager's constructor | Lioncash | 2019-04-06 | 1 | -1/+1 |
| |/ / / / / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Since c5d41fd812d7eb1a04f36b76c08fe971cee0868c callback parameters were changed to use an s64 to represent late cycles instead of an int, so this was causing a truncation warning to occur here. Changing it to s64 is sufficient to silence the warning. | ||||
* | | | | | | | | Merge pull request #2327 from ReinUsesLisp/crash-safe-visit | bunnei | 2019-04-06 | 1 | -1/+6 |
|\ \ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | | | gl_shader_decompiler: Return early when an operation is invalid | ||||
| * | | | | | | | | gl_shader_decompiler: Return early when an operation is invalid | ReinUsesLisp | 2019-04-03 | 1 | -1/+6 |
| | |_|_|/ / / / | |/| | | | | | | |||||
* | | | | | | | | Merge pull request #2337 from lioncash/temporary | bunnei | 2019-04-06 | 1 | -12/+12 |
|\ \ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | | | gl_shader_decompiler: Rename GenerateTemporal() to GenerateTemporary() | ||||
| * | | | | | | | | gl_shader_decompiler: Rename GenerateTemporal() to GenerateTemporary() | Lioncash | 2019-04-05 | 1 | -12/+12 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Temporal generally indicates a relation to time, but this is just creating a temporary, so this isn't really an accurate name for what the function is actually doing. | ||||
* | | | | | | | | | video_core/macro_interpreter: Remove assertion within FetchParameter() | Lioncash | 2019-04-06 | 1 | -2/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | We can just use .at(), which essentially does the same thing, but with less code. | ||||
* | | | | | | | | | video_core/macro_interpreter: Simplify GetRegister() | Lioncash | 2019-04-06 | 1 | -11/+6 |
| |_|/ / / / / / |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Given we already ensure nothing can set the zeroth register in SetRegister(), we don't need to check if the index is zero and special case it. We can just access the register normally, since it's already going to be zero. We can also replace the assertion with .at() to perform the equivalent behavior inline as part of the API. | ||||
* | | | | | | | | Merge pull request #2282 from bunnei/gpu-asynch-v2 | bunnei | 2019-04-05 | 3 | -51/+65 |
|\ \ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | | | gpu_thread: Improve synchronization by using CoreTiming. | ||||
| * | | | | | | | | gpu_thread: Improve synchronization by using CoreTiming. | bunnei | 2019-04-02 | 3 | -51/+65 |
| | |_|_|_|_|_|/ | |/| | | | | | | |||||
* | | | | | | | | Merge pull request #2336 from ReinUsesLisp/txq | bunnei | 2019-04-05 | 1 | -2/+3 |
|\ \ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | | | gl_shader_decompiler: Fix TXQ types | ||||
| * | | | | | | | | gl_shader_decompiler: Fix TXQ types | ReinUsesLisp | 2019-04-05 | 1 | -2/+3 |
| | |_|/ / / / / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | TXQ returns integer types. Shaders usually do: R0 = TXQ(); // => int R0 = static_cast<float>(R0); If we don't treat it as an integer, it will cast a binary float value as float - resulting in a corrupted number. | ||||
* | | | | | | | | video_core/renderer_opengl: Remove unnecessary includes | Lioncash | 2019-04-04 | 13 | -24/+4 |
| |_|/ / / / / |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Quite a few unused includes have built up over time, particularly on core/memory.h. Removing these includes means the source files including those files will no longer need to be rebuilt if they're changed, making compilation slightly faster in this scenario. | ||||
* | | | | | | | Merge pull request #2093 from FreddyFunk/disk-cache-better-compression | bunnei | 2019-04-04 | 2 | -39/+9 |
|\ \ \ \ \ \ \ | | |_|_|/ / / | |/| | | | | | Better LZ4 compression utilization for the disk based shader cache and the yuzu build system | ||||
| * | | | | | | gl_shader_disk_cache: Use LZ4HC with compression level 9 instead of compression level 12 for less compression time | unknown | 2019-03-29 | 1 | -3/+3 |
| | | | | | | | |||||
| * | | | | | | Addressed feedback | unknown | 2019-03-29 | 1 | -6/+6 |
| | | | | | | | |||||
| * | | | | | | gl_shader_disk_cache: Use better compression for transferable and precompiled shader disk chache files | unknown | 2019-03-29 | 1 | -2/+2 |
| | | | | | | | |||||
| * | | | | | | data_compression: Move LZ4 compression from video_core/gl_shader_disk_cache to common/data_compression | unknown | 2019-03-29 | 2 | -39/+9 |
| | |_|/ / / | |/| | | | | |||||
* | | | | | | Merge pull request #2299 from lioncash/maxwell | bunnei | 2019-04-04 | 4 | -17/+15 |
|\ \ \ \ \ \ | |_|/ / / / |/| | | | | | gl_shader_manager: Remove reliance on a global accessor within MaxwellUniformData::SetFromRegs() | ||||
| * | | | | | gl_shader_manager: Remove unnecessary gl_shader_manager inclusion | Lioncash | 2019-03-28 | 1 | -2/+0 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This isn't used at all in the OpenGL shader cache, so we can remove it's include here, meaning one less file needs to be recompiled if any changes ever occur within that header. core/memory.h is also not used within this file at all, so we can remove it as well. | ||||
| * | | | | | gl_shader_manager: Move using statement into the cpp file | Lioncash | 2019-03-28 | 2 | -4/+4 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Avoids introducing Maxwell3D into the namespace for everything that includes the header. | ||||
| * | | | | | gl_shader_manager: Remove reliance on global accessor within MaxwellUniformData::SetFromRegs() | Lioncash | 2019-03-28 | 3 | -9/+9 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | We can just pass in the Maxwell3D instance instead of going through the system class to get at it. This also lets us simplify the interface a little bit. Since we pass in the Maxwell3D context now, we only really need to pass the shader stage index value in. | ||||
| * | | | | | gl_shader_manager: Amend Doxygen string for MaxwellUniformData | Lioncash | 2019-03-27 | 1 | -3/+3 |
| | |_|/ / | |/| | | | | | | | | | | | | | | | | | | Previously only one line of the whole comment was in proper Doxygen formatting. | ||||
* | | | | | Merge pull request #2302 from ReinUsesLisp/vk-swapchain | bunnei | 2019-04-03 | 3 | -1/+305 |
|\ \ \ \ \ | |_|_|/ / |/| | | | | vk_swapchain: Implement a swapchain manager | ||||
| * | | | | vk_swapchain: Implement a swapchain manager | ReinUsesLisp | 2019-03-29 | 3 | -1/+305 |
| | |/ / | |/| | | |||||
* | | | | general: Use deducation guides for std::lock_guard and std::unique_lock | Lioncash | 2019-04-01 | 4 | -17/+17 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | Since C++17, the introduction of deduction guides for locking facilities means that we no longer need to hardcode the mutex type into the locks themselves, making it easier to switch mutex types, should it ever be necessary in the future. | ||||
* | | | | Merge pull request #2297 from lioncash/reorder | bunnei | 2019-03-31 | 6 | -14/+14 |
|\ \ \ \ | | | | | | | | | | | video_core: Amend constructor initializer list order where applicable | ||||
| * | | | | video_core: Amend constructor initializer list order where applicable | Lioncash | 2019-03-27 | 6 | -14/+14 |
| | |/ / | |/| | | | | | | | | | | | | | | | | | | | | | | Specifies the members in the same order that initialization would take place in. This also silences -Wreorder warnings. | ||||
* | | | | Merge pull request #2298 from lioncash/variable | bunnei | 2019-03-31 | 5 | -14/+7 |
|\ \ \ \ | |_|/ / |/| | | | video_core/{gl_rasterizer, gpu_thread}: Remove unused class variables where applicable | ||||
| * | | | gpu_thread: Remove unused dma_pusher class member variable from ThreadManager | Lioncash | 2019-03-27 | 2 | -5/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | The pusher instance is only ever used in the constructor of the ThreadManager for creating the thread that the ThreadManager instance contains. Aside from that, the member is unused, so it can be removed. | ||||
| * | | | gl_rasterizer: Remove unused reference member variable from RasterizerOpenGL | Lioncash | 2019-03-27 | 3 | -9/+5 |
| |/ / | | | | | | | | | | | | | This member variable is no longer being used, so it can be removed, removing a dependency on EmuWindow from the rasterizer's interface" | ||||
* | | | Merge pull request #2296 from lioncash/override | bunnei | 2019-03-28 | 3 | -4/+4 |
|\ \ \ | | | | | | | | | video_core: Add missing override specifiers | ||||
| * | | | video_core: Add missing override specifiers | Lioncash | 2019-03-27 | 3 | -4/+4 |
| |/ / | | | | | | | | | | | | | | | | Ensures that the signatures will always match with the base class. Also silences a few compilation warnings. | ||||
* / / | video_core/gpu: Amend typo in GPU member variable name | Lioncash | 2019-03-27 | 2 | -7/+8 |
|/ / | | | | | | | smaphore -> semaphore | ||||
* | | Merge pull request #2256 from bunnei/gpu-vmm | bunnei | 2019-03-22 | 21 | -237/+533 |
|\ \ | | | | | | | gpu: Rewrite MemoryManager based on the VMManager implementation. | ||||
| * | | memory_manager: Cleanup FindFreeRegion. | bunnei | 2019-03-21 | 2 | -12/+6 |
| | | | |||||
| * | | memory_manager: Use Common::AlignUp in public interface as needed. | bunnei | 2019-03-21 | 1 | -11/+22 |
| | | | |||||
| * | | memory_manager: Bug fixes and further cleanup. | bunnei | 2019-03-21 | 2 | -73/+72 |
| | | | |||||
| * | | maxwell_dma: Check for valid source in destination before copy. | bunnei | 2019-03-21 | 1 | -0/+10 |
| | | | | | | | | | | | | - Avoid a crash in Octopath Traveler. | ||||
| * | | memory_manager: Add protections for invalid GPU addresses. | bunnei | 2019-03-21 | 2 | -22/+43 |
| | | | | | | | | | | | | - Avoid a crash in Xenoblade Chronicles 2. | ||||
| * | | gl_rasterizer_cache: Check that backing memory is valid before creating a surface. | bunnei | 2019-03-21 | 2 | -15/+12 |
| | | | | | | | | | | | | - Fixes a crash in Puyo Puyo Tetris. | ||||
| * | | gpu: Rewrite virtual memory manager using PageTable. | bunnei | 2019-03-21 | 10 | -201/+472 |
| | | | |||||
| * | | gpu: Move GPUVAddr definition to common_types. | bunnei | 2019-03-21 | 13 | -31/+24 |
| |/ | |||||
* / | Revert "Devirtualize Register/Unregister and use a wrapper instead." | bunnei | 2019-03-22 | 3 | -8/+12 |
|/ | | | | - Fixes graphical issues from transitions in Super Mario Odyssey. | ||||
* | gl_rasterizer: Skip zero addr/sized regions on flush/invalidate. | bunnei | 2019-03-17 | 1 | -0/+6 |
| | |||||
* | Merge pull request #2244 from bunnei/gpu-mem-refactor | bunnei | 2019-03-17 | 19 | -186/+194 |
|\ | | | | | video_core: Refactor to use MemoryManager interface for all memory access. | ||||
| * | video_core: Refactor to use MemoryManager interface for all memory access. | bunnei | 2019-03-16 | 19 | -186/+194 |
| | | | | | | | | | | | | | | | | | | | | | | # Conflicts: # src/video_core/engines/kepler_memory.cpp # src/video_core/engines/maxwell_3d.cpp # src/video_core/morton.cpp # src/video_core/morton.h # src/video_core/renderer_opengl/gl_global_cache.cpp # src/video_core/renderer_opengl/gl_global_cache.h # src/video_core/renderer_opengl/gl_rasterizer_cache.cpp | ||||
* | | memory: Simplify rasterizer cache operations. | bunnei | 2019-03-16 | 1 | -2/+1 |
|/ | |||||
* | gpu: Use host address for caching instead of guest address. | bunnei | 2019-03-15 | 24 | -288/+384 |
| | |||||
* | Merge pull request #2233 from ReinUsesLisp/morton-cleanup | bunnei | 2019-03-15 | 4 | -187/+146 |
|\ | | | | | video_core/morton: Miscellaneous changes | ||||
| * | video_core/morton: Use enum to describe MortonCopyPixels128 mode | ReinUsesLisp | 2019-03-13 | 3 | -7/+10 |
| | | |||||
| * | video_core/morton: Remove unused parameter in MortonSwizzle | ReinUsesLisp | 2019-03-13 | 3 | -8/+7 |
| | | |||||
| * | video_core/morton: Remove clang-format off when it's not needed | ReinUsesLisp | 2019-03-13 | 1 | -133/+129 |
| | | |||||
| * | video_core/morton: Remove unused functions | ReinUsesLisp | 2019-03-13 | 1 | -39/+0 |
| | | |||||
* | | Merge pull request #2229 from ReinUsesLisp/vk-sampler-cache | bunnei | 2019-03-15 | 4 | -24/+168 |
|\ \ | | | | | | | vk_sampler_cache: Implement a sampler cache | ||||
| * | | vk_sampler_cache: Use operator== instead of memcmp | Mat M | 2019-03-13 | 1 | -1/+1 |
| | | | | | | | | | Co-Authored-By: ReinUsesLisp <reinuseslisp@airmail.cc> | ||||
| * | | vk_sampler_cache: Implement a sampler cache | ReinUsesLisp | 2019-03-13 | 4 | -1/+140 |
| | | | |||||
| * | | video_core/texture: Add a raw representation of TSCEntry | ReinUsesLisp | 2019-03-12 | 1 | -24/+29 |
| |/ | |||||
* | | Merge pull request #2216 from ReinUsesLisp/rasterizer-system | bunnei | 2019-03-14 | 2 | -29/+31 |
|\ \ | | | | | | | gl_rasterizer: Use system instance passed from argument | ||||
| * | | gl_rasterizer: Use system instance passed from argument | ReinUsesLisp | 2019-03-11 | 2 | -29/+31 |
| | | | |||||
* | | | Merge pull request #2227 from lioncash/override | bunnei | 2019-03-13 | 2 | -5/+5 |
|\ \ \ | | | | | | | | | renderer_opengl/gl_global_cache: Add missing override specifiers | ||||
| * | | | renderer_opengl/gl_global_cache: Replace indexing for assignment with insert_or_assign | Lioncash | 2019-03-11 | 2 | -3/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The previous code had some minor issues with it, really not a big deal, but amending it is basically 'free', so I figured, "why not?". With the standard container maps, when: map[key] = thing; is done, this can cause potentially undesirable behavior in certain scenarios. In particular, if there's no value associated with the key, then the map constructs a default initialized instance of the value type. In this case, since it's a std::shared_ptr (as a type alias) that is the value type, this will construct a std::shared_pointer, and then assign over it (with objects that are quite large, or actively heap allocate this can be extremely undesirable). We also make the function take the region by value, as we can avoid a copy (and by extension with std::shared_ptr, a copy causes an atomic reference count increment), in certain scenarios when ownership isn't a concern (i.e. when ReserveGlobalRegion is called with an rvalue reference, then no copy at all occurs). So, it's more-or-less a "free" gain without many downsides. | ||||
| * | | | renderer_opengl/gl_global_cache: Append missing override specifiers | Lioncash | 2019-03-11 | 1 | -2/+2 |
| |/ / | | | | | | | | | | | | | Two of the functions here are overridden functions, so we can append these specifiers to make it explicit. | ||||
* | / | video_core/texture: Fix up sampler lod bias | ReinUsesLisp | 2019-03-13 | 1 | -1/+1 |
| |/ |/| | |||||
* | | Merge pull request #2215 from ReinUsesLisp/samplers | bunnei | 2019-03-12 | 3 | -64/+72 |
|\ \ | |/ |/| | gl_rasterizer: Encapsulate sampler queries into methods | ||||
| * | gl_rasterizer: Encapsulate sampler queries into methods | ReinUsesLisp | 2019-03-09 | 3 | -64/+72 |
| | | |||||
* | | Merge pull request #2147 from ReinUsesLisp/texture-clean | bunnei | 2019-03-10 | 7 | -527/+588 |
|\ \ | | | | | | | shader_ir: Remove "extras" from the MetaTexture | ||||
| * | | shader/decode: Remove extras from MetaTexture | ReinUsesLisp | 2019-02-26 | 4 | -40/+65 |
| | | | |||||
| * | | shader/decode: Split memory and texture instructions decoding | ReinUsesLisp | 2019-02-26 | 6 | -501/+537 |
| | | | |||||
* | | | Merge pull request #2143 from ReinUsesLisp/texview | bunnei | 2019-03-10 | 3 | -32/+42 |
|\ \ \ | | | | | | | | | gl_rasterizer_cache: Create texture views for array discrepancies | ||||
| * | | | gl_rasterizer_cache: Create texture views for array discrepancies | ReinUsesLisp | 2019-02-27 | 3 | -32/+42 |
| | | | | | | | | | | | | | | | | | | | | | | | | When a texture is sampled in a shader with a different array mode than the cached state, create a texture view and bind that to the shader instead. | ||||
* | | | | gl_rasterizer: Minor logger changes | ReinUsesLisp | 2019-03-09 | 1 | -19/+13 |
| |_|/ |/| | | |||||
* | | | Merge pull request #2209 from lioncash/reorder | bunnei | 2019-03-08 | 1 | -5/+2 |
|\ \ \ | | | | | | | | | video_core/gpu_thread: Silence a -Wreorder warning | ||||
| * | | | video_core/gpu_thread: Remove unimplemented WaitForIdle function prototype | Lioncash | 2019-03-07 | 1 | -3/+0 |
| | | | | | | | | | | | | | | | | | | | | This function didn't have a definition, so we can remove it to prevent accidentally attempting to use it. | ||||
| * | | | video_core/gpu_thread: Amend constructor initializer list order | Lioncash | 2019-03-07 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | Moves the data members to satisfy the order they're declared as in the constructor initializer list. Silences a -Wreorder warning. | ||||
* | | | | Merge pull request #2208 from lioncash/gpu | bunnei | 2019-03-08 | 3 | -3/+3 |
|\ \ \ \ | | | | | | | | | | | video_core/gpu: Make GPU's destructor virtual | ||||
| * | | | | video_core/gpu: Make GPU's destructor virtual | Lioncash | 2019-03-07 | 3 | -3/+3 |
| |/ / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Because of the recent separation of GPU functionality into sync/async variants, we need to mark the destructor virtual to provide proper destruction behavior, given we use the base class within the System class. Prior to this, it was undefined behavior whether or not the destructor in the derived classes would ever execute. | ||||
* | | | | Merge pull request #2191 from ReinUsesLisp/maxwell-to-vk | bunnei | 2019-03-08 | 4 | -3/+553 |
|\ \ \ \ | | | | | | | | | | | maxwell_to_vk: Initial implementation | ||||
| * | | | | maxwell_to_vk: Initial implementation | ReinUsesLisp | 2019-03-04 | 4 | -3/+553 |
| | | | | | |||||
* | | | | | dma_pusher: Store command_list_header by copy | ReinUsesLisp | 2019-03-08 | 1 | -1/+1 |
| |/ / / |/| | | | | | | | | | | | | | | | | | | | Instead of holding a reference that will get invalidated by dma_pushbuffer.pop(), hold it as a copy. This doesn't have any performance cost since CommandListHeader is 8 bytes long. | ||||
* | | | | Merge pull request #2055 from bunnei/gpu-thread | bunnei | 2019-03-07 | 12 | -26/+480 |
|\ \ \ \ | | | | | | | | | | | Asynchronous GPU command processing | ||||
| * | | | | gpu_thread: Fix deadlock with threading idle state check. | bunnei | 2019-03-07 | 2 | -7/+11 |
| | | | | | |||||
| * | | | | gpu_thread: (HACK) Ignore flush on FlushAndInvalidateRegion. | bunnei | 2019-03-07 | 1 | -3/+1 |
| | | | | | |||||
| * | | | | gpu: Always flush. | bunnei | 2019-03-07 | 2 | -13/+6 |
| | | | | | |||||
| * | | | | gpu: Refactor a/synchronous implementations into their own classes. | bunnei | 2019-03-07 | 7 | -63/+155 |
| | | | | | |||||
| * | | | | gpu: Move command processing to another thread. | bunnei | 2019-03-07 | 7 | -10/+353 |
| | | | | | |||||
| * | | | | gpu: Refactor command and swap buffers interface for asynch. | bunnei | 2019-03-07 | 2 | -3/+22 |
| | | | | | |||||
| * | | | | gpu: Refactor to take RendererBase instead of RasterizerInterface. | bunnei | 2019-03-07 | 2 | -17/+22 |
| | | | | | |||||
* | | | | | Merge pull request #2149 from ReinUsesLisp/decoders-style | bunnei | 2019-03-07 | 8 | -150/+183 |
|\ \ \ \ \ | | | | | | | | | | | | | gl_rasterizer_cache: Move format conversion functions to their own file | ||||
| * | | | | | gl_rasterizer_cache: Move format conversion to its own file | ReinUsesLisp | 2019-02-27 | 7 | -136/+175 |
| | | | | | | |||||
| * | | | | | decoders: Minor style changes | ReinUsesLisp | 2019-02-27 | 2 | -14/+8 |
| | |_|_|/ | |/| | | | |||||
* | | | | | Merge pull request #2190 from lioncash/ogl-global | bunnei | 2019-03-07 | 2 | -11/+17 |
|\ \ \ \ \ | |_|/ / / |/| | | | | core: Remove the global telemetry accessor function | ||||
| * | | | | video_core/renderer_opengl: Replace direct usage of global system object accessors | Lioncash | 2019-03-04 | 2 | -11/+17 |
| | | | | | | | | | | | | | | | | | | | | | | | | | We already pass a reference to the system object to the constructor of the renderer, so we can just use that instead of using the global accessor functions. | ||||
* | | | | | Merge pull request #2203 from lioncash/engines-include | bunnei | 2019-03-06 | 10 | -11/+11 |
|\ \ \ \ \ | | | | | | | | | | | | | video_core/engines: Remove unnecessary includes | ||||
| * | | | | | video_core/engines: Remove unnecessary includes | Lioncash | 2019-03-06 | 10 | -11/+11 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Removes a few unnecessary dependencies on core-related machinery, such as the core.h and memory.h, which reduces the amount of rebuilding necessary if those files change. This also uncovered some indirect dependencies within other source files. This also fixes those. | ||||
* | | | | | | video_core/surface: Remove obsolete TODO in PixelFormatFromRenderTargetFormat() | Lioncash | 2019-03-05 | 1 | -2/+0 |
|/ / / / / | | | | | | | | | | | | | | | | This isn't needed anymore, according to Hexagon | ||||
* | | | | | Merge pull request #2165 from ReinUsesLisp/unbind-tex | bunnei | 2019-03-04 | 2 | -14/+2 |
|\ \ \ \ \ | |_|_|/ / |/| | | | | gl_rasterizer: Unbind textures but don't apply the gl_state | ||||
| * | | | | gl_rasterizer: Remove texture unbinding after dispatching a draw call | ReinUsesLisp | 2019-02-28 | 1 | -12/+0 |
| | | | | | | | | | | | | | | | | | | | | | | | | | Unbinding was required when OpenGL delete operations didn't unbind a resource if it was bound. This is no longer needed and can be removed. | ||||
| * | | | | gl_state: Fixup multibind bug | ReinUsesLisp | 2019-02-28 | 1 | -2/+2 |
| | | | | | |||||
* | | | | | Merge pull request #2183 from ReinUsesLisp/vk-buffer-cache-clang | Mat M | 2019-03-02 | 1 | -3/+3 |
|\ \ \ \ \ | |_|/ / / |/| | | | | vk_buffer_cache: Fix clang-format | ||||
| * | | | | vk_buffer_cache: Fix clang-format | ReinUsesLisp | 2019-03-02 | 1 | -3/+3 |
| | | | | | |||||
* | | | | | Merge pull request #2182 from bunnei/my-wasted-friday | bunnei | 2019-03-02 | 1 | -1/+1 |
|\ \ \ \ \ | |/ / / / |/| | | | | fuck git for ruining my day, I will learn but I will not forgive | ||||
| * | | | | fuck git for ruining my day, I will learn but I will not forgive | bunnei | 2019-03-02 | 1 | -1/+1 |
| |/ / / | |||||
* / / / | vk_buffer_cache: Implement a buffer cache | ReinUsesLisp | 2019-03-01 | 3 | -0/+205 |
|/ / / | | | | | | | | | | | | | This buffer cache is just like OpenGL's buffer cache with some minor style changes. It uses VKStreamBuffer. | ||||
* | | | Merge pull request #2152 from ReinUsesLisp/vk-stream-buffer | bunnei | 2019-02-28 | 5 | -8/+172 |
|\ \ \ | | | | | | | | | vk_stream_buffer: Implement a stream buffer | ||||
| * | | | vk_stream_buffer: Remove copy code path | ReinUsesLisp | 2019-02-26 | 2 | -53/+18 |
| | | | | |||||
| * | | | vk_stream_buffer: Implement a stream buffer | ReinUsesLisp | 2019-02-24 | 3 | -1/+200 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This manages two kinds of streaming buffers: one for unified memory models and one for dedicated GPUs. The first one skips the copy from the staging buffer to the real buffer, since it creates an unified buffer. This implementation waits for all fences to finish their operation before "invalidating". This is suboptimal since it should allocate another buffer or start searching from the beginning. There is room for improvement here. This could also handle AMD's "pinned" memory (a heap with 256 MiB) that seems to be designed for buffer streaming. | ||||
| * | | | vk_resource_manager: Minor VKFenceWatch changes | ReinUsesLisp | 2019-02-24 | 2 | -7/+7 |
| | | | | |||||
* | | | | Merge pull request #2121 from FernandoS27/texception2 | bunnei | 2019-02-28 | 4 | -16/+213 |
|\ \ \ \ | | | | | | | | | | | Improve the Accuracy of the Rasterizer Cache through a Texception Pass | ||||
| * | | | | Devirtualize Register/Unregister and use a wrapper instead. | Fernando Sahmkow | 2019-02-28 | 3 | -12/+8 |
| | | | | | |||||
| * | | | | Corrections and redesign. | Fernando Sahmkow | 2019-02-28 | 2 | -51/+51 |
| | | | | | |||||
| * | | | | Fix linux compile error. | Fernando Sahmkow | 2019-02-28 | 1 | -1/+1 |
| | | | | | |||||
| * | | | | Remove NotifyFrameBuffer as we are doing a texception pass every drawcall. | Fernando Sahmkow | 2019-02-28 | 2 | -25/+0 |
| | | | | | |||||
| * | | | | Remove certain optimizations that caused texception to fail in certain scenarios. | Fernando Sahmkow | 2019-02-28 | 3 | -24/+1 |
| | | | | | |||||
| * | | | | Bug fixes and formatting | Fernando Sahmkow | 2019-02-28 | 2 | -3/+4 |
| | | | | | |||||
| * | | | | rasterizer_cache_gl: Implement Texception Pass | Fernando Sahmkow | 2019-02-28 | 3 | -0/+51 |
| | | | | | |||||
| * | | | | rasterizer_cache_gl: Implement Partial Reinterpretation of Surfaces. | Fernando Sahmkow | 2019-02-28 | 2 | -0/+100 |
| | | | | | |||||
| * | | | | rasterizer_cache: mark reinterpreted surfaces and add ability to reload marked surfaces on next use. | Fernando Sahmkow | 2019-02-28 | 2 | -0/+78 |
| | | | | | |||||
| * | | | | rasterizer_cache_gl: Notify on framebuffer change | Fernando Sahmkow | 2019-02-28 | 2 | -4/+23 |
| | | | | | |||||
| * | | | | rasterizer_cache: Expose FlushObject to Child classes and allow redefining of Register and Unregister | Fernando Sahmkow | 2019-02-28 | 1 | -11/+11 |
| | | | | | |||||
* | | | | | Merge pull request #2172 from lioncash/reorder | bunnei | 2019-02-28 | 2 | -3/+3 |
|\ \ \ \ \ | |/ / / / |/| | | | | gl_rasterizer/vk_memory_manager: Silence -Wreorder warnings | ||||
| * | | | | vk_memory_manager: Reorder constructor initializer list in terms of member declaration order | Lioncash | 2019-02-27 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | Reorders members in the order that they would actually be initialized in. Silences a -Wreorder warning. | ||||
| * | | | | gl_rasterizer: Reorder constructor initializer list in terms of member declaration order | Lioncash | 2019-02-27 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | Orders the members in the order they would actually be initialized in. Silences a -Wreorder warning. | ||||
* | | | | | Merge pull request #2163 from ReinUsesLisp/bitset-dirty | bunnei | 2019-02-28 | 4 | -52/+51 |
|\ \ \ \ \ | |_|_|_|/ |/| | | | | maxwell_3d: Use std::bitset to manage dirty flags | ||||
| * | | | | maxwell_3d: Use std::bitset to manage dirty flags | ReinUsesLisp | 2019-02-26 | 4 | -52/+51 |
| | |_|/ | |/| | | |||||
* | | | | Merge pull request #2167 from lioncash/namespace | bunnei | 2019-02-27 | 9 | -23/+23 |
|\ \ \ \ | | | | | | | | | | | common: Move Quaternion, Rectangle, Vec2, Vec3, and Vec4 into the Common namespace | ||||
| * | | | | common/math_util: Move contents into the Common namespace | Lioncash | 2019-02-27 | 9 | -23/+23 |
| | |/ / | |/| | | | | | | | | | | | | | | These types are within the common library, so they should be within the Common namespace. | ||||
* / | | | gl_shader_disk_cache: Remove #pragma once from cpp file | Lioncash | 2019-02-27 | 1 | -2/+0 |
|/ / / | | | | | | | | | | This is only necessary in headers. Silences a warning with clang. | ||||
* / / | renderer_opengl: Update pixel format tracking | ReinUsesLisp | 2019-02-26 | 1 | -0/+1 |
|/ / | |||||
* | | shader/track: Resolve variable shadowing warnings | Lioncash | 2019-02-25 | 1 | -5/+5 |
| | | |||||
* | | Merge pull request #2118 from FernandoS27/ipa-improve | bunnei | 2019-02-25 | 6 | -38/+74 |
|\ \ | | | | | | | shader_decompiler: Improve Accuracy of Attribute Interpolation. | ||||
| * | | shader_decompiler: Improve Accuracy of Attribute Interpolation. | Fernando Sahmkow | 2019-02-14 | 6 | -38/+74 |
| | | | |||||
* | | | Merge pull request #2119 from FernandoS27/fix-copy | bunnei | 2019-02-25 | 1 | -1/+5 |
|\ \ \ | | | | | | | | | rasterizer_cache_gl: Only do fast layered copy on the same format. | ||||
| * | | | rasterizer_cache_gl: Only do fast layered copy on the same format. As | Fernando Sahmkow | 2019-02-13 | 1 | -1/+5 |
| |/ / | | | | | | | | | | glCopyImageSubData does not support different formats. | ||||
* | | | Merge pull request #2139 from degasus/dma_pusher | bunnei | 2019-02-24 | 2 | -28/+34 |
|\ \ \ | |_|/ |/| | | video_core/dma_pusher: The full list of headers at once. | ||||
| * | | video_core/dma_pusher: Simplyfy Step() logic. | Markus Wick | 2019-02-19 | 2 | -81/+77 |
| | | | | | | | | | | | | | | | As fetching command list headers and and the list of command headers is a fixed 1:1 relation now, they can be implemented within a single call. This cleans up the Step() logic quite a bit. | ||||
| * | | video_core/dma_pusher: The full list of headers at once. | Markus Wick | 2019-02-19 | 2 | -48/+58 |
| | | | | | | | | | | | | | | | Fetching every u32 from memory leads to a big overhead. So let's fetch all of them as a block if possible. This reduces the Memory::* calls by the dma_pusher by a factor of 10. | ||||
* | | | Merge pull request #2146 from ReinUsesLisp/vulkan-scheduler | bunnei | 2019-02-24 | 3 | -1/+132 |
|\ \ \ | | | | | | | | | vk_scheduler: Implement a scheduler | ||||
| * | | | vk_scheduler: Implement a scheduler | ReinUsesLisp | 2019-02-22 | 3 | -1/+132 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The scheduler abstracts command buffer and fence management with an interface that's able to do OpenGL-like operations on Vulkan command buffers. It returns by value a command buffer and fence that have to be used for subsequent operations until Flush or Finish is executed, after that the current execution context (the pair of command buffers and fences) gets invalidated a new one must be fetched. Thankfully validation layers will quickly detect if this is skipped throwing an error due to modifications to a sent command buffer. | ||||
* | | | | Merge pull request #2150 from ReinUsesLisp/fixup-layer-swizzle | bunnei | 2019-02-24 | 1 | -1/+1 |
|\ \ \ \ | | | | | | | | | | | gl_rasterizer_cache: Fixup parameter order in layered swizzle | ||||
| * | | | | gl_rasterizer_cache: Fixup parameter order in layered swizzle | ReinUsesLisp | 2019-02-24 | 1 | -1/+1 |
| |/ / / | |||||
* / / / | vk_memory_manager: Fixup commit interval allocation | ReinUsesLisp | 2019-02-24 | 1 | -2/+1 |
|/ / / | | | | | | | | | | | | | VKMemoryCommitImpl was using as the end of its interval "begin + end". That ended up wasting memory. | ||||
* | | | Merge pull request #2138 from ReinUsesLisp/vulkan-memory-manager | bunnei | 2019-02-22 | 3 | -0/+342 |
|\ \ \ | | | | | | | | | vk_memory_manager: Implement memory manager | ||||
| * | | | vk_memory_manager: Implement memory manager | ReinUsesLisp | 2019-02-19 | 3 | -0/+342 |
| |/ / | | | | | | | | | | | | | A memory manager object handles the memory allocations for a device. It allocates chunks of Vulkan memory objects and then suballocates. | ||||
* | | | Merge pull request #2125 from ReinUsesLisp/fixup-glstate | bunnei | 2019-02-21 | 1 | -83/+61 |
|\ \ \ | | | | | | | | | gl_state: Synchronize gl_state even when state is disabled | ||||
| * | | | gl_state: Synchronize gl_state even when state is disabled | ReinUsesLisp | 2019-02-15 | 1 | -83/+61 |
| | | | | | | | | | | | | | | | | | | | | | | | | There are some potential edge cases where gl_state may fail to track the state if a related state changes while the toggle is disabled or it didn't change. This addresses that. | ||||
* | | | | Merge pull request #2130 from lioncash/system_engine | bunnei | 2019-02-21 | 8 | -22/+48 |
|\ \ \ \ | |_|/ / |/| | | | video_core: Remove usages of System::GetInstance() within the engines | ||||
| * | | | video_core: Remove usages of System::GetInstance() within the engines | Lioncash | 2019-02-16 | 8 | -22/+48 |
| | | | | | | | | | | | | | | | | | | | | Avoids the use of the global accessor in favor of explicitly making the system a dependency within the interface. | ||||
* | | | | Merge pull request #2122 from ReinUsesLisp/vulkan-resource-manager | bunnei | 2019-02-19 | 3 | -1/+468 |
|\ \ \ \ | | | | | | | | | | | vk_resource_manager: Implement fence and command buffer allocator | ||||
| * | | | | vk_resource_manager: Implement a command buffer pool with VKFencedPool | ReinUsesLisp | 2019-02-14 | 2 | -1/+59 |
| | | | | | |||||
| * | | | | vk_resource_manager: Add VKFencedPool interface | ReinUsesLisp | 2019-02-14 | 2 | -0/+83 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Handles a pool of resources protected by fences. Manages resource overflow allocating more resources. This class is intended to be used through inheritance. | ||||
| * | | | | vk_resource_manager: Implement VKResourceManager and fence allocator | ReinUsesLisp | 2019-02-14 | 2 | -0/+85 |
| | | | | | | | | | | | | | | | | | | | | | | | | | CommitFence iterates a pool of fences until one is found. If all fences are being used at the same time, allocate more. | ||||
| * | | | | vk_resource_manager: Implement VKFenceWatch | ReinUsesLisp | 2019-02-14 | 2 | -0/+68 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | A fence watch is used to keep track of the usage of a fence and protect a resource or set of resources without having to inherit from their handlers. | ||||
| * | | | | vk_resource_manager: Implement VKFence | ReinUsesLisp | 2019-02-14 | 2 | -0/+131 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Fences take ownership of objects, protecting them from GPU-side or driver-side concurrent access. They must be commited from the resource manager. Their usage flow is: commit the fence from the resource manager, protect resources with it and use them, send the fence to an execution queue and Wait for it if needed and then call Release. Used resources will automatically be signaled when they are free to be reused. | ||||
| * | | | | vk_resource_manager: Add VKResource interface | ReinUsesLisp | 2019-02-14 | 3 | -1/+43 |
| | | | | | | | | | | | | | | | | | | | | | | | | | VKResource is an interface that gets signaled by a fence when it is free to be reused. | ||||
* | | | | | Merge pull request #2127 from FearlessTobi/fix-screenshot-srgb | bunnei | 2019-02-16 | 1 | -1/+2 |
|\ \ \ \ \ | |_|/ / / |/| | | | | renderer_opengl: respect the sRGB colorspace for the screenshot feature | ||||
| * | | | | renderer_opengl: respect the sRGB colorspace for the screenshot feature | fearlessTobi | 2019-02-15 | 1 | -1/+2 |
| | |/ / | |/| | | | | | | | | | | | | | | Previously, we were completely ignoring for screenshots whether the game uses RGB or sRGB. This resulted in screenshot colors that looked off for some titles. | ||||
* | | | | Merge pull request #2123 from lioncash/coretiming-global | James Rowe | 2019-02-16 | 3 | -3/+4 |
|\ \ \ \ | |/ / / |/| | | | core_timing: De-globalize core_timing facilities | ||||
| * | | | core_timing: Convert core timing into a class | Lioncash | 2019-02-16 | 3 | -3/+4 |
| |/ / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Gets rid of the largest set of mutable global state within the core. This also paves a way for eliminating usages of GetInstance() on the System class as a follow-up. Note that no behavioral changes have been made, and this simply extracts the functionality into a class. This also has the benefit of making dependencies on the core timing functionality explicit within the relevant interfaces. | ||||
* | | | Merge pull request #2112 from lioncash/shadowing | bunnei | 2019-02-15 | 1 | -7/+13 |
|\ \ \ | | | | | | | | | gl_rasterizer_cache: Get rid of variable shadowing | ||||
| * | | | gl_rasterizer_cache: Remove unnecessary newline | Lioncash | 2019-02-12 | 1 | -2/+0 |
| | | | | |||||
| * | | | gl_rasterizer_cache: Get rid of variable shadowing | Lioncash | 2019-02-12 | 1 | -6/+14 |
| | | | | | | | | | | | | | | | | | | | | Avoids shadowing the members of the struct itself, which results in a -Wshadow warning. | ||||
* | | | | Merge pull request #2111 from ReinUsesLisp/fetch-fix | bunnei | 2019-02-15 | 2 | -22/+35 |
|\ \ \ \ | |_|/ / |/| | | | gl_shader_decompiler: Re-implement TLDS lod | ||||
| * | | | gl_shader_decompiler: Re-implement TLDS lod | ReinUsesLisp | 2019-02-12 | 2 | -22/+35 |
| | | | | |||||
* | | | | Merge pull request #2113 from ReinUsesLisp/vulkan-base | bunnei | 2019-02-14 | 4 | -0/+402 |
|\ \ \ \ | |_|_|/ |/| | | | vulkan: Add dependencies and device abstraction | ||||
| * | | | vk_device: Abstract device handling into a class | ReinUsesLisp | 2019-02-13 | 3 | -1/+351 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | VKDevice contains all the data required to manage and initialize a physical device. Its intention is to be passed across Vulkan objects to query device-specific data (for example the logical device and the dispatch loader). | ||||
| * | | | renderer_vulkan: Add declarations file | ReinUsesLisp | 2019-02-12 | 2 | -0/+52 |
| |/ / | | | | | | | | | | | | | | | | | | | This file is intended to be included instead of vulkan/vulkan.hpp. It includes declarations of unique handlers using a dynamic dispatcher instead of a static one (which would require linking to a Vulkan library). | ||||
* | | | Merge pull request #2099 from greggameplayer/BGRA8-Framebuffer-Real | bunnei | 2019-02-13 | 3 | -0/+4 |
|\ \ \ | | | | | | | | | Implement BGRA8 framebuffer format | ||||
| * | | | Implement BGRA8 framebuffer format | greggameplayer | 2019-02-09 | 3 | -0/+4 |
| | |/ | |/| | |||||
* | | | renderer_opengl: Remove reference to global system instance | Lioncash | 2019-02-13 | 1 | -3/+3 |
| | | | | | | | | | | | | | | | | | | We already store a reference to the system instance that the renderer is created with, so we don't need to refer to the system instance via Core::System::GetInstance() | ||||
* | | | Merge pull request #2110 from lioncash/namespace | bunnei | 2019-02-13 | 3 | -3/+3 |
|\ \ \ | | | | | | | | | core_timing: Rename CoreTiming namespace to Core::Timing | ||||
| * | | | core_timing: Rename CoreTiming namespace to Core::Timing | Lioncash | 2019-02-12 | 3 | -3/+3 |
| |/ / | | | | | | | | | | | | | | | | Places all of the timing-related functionality under the existing Core namespace to keep things consistent, rather than having the timing utilities sitting in its own completely separate namespace. | ||||
* | | | Merge pull request #2104 from ReinUsesLisp/compute-assert | bunnei | 2019-02-13 | 6 | -52/+59 |
|\ \ \ | |_|/ |/| | | kepler_compute: Fixup assert and rename the engine | ||||
| * | | kepler_compute: Fixup assert and rename engines | ReinUsesLisp | 2019-02-10 | 6 | -52/+59 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | When I originally added the compute assert I used the wrong documentation. This addresses that. The dispatch register was tested with homebrew against hardware and is triggered by some games (e.g. Super Mario Odyssey). What exactly is missing to get a valid program bound by this engine requires more investigation. | ||||
* | | | Merge pull request #2108 from FernandoS27/fix-cc | bunnei | 2019-02-12 | 1 | -2/+2 |
|\ \ \ | | | | | | | | | Fix incorrect value for CC bit in IADD | ||||
| * | | | Fix incorrect value for CC bit in IADD | Fernando Sahmkow | 2019-02-11 | 1 | -2/+2 |
| | |/ | |/| | |||||
* | | | Merge pull request #2109 from FernandoS27/fix-f2i | bunnei | 2019-02-12 | 2 | -4/+4 |
|\ \ \ | | | | | | | | | Corrected F2I None mode to RoundEven. | ||||
| * | | | Corrected F2I None mode to RoundEven. | Fernando Sahmkow | 2019-02-11 | 2 | -4/+4 |
| |/ / | |||||
* | | | Merge pull request #2068 from ReinUsesLisp/shader-cleanup-textures | bunnei | 2019-02-12 | 3 | -153/+123 |
|\ \ \ | |/ / |/| | | shader_ir: Clean texture management code | ||||
| * | | shader_ir: Remove F4 prefix to texture operations | ReinUsesLisp | 2019-02-07 | 3 | -26/+25 |
| | | | | | | | | | | | | | | | | | | This was originally included because texture operations returned a vec4. These operations now return a single float and the F4 prefix doesn't mean anything. | ||||
| * | | shader_ir: Clean texture management code | ReinUsesLisp | 2019-02-07 | 3 | -133/+104 |
| |/ | | | | | | | | | | | | | | | | | Previous code relied on GLSL parameter order (something that's always ill-formed on an IR design). This approach passes spatial coordiantes through operation nodes and array and depth compare values in the the texture metadata. It still contains an "extra" vector containing generic nodes for bias and component index (for example) which is still a bit ill-formed but it should be better than the previous approach. | ||||
* | | Merge pull request #1904 from bunnei/better-fermi-copy | bunnei | 2019-02-09 | 7 | -72/+206 |
|\ \ | | | | | | | gl_rasterizer: Implement a more accurate fermi 2D copy. | ||||
| * | | gl_rasterizer_cache: Mark surface copy destinations as modified. | bunnei | 2019-02-07 | 2 | -4/+18 |
| | | | |||||
| * | | gl_rasterizer: Implement a more accurate fermi 2D copy. | bunnei | 2019-02-07 | 7 | -68/+188 |
| | | | | | | | | | | | | - This is a blit, use the blit registers. | ||||
* | | | Implement linear textures (#2089) | Fernando Sahmkow | 2019-02-09 | 2 | -5/+39 |
| | | | |||||
* | | | gl_rasterizer_cache: Fixup texture view parameters | ReinUsesLisp | 2019-02-08 | 1 | -2/+2 |
| |/ |/| | | | | | | | These parameters were declared as constants and passed to glTextureView but then they were removed on a rabase. This addresses that mistake. | ||||
* | | Merge pull request #2083 from ReinUsesLisp/shader-ir-cbuf-tracking | bunnei | 2019-02-07 | 30 | -127/+141 |
|\ \ | |/ |/| | shader/track: Add a more permissive global memory tracking | ||||
| * | shader/track: Search inside of conditional nodes | ReinUsesLisp | 2019-02-03 | 1 | -0/+11 |
| | | | | | | | | | | | | Some games search conditionally use global memory instructions. This allows the heuristic to search inside conditional nodes for the source constant buffer. | ||||
| * | shader_ir: Rename BasicBlock to NodeBlock | ReinUsesLisp | 2019-02-03 | 30 | -122/+120 |
| | | | | | | | | It's not always used as a basic block. Rename it for consistency. | ||||
| * | shader_ir: Pass decoded nodes as a whole instead of per basic blocks | ReinUsesLisp | 2019-02-03 | 27 | -57/+62 |
| | | | | | | | | | | | | | | | | | | Some games call LDG at the top of a basic block, making the tracking heuristic to fail. This commit lets the heuristic the decoded nodes as a whole instead of per basic blocks. This may lead to some false positives but allows it the heuristic to track cases it previously couldn't. | ||||
* | | gl_shader_disk_cache: Check LZ4 size limit | Frederic L | 2019-02-07 | 1 | -0/+4 |
| | | | | | | Co-Authored-By: ReinUsesLisp <reinuseslisp@airmail.cc> | ||||
* | | gl_shader_disk_cache: Consider compressed size zero as an error | Frederic L | 2019-02-07 | 1 | -2/+2 |
| | | | | | | Co-Authored-By: ReinUsesLisp <reinuseslisp@airmail.cc> | ||||
* | | gl_shader_disk_cache: Use unordered containers | ReinUsesLisp | 2019-02-07 | 4 | -56/+64 |
| | | |||||
* | | gl_shader_cache: Fixup GLSL unique identifiers | ReinUsesLisp | 2019-02-07 | 2 | -3/+3 |
| | | |||||
* | | gl_shader_cache: Link loading screen with disk shader cache load | ReinUsesLisp | 2019-02-07 | 5 | -9/+40 |
| | | |||||
* | | gl_shader_cache: Set GL_PROGRAM_SEPARABLE to dumped shaders | ReinUsesLisp | 2019-02-07 | 1 | -0/+1 |
| | | | | | | | | | | | | i965 (and probably all mesa drivers) require GL_PROGRAM_SEPARABLE when using glProgramBinary. This is probably required by the standard but it's ignored by permisive proprietary drivers. | ||||
* | | gl_shader_disk_cache: Pass core system as argument and guard against games without title ids | ReinUsesLisp | 2019-02-07 | 10 | -17/+57 |
| | | |||||
* | | gl_shader_disk_cache: Guard reads and writes against failure | ReinUsesLisp | 2019-02-07 | 2 | -216/+339 |
| | | |||||
* | | gl_shader_disk_cache: Address miscellaneous feedback | ReinUsesLisp | 2019-02-07 | 5 | -43/+57 |
| | | |||||
* | | gl_shader_disk_cache: Pass return values returning instead of by parameters | ReinUsesLisp | 2019-02-07 | 3 | -39/+37 |
| | | |||||
* | | gl_shader_disk_cache: Compress program binaries using LZ4 | ReinUsesLisp | 2019-02-07 | 1 | -7/+28 |
| | | |||||
* | | gl_shader_disk_cache: Compress GLSL code using LZ4 | ReinUsesLisp | 2019-02-07 | 2 | -6/+57 |
| | | |||||
* | | gl_shader_disk_cache: Save GLSL and entries into the precompiled file | ReinUsesLisp | 2019-02-07 | 9 | -135/+234 |
| | | |||||
* | | settings: Hide shader cache behind a setting | ReinUsesLisp | 2019-02-07 | 1 | -0/+21 |
| | | |||||
* | | gl_shader_disk_cache: Invalidate shader cache changes with CMake hash | ReinUsesLisp | 2019-02-07 | 1 | -7/+16 |
| | | |||||
* | | gl_shader_cache: Refactor to support disk shader cache | ReinUsesLisp | 2019-02-07 | 2 | -121/+388 |
| | | |||||
* | | gl_shader_disk_cache: Add transferable cache invalidation | ReinUsesLisp | 2019-02-07 | 2 | -0/+8 |
| | | |||||
* | | gl_shader_disk_cache: Add precompiled load | ReinUsesLisp | 2019-02-07 | 2 | -0/+45 |
| | | |||||
* | | gl_shader_disk_cache: Add precompiled save | ReinUsesLisp | 2019-02-07 | 2 | -0/+57 |
| | | |||||
* | | gl_shader_disk_cache: Add transferable load | ReinUsesLisp | 2019-02-07 | 2 | -0/+56 |
| | | |||||
* | | gl_shader_disk_cache: Add transferable stores | ReinUsesLisp | 2019-02-07 | 2 | -0/+194 |
| | | |||||
* | | gl_shader_disk_cache: Add ShaderDiskCacheOpenGL class and helpers | ReinUsesLisp | 2019-02-07 | 2 | -0/+76 |
| | | |||||
* | | gl_shader_disk_cache: Add file and move BaseBindings declaration | ReinUsesLisp | 2019-02-07 | 4 | -10/+58 |
| | | |||||
* | | gl_shader_decompiler: Remove name entries | ReinUsesLisp | 2019-02-07 | 2 | -28/+10 |
| | | |||||
* | | gl_shader_util: Add parameter to handle retrievable programs | ReinUsesLisp | 2019-02-07 | 3 | -6/+10 |
| | | |||||
* | | rasterizer_interface: Add disk cache entry for the rasterizer | ReinUsesLisp | 2019-02-07 | 5 | -0/+14 |
| | | |||||
* | | shader_decode: Implement LDG and basic cbuf tracking | ReinUsesLisp | 2019-02-07 | 1 | -0/+33 |
| | | |||||
* | | Merge pull request #2042 from ReinUsesLisp/nouveau-tex | bunnei | 2019-02-07 | 11 | -79/+82 |
|\ \ | | | | | | | maxwell_3d: Allow texture handles with TIC id zero | ||||
| * | | video_core: Assert on invalid GPU to CPU address queries | ReinUsesLisp | 2019-02-03 | 8 | -47/+67 |
| | | | |||||
| * | | maxwell_3d: Allow sampler handles with TSC id zero | ReinUsesLisp | 2019-02-03 | 1 | -10/+6 |
| | | | |||||
| * | | maxwell_3d: Allow texture handles with TIC id zero | ReinUsesLisp | 2019-02-03 | 3 | -21/+7 |
| | | | | | | | | | | | | | | | Also remove "enabled" field from Tegra::Texture::FullTextureInfo because it would become unused. | ||||
| * | | memory_manager: Check for reserved page status | ReinUsesLisp | 2019-02-03 | 1 | -1/+2 |
| |/ | |||||
* | | Merge pull request #2071 from ReinUsesLisp/dsa-texture | bunnei | 2019-02-07 | 8 | -216/+153 |
|\ \ | | | | | | | gl_rasterizer: Use DSA for textures and move swizzling to texture state | ||||
| * | | gl_rasterizer_cache: Fixup test clause | ReinUsesLisp | 2019-01-30 | 1 | -6/+5 |
| | | | |||||
| * | | gl_rasterizer_cache: Guard clause swizzle testing | Mat M | 2019-01-30 | 1 | -1/+3 |
| | | | | | | | | | Co-Authored-By: ReinUsesLisp <reinuseslisp@airmail.cc> | ||||
| * | | gl_state: Remove texture target tracking | ReinUsesLisp | 2019-01-30 | 2 | -5/+0 |
| | | | |||||
| * | | gl_rasterizer_cache: Move swizzling to textures instead of state | ReinUsesLisp | 2019-01-30 | 6 | -28/+35 |
| | | | |||||
| * | | gl_state: Use DSA and multi bind to update texture bindings | ReinUsesLisp | 2019-01-30 | 1 | -8/+22 |
| | | | |||||
| * | | gl_rasterizer: Use DSA for textures | ReinUsesLisp | 2019-01-30 | 5 | -185/+105 |
| | | | |||||
* | | | Merge pull request #2085 from ReinUsesLisp/cube-minus-one | bunnei | 2019-02-05 | 1 | -1/+1 |
|\ \ \ | | | | | | | | | video_core/texture: Fix BitField size for depth_minus_one | ||||
| * | | | video_core/texture: Fix BitField size for depth_minus_one | ReinUsesLisp | 2019-02-05 | 1 | -1/+1 |
| | | | | |||||
* | | | | Merge pull request #2081 from ReinUsesLisp/lmem-64 | bunnei | 2019-02-05 | 2 | -15/+46 |
|\ \ \ \ | |/ / / |/| | | | shader_ir/memory: Add LD_L 64 bits loads | ||||
| * | | | shader_ir/memory: Add ST_L 64 and 128 bits stores | ReinUsesLisp | 2019-02-03 | 1 | -3/+11 |
| | | | | |||||
| * | | | shader_ir/memory: Add LD_L 128 bits loads | ReinUsesLisp | 2019-02-03 | 1 | -7/+19 |
| | | | | |||||
| * | | | shader_bytecode: Rename BytesN enums to BitsN | ReinUsesLisp | 2019-02-03 | 2 | -7/+7 |
| | | | | |||||
| * | | | shader_ir/memory: Add LD_L 64 bits loads | ReinUsesLisp | 2019-02-03 | 1 | -6/+17 |
| | |/ | |/| | |||||
* | | | Merge pull request #2082 from FernandoS27/txq-stl | bunnei | 2019-02-05 | 2 | -6/+13 |
|\ \ \ | |/ / |/| | | Fix TXQ not using the component mask. | ||||
| * | | Update src/video_core/engines/shader_bytecode.h | Mat M | 2019-02-04 | 1 | -1/+1 |
| | | | | | | | | | Co-Authored-By: FernandoS27 <fsahmkow27@gmail.com> | ||||
| * | | Fix TXQ not using the component mask. | Fernando Sahmkow | 2019-02-03 | 2 | -6/+13 |
| | | | |||||
* | | | Merge pull request #2074 from ReinUsesLisp/shader-ir-unify-offset | bunnei | 2019-02-01 | 17 | -25/+36 |
|\ \ \ | | | | | | | | | shader_ir: Unify constant buffer offset values | ||||
| * | | | shader_ir: Unify constant buffer offset values | ReinUsesLisp | 2019-01-30 | 17 | -25/+36 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | Constant buffer values on the shader IR were using different offsets if the access direct or indirect. cbuf34 has a non-multiplied offset while cbuf36 does. On shader decoding this commit multiplies it by four on cbuf34 queries. | ||||
* | | | | Merge pull request #2067 from ReinUsesLisp/workaround-fb | bunnei | 2019-02-01 | 2 | -14/+19 |
|\ \ \ \ | | | | | | | | | | | gl_rasterizer: Workaround invalid zeta clears | ||||
| * | | | | gl_rasterizer: Workaround invalid zeta clears | ReinUsesLisp | 2019-01-30 | 2 | -14/+19 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Some games (like Xenoblade Chronicles 2) clear both depth and stencil buffers while there's a depth-only texture attached (e.g. D16 Unorm). This commit reads the zeta format of the bound surface on ConfigureFramebuffers and returns if depth and/or stencil attachments were set. This is ignored on DrawArrays but on Clear it's used to just clear those attachments, bypassing an OpenGL error. | ||||
* | | | | | rasterizer_interface: Remove unused AccelerateFill operation | ReinUsesLisp | 2019-02-01 | 3 | -11/+0 |
| | | | | | |||||
* | | | | | video_core: Remove unused Fill surface type | ReinUsesLisp | 2019-02-01 | 2 | -6/+1 |
| |_|/ / |/| | | | |||||
* | | | | Merge pull request #2076 from lioncash/enc | Hexagon12 | 2019-01-30 | 1 | -1/+1 |
|\ \ \ \ | |_|_|/ |/| | | | video_core/dma_pusher: Silence C4828 warnings | ||||
| * | | | video_core/dma_pusher: Silence C4828 warnings | Lioncash | 2019-01-30 | 1 | -1/+1 |
| | |/ | |/| | | | | | | | | | | | | | | | | | | | | | | This was previously causing: warning C4828: The file contains a character starting at offset 0xa33 that is illegal in the current source character set (codepage 65001). warnings on Windows when compiling yuzu. | ||||
* | | | Merge pull request #1485 from FernandoS27/render-info | bunnei | 2019-01-30 | 2 | -2/+57 |
|\ \ \ | |/ / |/| | | Add more info into textures' object labels | ||||
| * | | Add more info into textures' object labels | FernandoS27 | 2018-12-09 | 2 | -2/+57 |
| | | | |||||
* | | | Merge pull request #2070 from ReinUsesLisp/cubearray-view | bunnei | 2019-01-30 | 4 | -3/+28 |
|\ \ \ | | | | | | | | | gl_shader_cache: Fix texture view for cubemaps as cubemap arrays | ||||
| * | | | gl_shader_cache: Fix texture view for cubemaps as cubemap arrays | ReinUsesLisp | 2019-01-30 | 4 | -3/+28 |
| | |/ | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Cubemaps are considered layered and to create a texture view the texture mustn't be a layered texture, resulting in cubemaps being bound as cubemap arrays. To fix this issue this commit introduces an extra surface parameter called "is_array" and uses this to query for texture view creation. Now that texture views for cubemaps are actually being created, this also fixes the number of layers created for the texture view (since they have to be 6 to create a texture view of cubemaps). | ||||
* | | | gl_shader_cache: Use explicit bindings | ReinUsesLisp | 2019-01-30 | 7 | -249/+194 |
| | | | |||||
* | | | gl_rasterizer: Implement global memory management | ReinUsesLisp | 2019-01-30 | 6 | -4/+140 |
| | | | |||||
* | | | shader_decode: Implement LDG and basic cbuf tracking | ReinUsesLisp | 2019-01-30 | 7 | -10/+240 |
| | | | |||||
* | | | video_core/GPU Implemented the GPU PFIFO puller semaphore operations. (#1908) | Kevin | 2019-01-30 | 2 | -12/+242 |
|/ / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * Implemented the puller semaphore operations. * Nit: Fix 2 style issues * Nit: Add Break to default case. * Fix style. * Update for comments. Added ReferenceCount method * Forgot to remove GpuSmaphoreAddress union. * Fix the clang-format issues. * More clang formatting. * two more white spaces for the Clang formatting. * Move puller members into the regs union * Updated to use Memory::WriteBlock instead of Memory::Write* * Fix clang style issues * White space clang error * Removing unused funcitons and other pr comment * Removing unused funcitons and other pr comment * More union magic for setting regs value. * union magic refcnt as well * Remove local var * Set up the regs and regs_assert_positions up properly * Fix clang error | ||||
* | | shader/shader_ir: Amend three comment typos | Lioncash | 2019-01-28 | 1 | -3/+3 |
| | | | | | | | | | | Given we're in the area, these are three trivial typos that can be corrected. | ||||
* | | shader/shader_ir: Amend constructor initializer ordering for AbufNode | Lioncash | 2019-01-28 | 1 | -2/+2 |
| | | | | | | | | | | Orders the class members in the same order that they would actually be initialized in. Gets rid of two compiler warnings. | ||||
* | | shader/decode: Avoid a pessimizing std::move within DecodeRange() | Lioncash | 2019-01-28 | 1 | -1/+1 |
| | | | | | | | | | | | | std::moveing a local variable in a return statement has the potential to prevent copy elision from occurring, so this can just be converted into a regular return. | ||||
* | | video_core: Silent implicit conversion warning | ReinUsesLisp | 2019-01-26 | 1 | -3/+4 |
| | | |||||
* | | Merge pull request #1927 from ReinUsesLisp/shader-ir | bunnei | 2019-01-26 | 39 | -3808/+5497 |
|\ \ | | | | | | | video_core: Replace gl_shader_decompiler with an IR based decompiler | ||||
| * | | shader_ir: Fixup clang build | ReinUsesLisp | 2019-01-16 | 1 | -4/+6 |
| | | | |||||
| * | | gl_shader_decompiler: replace std::get<> with std::get_if<> for macOS compatibility | ReinUsesLisp | 2019-01-15 | 1 | -44/+58 |
| | | | |||||
| * | | gl_shader_decompiler: Inline textureGather component | ReinUsesLisp | 2019-01-15 | 1 | -15/+16 |
| | | | |||||
| * | | shader_decode: Fixup XMAD | ReinUsesLisp | 2019-01-15 | 1 | -1/+1 |
| | | | |||||
| * | | shader_ir: Pass to decoder functions basic block's code | ReinUsesLisp | 2019-01-15 | 27 | -82/+83 |
| | | | |||||
| * | | shader_decode: Improve zero flag implementation | ReinUsesLisp | 2019-01-15 | 15 | -75/+79 |
| | | | |||||
| * | | shader_ir: Remove composite primitives and use temporals instead | ReinUsesLisp | 2019-01-15 | 4 | -241/+224 |
| | | | |||||
| * | | gl_shader_decompiler: Fixup AssignCompositeHalf | ReinUsesLisp | 2019-01-15 | 1 | -1/+1 |
| | | | |||||
| * | | shader_decode: Use proper primitive names | ReinUsesLisp | 2019-01-15 | 4 | -25/+21 |
| | | | |||||
| * | | shader_decode: Use BitfieldExtract instead of shift + and | ReinUsesLisp | 2019-01-15 | 8 | -48/+37 |
| | | | |||||
| * | | shader_ir: Remove Ipa primitive | ReinUsesLisp | 2019-01-15 | 3 | -13/+2 |
| | | | |||||
| * | | gl_shader_decompiler: Use rasterizer's UBO size limit | ReinUsesLisp | 2019-01-15 | 1 | -1/+3 |
| | | | |||||
| * | | gl_shader_gen: Fixup code formatting | ReinUsesLisp | 2019-01-15 | 2 | -18/+22 |
| | | | |||||
| * | | video_core: Rename glsl_decompiler to gl_shader_decompiler | ReinUsesLisp | 2019-01-15 | 7 | -7/+7 |
| | | | |||||
| * | | shader_ir: Remove RZ and use Register::ZeroIndex instead | ReinUsesLisp | 2019-01-15 | 3 | -12/+16 |
| | | | |||||
| * | | shader_decode: Implement TEXS.F16 | ReinUsesLisp | 2019-01-15 | 3 | -15/+57 |
| | | | |||||
| * | | shader_decode: Fixup R2P | ReinUsesLisp | 2019-01-15 | 1 | -2/+3 |
| | | | |||||
| * | | glsl_decompiler: Fixup TLDS | ReinUsesLisp | 2019-01-15 | 1 | -1/+0 |
| | | | |||||
| * | | glsl_decompiler: Fixup geometry shaders | ReinUsesLisp | 2019-01-15 | 2 | -15/+17 |
| | | | |||||
| * | | shader_decode: Fixup WriteLogicOperation zero comparison | ReinUsesLisp | 2019-01-15 | 1 | -1/+1 |
| | | | |||||
| * | | glsl_decompiler: Fixup permissive member function declarations | ReinUsesLisp | 2019-01-15 | 1 | -133/+133 |
| | | | |||||
| * | | shader_decode: Fixup PSET | ReinUsesLisp | 2019-01-15 | 1 | -2/+3 |
| | | | |||||
| * | | shader_decode: Fixup clang-format | ReinUsesLisp | 2019-01-15 | 2 | -2/+4 |
| | | | |||||
| * | | video_core: Implement IR based geometry shaders | ReinUsesLisp | 2019-01-15 | 4 | -10/+102 |
| | | | |||||
| * | | shader_decode: Implement VMAD and VSETP | ReinUsesLisp | 2019-01-15 | 5 | -2/+129 |
| | | | |||||
| * | | shader_decode: Implement HSET2 | ReinUsesLisp | 2019-01-15 | 3 | -1/+50 |
| | | | |||||
| * | | shader_decode: Rework HSETP2 | ReinUsesLisp | 2019-01-15 | 4 | -47/+57 |
| | | | |||||
| * | | shader_decode: Implement R2P | ReinUsesLisp | 2019-01-15 | 1 | -1/+28 |
| | | | |||||
| * | | shader_decode: Implement CSETP | ReinUsesLisp | 2019-01-15 | 1 | -14/+37 |
| | | | |||||
| * | | shader_decode: Implement PSET | ReinUsesLisp | 2019-01-15 | 1 | -1/+16 |
| | | | |||||
| * | | shader_decode: Implement HFMA2 | ReinUsesLisp | 2019-01-15 | 4 | -5/+60 |
| | | | |||||
| * | | glsl_decompiler: Remove HNegate inlining | ReinUsesLisp | 2019-01-15 | 1 | -10/+0 |
| | | | |||||
| * | | shader_decode: Implement POPC | ReinUsesLisp | 2019-01-15 | 4 | -1/+22 |
| | | | |||||
| * | | shader_decode: Implement TLDS (untested) | ReinUsesLisp | 2019-01-15 | 3 | -10/+92 |
| | | | |||||
| * | | shader_decode: Update TLD4 reflecting #1862 changes | ReinUsesLisp | 2019-01-15 | 2 | -52/+52 |
| | | | |||||
| * | | shader_ir: Fixup TEX and TEXS and partially fix TLD4 decompiling | ReinUsesLisp | 2019-01-15 | 3 | -60/+72 |
| | | | |||||
| * | | shader_decode: Fixup FSET | ReinUsesLisp | 2019-01-15 | 1 | -2/+2 |
| | | | |||||
| * | | shader_decode: Implement IADD32I | ReinUsesLisp | 2019-01-15 | 1 | -0/+11 |
| | | | |||||
| * | | shader_decode: Fixup clang-format | ReinUsesLisp | 2019-01-15 | 1 | -1/+1 |
| | | | |||||
| * | | video_core: Return safe values after an assert hits | ReinUsesLisp | 2019-01-15 | 8 | -8/+19 |
| | | | |||||
| * | | shader_decode: Implement FFMA | ReinUsesLisp | 2019-01-15 | 1 | -1/+36 |
| | | | |||||
| * | | video_core: Address feedback | ReinUsesLisp | 2019-01-15 | 4 | -13/+16 |
| | | | |||||
| * | | shader_ir: Fixup file inclusions and clang-format | ReinUsesLisp | 2019-01-15 | 3 | -2/+2 |
| | | | |||||
| * | | shader_ir: Move comment node string | Mat M | 2019-01-15 | 1 | -2/+2 |
| | | | | | | | | | Co-Authored-By: ReinUsesLisp <reinuseslisp@airmail.cc> | ||||
| * | | shader_ir: Address feedback to avoid UB in bit casting | ReinUsesLisp | 2019-01-15 | 1 | -2/+4 |
| | | | |||||
| * | | shader_decode: Fixup clang-format | ReinUsesLisp | 2019-01-15 | 2 | -3/+2 |
| | | | |||||
| * | | shader_decode: Implement LEA | ReinUsesLisp | 2019-01-15 | 1 | -0/+55 |
| | | | |||||
| * | | shader_decode: Implement IADD3 | ReinUsesLisp | 2019-01-15 | 1 | -0/+61 |
| | | | |||||
| * | | shader_decode: Implement LOP3 | ReinUsesLisp | 2019-01-15 | 2 | -0/+62 |
| | | | |||||
| * | | shader_decode: Implement ST_L | ReinUsesLisp | 2019-01-15 | 1 | -0/+17 |
| | | | |||||
| * | | shader_decode: Implement LD_L | ReinUsesLisp | 2019-01-15 | 1 | -0/+18 |
| | | | |||||
| * | | shader_decode: Implement HSETP2 | ReinUsesLisp | 2019-01-15 | 1 | -1/+37 |
| | | | |||||
| * | | shader_decode: Implement HADD2 and HMUL2 | ReinUsesLisp | 2019-01-15 | 1 | -1/+48 |
| | | | |||||
| * | | shader_decode: Implement HADD2_IMM and HMUL2_IMM | ReinUsesLisp | 2019-01-15 | 1 | -1/+28 |
| | | | |||||
| * | | shader_decode: Implement MOV_SYS | ReinUsesLisp | 2019-01-15 | 1 | -0/+27 |
| | | | |||||
| * | | shader_decode: Implement IMNMX | ReinUsesLisp | 2019-01-15 | 1 | -0/+16 |
| | | | |||||
| * | | shader_decode: Implement F2F_C | ReinUsesLisp | 2019-01-15 | 1 | -2/+10 |
| | | | |||||
| * | | shader_decode: Implement I2I | ReinUsesLisp | 2019-01-15 | 1 | -0/+26 |
| | | | |||||
| * | | shader_decode: Implement BRA internal flag | ReinUsesLisp | 2019-01-15 | 1 | -4/+8 |
| | | | |||||
| * | | shader_decode: Implement ISCADD | ReinUsesLisp | 2019-01-15 | 1 | -0/+15 |
| | | | |||||
| * | | shader_decode: Implement XMAD | ReinUsesLisp | 2019-01-15 | 1 | -1/+85 |
| | | | |||||
| * | | shader_decode: Implement PBK and BRK | ReinUsesLisp | 2019-01-15 | 1 | -1/+22 |
| | | | |||||
| * | | shader_decode: Implement LOP | ReinUsesLisp | 2019-01-15 | 1 | -0/+15 |
| | | | |||||
| * | | shader_decode: Implement SEL | ReinUsesLisp | 2019-01-15 | 1 | -0/+8 |
| | | | |||||
| * | | shader_decode: Implement IADD | ReinUsesLisp | 2019-01-15 | 1 | -1/+28 |
| | | | |||||
| * | | shader_decode: Implement ISETP | ReinUsesLisp | 2019-01-15 | 1 | -1/+30 |
| | | | |||||
| * | | shader_decode: Implement BFI | ReinUsesLisp | 2019-01-15 | 1 | -1/+22 |
| | | | |||||
| * | | shader_decode: Implement ISET | ReinUsesLisp | 2019-01-15 | 1 | -1/+27 |
| | | | |||||
| * | | shader_decode: Implement LD_C | ReinUsesLisp | 2019-01-15 | 1 | -0/+31 |
| | | | |||||
| * | | shader_decode: Implement SHL | ReinUsesLisp | 2019-01-15 | 1 | -0/+8 |
| | | | |||||
| * | | shader_decode: Implement SHR | ReinUsesLisp | 2019-01-15 | 1 | -1/+26 |
| | | | |||||
| * | | shader_decode: Implement LOP32I | ReinUsesLisp | 2019-01-15 | 2 | -1/+72 |
| | | | |||||
| * | | shader_decode: Implement BFE | ReinUsesLisp | 2019-01-15 | 1 | -1/+25 |
| | | | |||||
| * | | shader_decode: Implement FSET | ReinUsesLisp | 2019-01-15 | 1 | -1/+36 |
| | | | |||||
| * | | shader_decode: Implement F2I | ReinUsesLisp | 2019-01-15 | 1 | -0/+37 |
| | | | |||||
| * | | shader_decode: Implement I2F | ReinUsesLisp | 2019-01-15 | 1 | -0/+23 |
| | | | |||||
| * | | shader_decode: Implement F2F | ReinUsesLisp | 2019-01-15 | 1 | -1/+37 |
| | | | |||||
| * | | shader_decode: Stub DEPBAR | ReinUsesLisp | 2019-01-15 | 1 | -0/+4 |
| | | | |||||
| * | | shader_decode: Implement SSY and SYNC | ReinUsesLisp | 2019-01-15 | 1 | -0/+19 |
| | | | |||||
| * | | shader_decode: Implement PSETP | ReinUsesLisp | 2019-01-15 | 1 | -1/+21 |
| | | | |||||
| * | | shader_decode: Implement TMML | ReinUsesLisp | 2019-01-15 | 1 | -3/+45 |
| | | | |||||
| * | | shader_decode: Implement TEX and TXQ | ReinUsesLisp | 2019-01-15 | 2 | -0/+223 |
| | | | |||||
| * | | shader_decode: Implement TEXS (F32) | ReinUsesLisp | 2019-01-15 | 2 | -0/+217 |
| | | | |||||
| * | | shader_decode: Implement FSETP | ReinUsesLisp | 2019-01-15 | 1 | -1/+33 |
| | | | |||||
| * | | shader_decode: Partially implement BRA | ReinUsesLisp | 2019-01-15 | 1 | -0/+12 |
| | | | |||||
| * | | shader_decode: Implement IPA | ReinUsesLisp | 2019-01-15 | 1 | -0/+12 |
| | | | |||||
| * | | shader_decode: Implement EXIT | ReinUsesLisp | 2019-01-15 | 1 | -1/+32 |
| | | | |||||
| * | | shader_decode: Implement ST_A | ReinUsesLisp | 2019-01-15 | 1 | -0/+30 |
| | | | |||||
| * | | shader_decode: Implement LD_A | ReinUsesLisp | 2019-01-15 | 1 | -1/+39 |
| | | | |||||
| * | | shader_decode: Implement FADD32I | ReinUsesLisp | 2019-01-15 | 1 | -0/+12 |
| | | | |||||
| * | | shader_decode: Implement FMUL32_IMM | ReinUsesLisp | 2019-01-15 | 1 | -0/+10 |
| | | | |||||
| * | | shader_decode: Implement MOV32_IMM | ReinUsesLisp | 2019-01-15 | 1 | -1/+9 |
| | | | |||||
| * | | shader_decode: Stub RRO_C, RRO_R and RRO_IMM | ReinUsesLisp | 2019-01-15 | 1 | -0/+9 |
| | | | |||||
| * | | shader_decode: Implement FMNMX_C, FMNMX_R and FMNMX_IMM | ReinUsesLisp | 2019-01-15 | 1 | -0/+18 |
| | | | |||||
| * | | shader_decode: Implement MUFU | ReinUsesLisp | 2019-01-15 | 1 | -0/+29 |
| | | | |||||
| * | | shader_decode: Implement FADD_C, FADD_R and FADD_IMM | ReinUsesLisp | 2019-01-15 | 1 | -0/+15 |
| | | | |||||
| * | | shader_decode: Implement FMUL_C, FMUL_R and FMUL_IMM | ReinUsesLisp | 2019-01-15 | 1 | -0/+42 |
| | | | |||||
| * | | shader_decode: Implement MOV_C and MOV_R | ReinUsesLisp | 2019-01-15 | 1 | -1/+23 |
| | | | |||||
| * | | video_core: Replace gl_shader_decompiler | ReinUsesLisp | 2019-01-15 | 8 | -4185/+57 |
| | | | |||||
| * | | glsl_decompiler: Implementation | ReinUsesLisp | 2019-01-15 | 3 | -0/+1483 |
| | | | |||||
| * | | shader_ir: Add condition code helper | ReinUsesLisp | 2019-01-15 | 2 | -0/+13 |
| | | | |||||
| * | | shader_ir: Add predicate combiner helper | ReinUsesLisp | 2019-01-15 | 2 | -0/+15 |
| | | | |||||
| * | | shader_ir: Add comparison helpers | ReinUsesLisp | 2019-01-15 | 2 | -0/+106 |
| | | | |||||
| * | | shader_ir: Add half float helpers | ReinUsesLisp | 2019-01-15 | 2 | -0/+44 |
| | | | |||||
| * | | shader_ir: Add integer helpers | ReinUsesLisp | 2019-01-15 | 2 | -0/+40 |
| | | | |||||
| * | | shader_ir: Add float helpers | ReinUsesLisp | 2019-01-15 | 2 | -0/+24 |
| | | | |||||
| * | | shader_ir: Add setters | ReinUsesLisp | 2019-01-15 | 2 | -0/+24 |
| | | | |||||
| * | | shader_ir: Add local memory getters | ReinUsesLisp | 2019-01-15 | 2 | -0/+7 |
| | | | |||||
| * | | shader_ir: Add internal flag getters | ReinUsesLisp | 2019-01-15 | 2 | -0/+10 |
| | | | |||||
| * | | shader_ir: Add attribute getters | ReinUsesLisp | 2019-01-15 | 2 | -0/+26 |
| | | | |||||
| * | | shader_ir: Add constant buffer getters | ReinUsesLisp | 2019-01-15 | 2 | -0/+25 |
| | | | |||||
| * | | shader_ir: Add register getter | ReinUsesLisp | 2019-01-15 | 2 | -0/+9 |
| | | | |||||
| * | | shader_ir: Add immediate node constructors | ReinUsesLisp | 2019-01-15 | 2 | -1/+34 |
| | | | |||||
| * | | shader_ir: Initial implementation | ReinUsesLisp | 2019-01-15 | 30 | -0/+1573 |
| | | | |||||
| * | | shader_bytecode: Fixup encoding | ReinUsesLisp | 2019-01-15 | 1 | -1/+1 |
| | | | |||||
| * | | shader_header: Make local memory size getter constant | ReinUsesLisp | 2019-01-15 | 1 | -1/+1 |
| | | | |||||
* | | | frontend: Refactor ScopeAcquireWindowContext out of renderer_opengl. | bunnei | 2019-01-24 | 3 | -28/+2 |
| | | | |||||
* | | | maxwell_3d: Set rt_separate_frag_data to 1 by default | ReinUsesLisp | 2019-01-22 | 2 | -4/+6 |
| | | | | | | | | | | | | | | | | | | | | | Commercial games assume that this value is 1 but they never set it. On the other hand nouveau manually sets this register. On ConfigureFramebuffers we were asserting for what we are actually implementing (according to envytools). | ||||
* | | | Rename step 1 and step 2 to be a little more descriptive | James Rowe | 2019-01-21 | 1 | -2/+2 |
| | | | |||||
* | | | QT: Upgrade the Loading Bar to look much better | James Rowe | 2019-01-20 | 1 | -0/+9 |
| | | | |||||
* | | | Merge pull request #2008 from ReinUsesLisp/dirty-framebuffers | bunnei | 2019-01-20 | 6 | -8/+78 |
|\ \ \ | | | | | | | | | gl_rasterizer_cache: Use dirty flags for framebuffers | ||||
| * | | | gl_rasterizer: Skip framebuffer configuration if rendertargets have not been changed | ReinUsesLisp | 2019-01-07 | 2 | -1/+31 |
| | | | | |||||
| * | | | gl_rasterizer_cache: Use dirty flags for the depth buffer | ReinUsesLisp | 2019-01-07 | 4 | -3/+23 |
| | | | | |||||
| * | | | gl_rasterizer_cache: Use dirty flags for color buffers | ReinUsesLisp | 2019-01-07 | 4 | -4/+24 |
| | | | | |||||
* | | | | Merge pull request #2002 from ReinUsesLisp/dsa-vao-buffer | bunnei | 2019-01-20 | 9 | -103/+73 |
|\ \ \ \ | | | | | | | | | | | gl_rasterizer: Use DSA for VAOs and buffers | ||||
| * | | | | gl_rasterizer: Workaround Intel VAO DSA bug | ReinUsesLisp | 2019-01-09 | 3 | -7/+16 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | There is a bug on Intel's blob driver where it fails to properly build a vertex array object if it's not bound even after creating it with glCreateVertexArrays. This workaround binds it after creating it to bypass the issue. | ||||
| * | | | | gl_stream_buffer: Use DSA for buffer management | ReinUsesLisp | 2019-01-06 | 3 | -17/+14 |
| | | | | | |||||
| * | | | | gl_rasterizer: Use DSA for vertex array objects | ReinUsesLisp | 2019-01-06 | 6 | -79/+53 |
| | | | | | |||||
| * | | | | gl_state: Drop uniform buffer state tracking | ReinUsesLisp | 2019-01-06 | 3 | -10/+0 |
| | | | | | |||||
* | | | | | gl_rasterizer: Silent unsafe mix warning | ReinUsesLisp | 2019-01-18 | 1 | -1/+1 |
| |_|/ / |/| | | | |||||
* | | | | gl_global_cache: Add dummy global cache manager | ReinUsesLisp | 2019-01-08 | 5 | -3/+96 |
| | | | | |||||
* | | | | Merge pull request #1999 from ReinUsesLisp/dirty-shader | bunnei | 2019-01-07 | 5 | -2/+23 |
|\ \ \ \ | | |/ / | |/| | | gl_shader_cache: Use dirty flags for shaders | ||||
| * | | | gl_shader_cache: Use dirty flags for shaders | ReinUsesLisp | 2019-01-07 | 5 | -2/+23 |
| |/ / | |||||
* / / | gl_rasterizer_cache: Use GL_STREAM_COPY for PBOs | ReinUsesLisp | 2019-01-05 | 1 | -1/+1 |
|/ / | | | | | | | | | | | Since the data is doing the path CPU -> GPU -> GPU copy is the most approximate hint. Using GL_STREAM_DRAW generated a performance warning on Nvidia's stack. Changing this hint removed the warning. | ||||
* | | Merge pull request #1961 from ReinUsesLisp/tex-view-2d | bunnei | 2019-01-02 | 3 | -14/+74 |
|\ \ | | | | | | | gl_rasterizer_cache: Texture view if shader samples array but OGL is not | ||||
| * | | gl_rasterizer_cache: Texture view if shader samples array but OGL is not | ReinUsesLisp | 2018-12-30 | 3 | -14/+74 |
| | | | | | | | | | | | | | | | | | | | | | When a shader samples a texture array but that texture in OpenGL is created without layers, use a texture view to increase the texture hierarchy. For example, instead of binding a GL_TEXTURE_2D bind a GL_TEXTURE_2D_ARRAY view. | ||||
* | | | gpu: Remove PixelFormat G8R8U and G8R8S, as they do not seem to exist. | bunnei | 2018-12-28 | 4 | -79/+46 |
| | | | | | | | | | | | | - Fixes UI rendering issues in The Legend of Zelda: Breath of the Wild. | ||||
* | | | Add missing uintBitsToFloat to SetRegisterToHalfFloat | Rodolfo Bogado | 2018-12-27 | 1 | -2/+2 |
| | | | |||||
* | | | Merge pull request #1892 from Tinob/master | bunnei | 2018-12-27 | 1 | -113/+122 |
|\ \ \ | | | | | | | | | Improve Zero flag implementation | ||||
| * | | | Apply CC test to the final value to be stored in the register | Rodolfo Bogado | 2018-12-26 | 1 | -9/+12 |
| | | | | |||||
| * | | | Includde saturation in the evaluation of the control code | Rodolfo Bogado | 2018-12-22 | 1 | -3/+4 |
| | | | | |||||
| * | | | Handle RZ cases evaluating the expression instead of the register value. | Rodolfo Bogado | 2018-12-22 | 1 | -14/+22 |
| | | | | |||||
| * | | | complete emulation of ZeroFlag | Rodolfo Bogado | 2018-12-22 | 1 | -100/+97 |
| | | | | |||||
* | | | | renderer_opengl: Correct forward declaration of FramebufferLayout | Lioncash | 2018-12-26 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | This is actually a struct, not a class, which can lead to compilation warnings. | ||||
* | | | | Fixed shader linking error due to TLDS (#1934) | David | 2018-12-26 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * Fixed shader linking error due to TLDS coord should be coords * Fix remaining coords | ||||
* | | | | shader_bytecode: Fixup TEXS.F16 encoding | ReinUsesLisp | 2018-12-26 | 1 | -1/+1 |
| |/ / |/| | | |||||
* | | | Merge pull request #1886 from FearlessTobi/port-4164 | bunnei | 2018-12-23 | 6 | -4/+95 |
|\ \ \ | |/ / |/| | | Port citra-emu/citra#4164: "citra_qt, video_core: Screenshot functionality" | ||||
| * | | yuzu, video_core: Screenshot functionality | zhupengfei | 2018-12-18 | 6 | -4/+95 |
| |/ | | | | | | | Allows capturing screenshot at the current internal resolution (native for software renderer), but a setting is available to capture it in other resolutions. The screenshot is saved to a single PNG in the current layout. | ||||
* | | Merge pull request #1921 from ogniK5377/no-unit | bunnei | 2018-12-21 | 10 | -3/+30 |
|\ \ | | | | | | | Fixed uninitialized memory due to missing returns in canary | ||||
| * | | hopefully fix clang format issue | David Marcec | 2018-12-19 | 1 | -0/+1 |
| | | | |||||
| * | | Fixed uninitialized memory due to missing returns in canary | David Marcec | 2018-12-19 | 10 | -3/+29 |
| | | | | | | | | | | | | Functions which are suppose to crash on non canary builds usually don't return anything which lead to uninitialized memory being used. | ||||
* | | | Merge pull request #1920 from heapo/texture_format_selection | bunnei | 2018-12-21 | 1 | -1/+11 |
|\ \ \ | | | | | | | | | Texture format fixes for RGBA16UI for copies and R16U when used as depth | ||||
| * | | | Texture format fixes: Flag RGBA16UI as GL_RGBA_INTEGER format, and interpret R16U as Z16 when depth_compare is enabled. | heapo | 2018-12-18 | 1 | -1/+11 |
| |/ / | |||||
* | | | Merge pull request #1909 from heapo/shadow_sampling_fixes | bunnei | 2018-12-19 | 1 | -16/+14 |
|\ \ \ | |/ / |/| | | Fix arrayed texture LOD selection and depth comparison ordering | ||||
| * | | Fix arrayed shadow sampler array slice/depth comparison ordering, as well as invalid GLSL LOD selection. | heapo | 2018-12-17 | 1 | -16/+14 |
| | | | |||||
* | | | shader_bytecode: Fixup half float's operator B encoding | ReinUsesLisp | 2018-12-18 | 1 | -1/+1 |
| | | | |||||
* | | | Implement postfactor multiplication/division for fmul instructions | heapo | 2018-12-17 | 2 | -5/+21 |
|/ / | |||||
* | | Merge pull request #1893 from lioncash/warn | bunnei | 2018-12-12 | 1 | -3/+3 |
|\ \ | | | | | | | gl_shader_cache: Resolve truncation compiler warning | ||||
| * | | gl_shader_cache: Dehardcode constant in CalculateProgramSize() | Lioncash | 2018-12-11 | 1 | -2/+2 |
| | | | | | | | | | | | | This constant is related to the size of the instruction. | ||||
| * | | gl_shader_cache: Resolve truncation compiler warning | Lioncash | 2018-12-11 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | The previous code would cause a warning, as it was truncating size_t (64-bit) to a u32 (32-bit) implicitly. | ||||
* | | | Merge pull request #1888 from marcosvitali/glFrontFacing | bunnei | 2018-12-11 | 1 | -1/+1 |
|\ \ \ | |/ / |/| | | gl_shader_decompiler: IPA fix FrontFacing. | ||||
| * | | gl_shader_decompiler: IPA FrontFacing: the right value when is the front face is 0xFFFFFFFF. | Marcos Vitali | 2018-12-10 | 1 | -1/+1 |
| |/ | |||||
* | | Merge pull request #1740 from FernandoS27/shader_props | bunnei | 2018-12-10 | 4 | -0/+57 |
|\ \ | | | | | | | Implemented Shader Unique Identifiers | ||||
| * | | Implemented a shader unique identifier. | Fernando Sahmkow | 2018-12-09 | 4 | -0/+57 |
| |/ | |||||
* / | gl_shader_decompiler: TLDS/TLD4/TLD4S Reworked reflecting the source registers, bugs fixed and modularize. | Marcos Vitali | 2018-12-07 | 1 | -106/+134 |
|/ | |||||
* | Merge pull request #1824 from ReinUsesLisp/fbcache | bunnei | 2018-12-06 | 2 | -40/+82 |
|\ | | | | | gl_rasterizer: Implement a framebuffer cache | ||||
| * | gl_rasterizer: Implement a framebuffer cache | ReinUsesLisp | 2018-11-29 | 2 | -40/+82 |
| | | |||||
* | | gl_shader_decompiler: Implement TEXS.F16 | ReinUsesLisp | 2018-12-05 | 2 | -13/+51 |
| | | |||||
* | | gl_shader_decompiler: Fixup inverted if | ReinUsesLisp | 2018-12-05 | 1 | -6/+5 |
| | | |||||
* | | Improve msvc codegen for hot-path array LUTs | heapo | 2018-12-05 | 1 | -275/+277 |
| | | | | | | | | | | | | | | In some constexpr functions, msvc is building the LUT at runtime (pushing each element onto the stack) out of an abundance of caution. Moving the arrays into be file-scoped constexpr's avoids this and turns the functions into simple look-ups as intended. | ||||
* | | Rewrited TEX/TEXS (TEX Scalar). (#1826) | Marcos | 2018-12-04 | 1 | -259/+177 |
| | | | | | | | | | | | | | | | | * Rewrited TEX/TEXS (TEX Scalar). * Style fixes. * Styles issues. | ||||
* | | Merge pull request #1854 from Subv/old_command_processor | bunnei | 2018-12-04 | 2 | -142/+6 |
|\ \ | | | | | | | Don't try to route PFIFO methods (0-0x40) to the other engines. | ||||
| * | | Removed unused file. | Subv | 2018-12-04 | 1 | -142/+0 |
| | | | | | | | | | | | | This is a leftover from #1792 | ||||
| * | | GPU: Don't try to route PFIFO methods (0-0x40) to the other engines. | Subv | 2018-12-04 | 1 | -0/+6 |
| | | | |||||
* | | | Merge pull request #1822 from ReinUsesLisp/glsl-scope | bunnei | 2018-12-03 | 1 | -250/+213 |
|\ \ \ | | | | | | | | | gl_shader_decompiler: Introduce a scoped object and style changes | ||||
| * | | | gl_shader_decompiler: Remove texture temporal in TLD4 | ReinUsesLisp | 2018-11-29 | 1 | -3/+1 |
| | | | | |||||
| * | | | gl_shader_decompiler: Flip negated if else statement | ReinUsesLisp | 2018-11-29 | 1 | -3/+3 |
| | | | | |||||
| * | | | gl_shader_decompiler: Use GLSL scope on instructions unrelated to textures | ReinUsesLisp | 2018-11-29 | 1 | -35/+10 |
| | | | | |||||
| * | | | gl_shader_decompiler: Move texture code generation into lambdas | ReinUsesLisp | 2018-11-29 | 1 | -97/+78 |
| | | | | |||||
| * | | | gl_shader_decompiler: Clean up texture instructions | ReinUsesLisp | 2018-11-29 | 1 | -87/+56 |
| | | | | |||||
| * | | | gl_shader_decompiler: Scope GLSL variables with a scoped object | ReinUsesLisp | 2018-11-29 | 1 | -32/+72 |
| | |/ | |/| | |||||
* | | | Merge pull request #1827 from ReinUsesLisp/clip-and-shader | bunnei | 2018-12-02 | 5 | -13/+37 |
|\ \ \ | | | | | | | | | gl_rasterizer: Enable clip distances when set in register and in shader | ||||
| * | | | gl_rasterizer: Enable clip distances when set in register and in shader | ReinUsesLisp | 2018-11-29 | 5 | -13/+37 |
| |/ / | |||||
* | | | Merge pull request #1825 from ReinUsesLisp/shader-pipeline-cache | bunnei | 2018-12-02 | 1 | -4/+17 |
|\ \ \ | | | | | | | | | gl_shader_manager: Update pipeline when programs have changed | ||||
| * | | | gl_shader_manager: Update pipeline when programs have changed | ReinUsesLisp | 2018-11-29 | 1 | -4/+17 |
| |/ / | |||||
* | | | Merge pull request #1795 from ReinUsesLisp/vc-cleanup | bunnei | 2018-12-02 | 3 | -32/+3 |
|\ \ \ | | | | | | | | | video_core: Minor style changes | ||||
| * | | | gl_rasterizer: Signal UNIMPLEMENTED when rt_separate_frag_data is not zero | ReinUsesLisp | 2018-11-29 | 1 | -1/+1 |
| | | | | |||||
| * | | | gl_rasterizer_cache: Use brackets for two-line single-expresion blocks | ReinUsesLisp | 2018-11-29 | 1 | -1/+2 |
| | | | | |||||
| * | | | gl_rasterizer: Remove unused struct declarations | ReinUsesLisp | 2018-11-29 | 1 | -14/+0 |
| | | | | |||||
| * | | | gl_rasterizer: Remove extension booleans | ReinUsesLisp | 2018-11-29 | 2 | -16/+0 |
| |/ / | |||||
* | | | Merge pull request #1823 from bunnei/fix-surface-copy | bunnei | 2018-12-02 | 1 | -145/+5 |
|\ \ \ | |_|/ |/| | | gl_rasterizer_cache: Fix several surface copy issues. | ||||
| * | | gl_rasterizer_cache: Update AccurateCopySurface to flush complete source surface. | bunnei | 2018-11-30 | 1 | -1/+4 |
| | | | | | | | | | | | | - Fixes issues with Breath of the Wild with use_accurate_gpu_emulation setting. | ||||
| * | | gl_rasterizer_cache: Remove BlitSurface and replace with more accurate copy. | bunnei | 2018-11-29 | 1 | -144/+1 |
| |/ | | | | | | | | | | | - BlitSurface with different texture targets is inherently broken. - When target is the same, we can just use FastCopySurface. - Fixes rendering issues with Breath of the Wild. | ||||
* / | Fix debug build | Lioncash | 2018-12-01 | 1 | -4/+2 |
|/ | | | | | A non-existent parameter was left in some formatting calls (the logging macro for which only does anything meaningful on debug builds) | ||||
* | Merge pull request #1808 from Tinob/master | bunnei | 2018-11-28 | 3 | -15/+31 |
|\ | | | | | Fix clip distance and viewport | ||||
| * | remove viewport_transform_enabled as it seems to be inactive when valid transforms are used. | Rodolfo Bogado | 2018-11-27 | 1 | -12/+5 |
| | | |||||
| * | Add support for Clip Distance enabled register | Rodolfo Bogado | 2018-11-27 | 3 | -3/+26 |
| | | |||||
* | | Merge pull request #1786 from Tinob/DepthClamp | bunnei | 2018-11-28 | 5 | -10/+58 |
|\ \ | | | | | | | Add Depth Clamp Support | ||||
| * | | Implement depth clamp | Rodolfo Bogado | 2018-11-27 | 5 | -10/+58 |
| |/ | |||||
* | | Merge pull request #1792 from bunnei/dma-pusher | bunnei | 2018-11-28 | 17 | -105/+355 |
|\ \ | | | | | | | gpu: Rewrite GPU command list processing with DmaPusher class. | ||||
| * | | dma_pushbuffer: Optimize to avoid loop and copy on Push. | bunnei | 2018-11-28 | 2 | -5/+17 |
| | | | |||||
| * | | gpu: Move command list profiling to DmaPusher::DispatchCalls. | bunnei | 2018-11-28 | 2 | -5/+5 |
| | | | |||||
| * | | gpu: Rewrite GPU command list processing with DmaPusher class. | bunnei | 2018-11-27 | 17 | -105/+343 |
| |/ | | | | | | | - More accurate impl., fixes Undertale (among other games). | ||||
* | | Merge pull request #1735 from FernandoS27/tex-spacing | bunnei | 2018-11-28 | 8 | -36/+55 |
|\ \ | | | | | | | Texture decoder: Implemented Tile Width Spacing | ||||
| * | | Implemented Tile Width Spacing | FernandoS27 | 2018-11-26 | 8 | -36/+55 |
| | | | |||||
* | | | gl_shader_decompiler: Fixup clip distance index | ReinUsesLisp | 2018-11-27 | 1 | -1/+1 |
| | | | |||||
* | | | gl_rasterizer: Fixup for #1723. | Markus Wick | 2018-11-27 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | On invalidating the streaming buffer, we need to reupload all vertex buffers. But we don't need to reconfigure the vertex format. This was a (silly) misstake in #1723. Thanks at Rodrigo for discovering the issue. Fun fact, as configuring the vertex format also invalidate the vertex buffer, this misstake had no affect on the behavior. | ||||
* | | | morton: Fixup compiler warning | ReinUsesLisp | 2018-11-27 | 1 | -1/+2 |
| |/ |/| | |||||
* | | Merge pull request #1794 from Tinob/master | bunnei | 2018-11-27 | 2 | -8/+32 |
|\ \ | | | | | | | Add support for viewport_transfom_enable register | ||||
| * | | Limit the amount of viewports tested for state changes only to the usable ones | Rodolfo Bogado | 2018-11-25 | 1 | -2/+10 |
| | | | |||||
| * | | Add support for viewport_transfom_enable register | Rodolfo Bogado | 2018-11-24 | 2 | -6/+22 |
| | | | |||||
* | | | Merge pull request #1723 from degasus/dirty_flags | bunnei | 2018-11-27 | 9 | -6/+60 |
|\ \ \ | | | | | | | | | gl_rasterizer: Skip VB upload if the state is clean. | ||||
| * | | | gl_rasterizer: Skip VB upload if the state is clean. | Markus Wick | 2018-11-17 | 9 | -6/+60 |
| | | | | |||||
* | | | | GPU States: Implement Polygon Offset. This is used in SMO all the time. (#1784) | Marcos | 2018-11-27 | 5 | -5/+107 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * GPU States: Implement Polygon Offset. This is used in SMO all the time. * Clang Format fixes. * Initialize polygon_offset in the constructor. | ||||
* | | | | Merge pull request #1713 from FernandoS27/bra-cc | bunnei | 2018-11-27 | 1 | -4/+14 |
|\ \ \ \ | | | | | | | | | | | Implemented BRA CC conditional and FSET CC Setting | ||||
| * | | | | Implemented BRA CC conditional and FSET CC Setting | FernandoS27 | 2018-11-24 | 1 | -4/+14 |
| | |/ / | |/| | | |||||
* | | | | Merge pull request #1798 from ReinUsesLisp/y-direction | bunnei | 2018-11-27 | 5 | -16/+26 |
|\ \ \ \ | |_|_|/ |/| | | | gl_shader_decompiler: Implement S2R's Y_DIRECTION | ||||
| * | | | gl_shader_decompiler: Implement S2R's Y_DIRECTION | ReinUsesLisp | 2018-11-25 | 5 | -16/+26 |
| |/ / | |||||
* | | | Merge pull request #1763 from ReinUsesLisp/bfi | bunnei | 2018-11-26 | 2 | -0/+23 |
|\ \ \ | | | | | | | | | gl_shader_decompiler: Implement BFI_IMM_R | ||||
| * | | | gl_shader_decompiler: Implement BFI_IMM_R | ReinUsesLisp | 2018-11-21 | 2 | -0/+23 |
| | | | | |||||
* | | | | Merge pull request #1760 from ReinUsesLisp/r2p | bunnei | 2018-11-26 | 2 | -0/+42 |
|\ \ \ \ | | | | | | | | | | | gl_shader_decompiler: Implement R2P_IMM | ||||
| * | | | | gl_shader_decompiler: Implement R2P_IMM | ReinUsesLisp | 2018-11-21 | 2 | -0/+42 |
| |/ / / | |||||
* | | | | Merge pull request #1782 from FernandoS27/dc | bunnei | 2018-11-26 | 1 | -116/+188 |
|\ \ \ \ | | | | | | | | | | | Fixed Coordinate Encodings in TEX and TEXS instructions | ||||
| * | | | | Fix Texture Overlapping | FernandoS27 | 2018-11-24 | 1 | -43/+70 |
| | | | | | |||||
| * | | | | Fix TEXS Instruction encodings | FernandoS27 | 2018-11-24 | 1 | -22/+48 |
| | | | | | |||||
| * | | | | Fix one encoding in TEX Instruction | FernandoS27 | 2018-11-24 | 1 | -3/+3 |
| | | | | | |||||
| * | | | | Corrected inputs indexing in TEX instruction | FernandoS27 | 2018-11-24 | 1 | -66/+85 |
| | | | | | |||||
* | | | | | Merge pull request #1783 from ReinUsesLisp/clip-distances | bunnei | 2018-11-26 | 3 | -21/+58 |
|\ \ \ \ \ | | | | | | | | | | | | | gl_shader_decompiler: Implement clip distances | ||||
| * | | | | | gl_shader_decompiler: Implement clip distances | ReinUsesLisp | 2018-11-23 | 3 | -21/+58 |
| | | | | | | |||||
* | | | | | | Merge pull request #1796 from ReinUsesLisp/morton-move | bunnei | 2018-11-26 | 6 | -345/+391 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | video_core: Move morton functions out of gl_rasterizer_cache | ||||
| * | | | | | | morton: Style changes | ReinUsesLisp | 2018-11-25 | 1 | -12/+12 |
| | | | | | | | |||||
| * | | | | | | video_core: Move morton functions to their own file | ReinUsesLisp | 2018-11-25 | 6 | -345/+391 |
| | |_|_|/ / | |/| | | | | |||||
* | | | | | | Merge pull request #1787 from bunnei/fix-gpu-mm | bunnei | 2018-11-25 | 2 | -1/+9 |
|\ \ \ \ \ \ | |/ / / / / |/| | | | | | memory_manager: Do not allow 0 to be a valid GPUVAddr. | ||||
| * | | | | | memory_manager: Do not allow 0 to be a valid GPUVAddr. | bunnei | 2018-11-23 | 2 | -1/+9 |
| | | | | | | | | | | | | | | | | | | | | | | | | - Fixes a bug with Undertale using 0 for a render target. | ||||
* | | | | | | Merge pull request #1725 from FernandoS27/gl43 | bunnei | 2018-11-24 | 5 | -20/+13 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | Update OpenGL's backend version from 3.3 to 4.3 | ||||
| * | | | | | | Removed pre 4.3 ARB extensions | FernandoS27 | 2018-11-21 | 5 | -20/+13 |
| | | | | | | | |||||
* | | | | | | | Merge pull request #1785 from Tinob/master | bunnei | 2018-11-24 | 5 | -28/+95 |
|\ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | Add support for clear_flags register | ||||
| * | | | | | | | Add support for clear_flags register | Rodolfo Bogado | 2018-11-24 | 5 | -28/+95 |
| | |_|_|/ / / | |/| | | | | | |||||
* | | | | | | | Merge pull request #1769 from ReinUsesLisp/cc | bunnei | 2018-11-24 | 2 | -70/+81 |
|\ \ \ \ \ \ \ | |/ / / / / / |/| | | | | | | gl_shader_decompiler: Rename cc to condition code and name internal flags | ||||
| * | | | | | | gl_shader_decompiler: Add a message for unimplemented cc generation | ReinUsesLisp | 2018-11-22 | 1 | -23/+46 |
| | | | | | | | |||||
| * | | | | | | gl_shader_decompiler: Rename internal flag strings | ReinUsesLisp | 2018-11-22 | 1 | -15/+20 |
| | | | | | | | |||||
| * | | | | | | gl_shader_decompiler: Rename control codes to condition codes | ReinUsesLisp | 2018-11-22 | 2 | -67/+50 |
| | | | | | | | |||||
* | | | | | | | Merge pull request #1744 from degasus/shader_cache | bunnei | 2018-11-24 | 4 | -8/+24 |
|\ \ \ \ \ \ \ | |_|_|/ / / / |/| | | | | | | shader_cache: Only lock covered instructions. | ||||
| * | | | | | | shader_cache: Only lock covered instructions. | Markus Wick | 2018-11-20 | 4 | -8/+24 |
| | |/ / / / | |/| | | | | |||||
* | | | | | | Added predicate comparison LessEqualWithNan (#1736) | Hexagon12 | 2018-11-23 | 2 | -5/+13 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * Added predicate comparison LessEqualWithNan * oops * Clang fix | ||||
* | | | | | | Merge pull request #1756 from ReinUsesLisp/fix-textures | bunnei | 2018-11-23 | 1 | -60/+78 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | gl_shader_decompiler: Fix register overwriting on texture calls | ||||
| * | | | | | | gl_shader_decompiler: Fix register overwriting on texture calls | ReinUsesLisp | 2018-11-22 | 1 | -60/+78 |
| | |/ / / / | |/| | | | | |||||
* | | | | | | Merge pull request #1766 from FernandoS27/fix-txq | bunnei | 2018-11-23 | 1 | -2/+12 |
|\ \ \ \ \ \ | |_|_|/ / / |/| | | | | | Properly Implemented TXQ Instruction | ||||
| * | | | | | Properly Implemented TXQ Instruction | FernandoS27 | 2018-11-21 | 1 | -2/+12 |
| | |/ / / | |/| | | | |||||
* | | | | | Merge pull request #1775 from bunnei/blend-eq | bunnei | 2018-11-22 | 2 | -0/+12 |
|\ \ \ \ \ | | | | | | | | | | | | | maxwell_3d: Implement alternate blend equations. | ||||
| * | | | | | maxwell_3d: Implement alternate blend equations. | bunnei | 2018-11-22 | 2 | -0/+12 |
| | | | | | | | | | | | | | | | | | | | | | | | | - Used by Undertale. | ||||
* | | | | | | Merge pull request #1764 from bunnei/macrointerpreter | bunnei | 2018-11-22 | 2 | -8/+25 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | macro_interpreter: Implement AddWithCarry and SubtractWithBorrow. | ||||
| * | | | | | | macro_interpreter: Implement AddWithCarry and SubtractWithBorrow. | bunnei | 2018-11-22 | 2 | -8/+25 |
| |/ / / / / | | | | | | | | | | | | | | | | | | | - Used by Undertale. | ||||
* | | | | | | Merge pull request #1737 from FernandoS27/layer-copy | bunnei | 2018-11-22 | 2 | -2/+30 |
|\ \ \ \ \ \ | |_|_|/ / / |/| | | | | | Implemented Fast Layered Copy | ||||
| * | | | | | Implemented Fast Layered Copy | FernandoS27 | 2018-11-20 | 2 | -2/+30 |
| | |/ / / | |/| | | | |||||
* | | | | | Merge pull request #1753 from FernandoS27/ufbtype | bunnei | 2018-11-21 | 2 | -0/+8 |
|\ \ \ \ \ | | | | | | | | | | | | | Use default values for unknown framebuffer pixel format | ||||
| * | | | | | Use default values for unknown framebuffer pixel format | FernandoS27 | 2018-11-21 | 2 | -0/+8 |
| |/ / / / | |||||
* | | | | | Merge pull request #1752 from ReinUsesLisp/unimpl-decompiler | bunnei | 2018-11-21 | 1 | -371/+258 |
|\ \ \ \ \ | |_|/ / / |/| | | | | gl_shader_decompiler: Use UNIMPLEMENTED when applicable | ||||
| * | | | | gl_shader_decompiler: Use UNIMPLEMENTED instead of LOG+UNREACHABLE when applicable | ReinUsesLisp | 2018-11-21 | 1 | -371/+258 |
| |/ / / | |||||
* | | | | Merge pull request #1754 from ReinUsesLisp/zero-register | bunnei | 2018-11-21 | 1 | -2/+1 |
|\ \ \ \ | |_|/ / |/| | | | gl_shader_decompiler: Remove UNREACHABLE when setting RZ | ||||
| * | | | gl_shader_decompiler: Remove UNREACHABLE when setting RZ | ReinUsesLisp | 2018-11-21 | 1 | -2/+1 |
| |/ / | |||||
* / / | maxwell_3d: Initialize rasterizer color mask registers as enabled. | bunnei | 2018-11-21 | 1 | -0/+9 |
|/ / | | | | | | | - Fixes rendering regression with Sonic Mania. | ||||
* | | Merge pull request #1717 from FreddyFunk/swizzle-gob | bunnei | 2018-11-19 | 1 | -37/+33 |
|\ \ | | | | | | | textures/decoders: Replace magic numbers | ||||
| * | | textures/decoders: Replace magic numbers | Frederic Laing | 2018-11-17 | 1 | -37/+33 |
| |/ | |||||
* | | Merge pull request #1693 from Tinob/master | bunnei | 2018-11-19 | 9 | -211/+315 |
|\ \ | | | | | | | Missing ogl states | ||||
| * | | drop support for non separate alpha as it seems to cause issues in some games | Rodolfo Bogado | 2018-11-18 | 3 | -61/+35 |
| | | | |||||
| * | | fix sampler configuration, thanks to Marcos for his investigation | Rodolfo Bogado | 2018-11-17 | 3 | -19/+57 |
| | | | |||||
| * | | small type fix | Rodolfo Bogado | 2018-11-17 | 1 | -6/+6 |
| | | | |||||
| * | | small fix for alphaToOne bit location | Rodolfo Bogado | 2018-11-17 | 1 | -2/+2 |
| | | | |||||
| * | | fix for gcc compilation | Rodolfo Bogado | 2018-11-17 | 1 | -60/+61 |
| | | | |||||
| * | | add AlphaToCoverage and AlphaToOne | Rodolfo Bogado | 2018-11-17 | 5 | -1/+39 |
| | | | |||||
| * | | add support for fragment_color_clamp | Rodolfo Bogado | 2018-11-17 | 5 | -1/+24 |
| | | | |||||
| * | | add missing MirrorOnceBorder support where supported | Rodolfo Bogado | 2018-11-17 | 1 | -0/+6 |
| | | | |||||
| * | | set border color not depending on the wrap mode | Rodolfo Bogado | 2018-11-17 | 1 | -9/+9 |
| | | | | | | | | | | | | only enable color mask for the first framebuffer id independent blending is disabled | ||||
| * | | set default value for point size register | Rodolfo Bogado | 2018-11-17 | 2 | -5/+4 |
| | | | |||||
| * | | fix viewport and scissor behavior | Rodolfo Bogado | 2018-11-17 | 6 | -64/+89 |
| |/ | |||||
* | | Eliminated unnessessary memory allocation and copy (#1702) | Frederic L | 2018-11-19 | 3 | -9/+20 |
| | | |||||
* | | gl_rasterizer: Remove default clip distance | ReinUsesLisp | 2018-11-19 | 1 | -2/+0 |
|/ | |||||
* | Merge pull request #1700 from FreddyFunk/cleanup | bunnei | 2018-11-16 | 1 | -3/+3 |
|\ | | | | | gl_rasterizer_chache: Minor cleanup | ||||
| * | gl_rasterizer_chache: Minor cleanup | Frederic Laing | 2018-11-15 | 1 | -3/+3 |
| | | |||||
* | | Merge pull request #1701 from FreddyFunk/decoders | bunnei | 2018-11-16 | 1 | -16/+16 |
|\ \ | | | | | | | textures/decoders: Minor cleanup | ||||
| * | | textures/decoders: Minor cleanup | Frederic Laing | 2018-11-15 | 1 | -16/+16 |
| |/ | |||||
* | | Merge pull request #1676 from lioncash/warn | bunnei | 2018-11-16 | 2 | -3/+4 |
|\ \ | |/ |/| | gl_state: Amend compilation warnings | ||||
| * | gl_state: Amend compilation warnings | Lioncash | 2018-11-13 | 2 | -3/+4 |
| | | | | | | | | | | Makes float -> integral conversions explicit via casts and also silences a sign conversion warning. | ||||
* | | Merge pull request #1637 from FernandoS27/cache | bunnei | 2018-11-15 | 1 | -18/+17 |
|\ \ | | | | | | | Improved GPU Caches lookup Speed | ||||
| * | | Improved GPU Caches lookup Speed | FernandoS27 | 2018-11-11 | 1 | -18/+17 |
| | | | |||||
* | | | Merge pull request #1662 from FreddyFunk/CopySurface-Optimization | bunnei | 2018-11-14 | 1 | -10/+7 |
|\ \ \ | | | | | | | | | gl_rasterizer_cache: CopySurface optimization | ||||
| * | | | gl_rasterizer_cache: Remove unnecessary memory allocation and copy in CopySurface | Frederic Laing | 2018-11-08 | 1 | -10/+7 |
| | | | | |||||
* | | | | Merge pull request #1685 from lioncash/base | bunnei | 2018-11-14 | 1 | -1/+0 |
|\ \ \ \ | | | | | | | | | | | video_core/renderer_base: Remove GL include from the renderer base class files | ||||
| * | | | | video_core/renderer_base: Remove GL include from the renderer base class files | Lioncash | 2018-11-13 | 1 | -1/+0 |
| | |_|/ | |/| | | | | | | | | | | Keeps the base class source files implementation-agnostic. | ||||
* / | | | gl_rasterizer: Minor cleanup | Frederic L | 2018-11-13 | 1 | -4/+2 |
|/ / / | | | | | | | Minor code cleanup from unaddressed feedback in #1654 | ||||
* | | | Merge pull request #1628 from greggameplayer/Texture2DArray | bunnei | 2018-11-13 | 1 | -0/+1 |
|\ \ \ | | | | | | | | | Implement SurfaceTarget Texture2DArray | ||||
| * \ \ | Merge branch 'master' into Texture2DArray | greggameplayer | 2018-11-06 | 12 | -186/+338 |
| |\ \ \ | |||||
| * | | | | correct syntax | greggameplayer | 2018-11-02 | 1 | -4/+3 |
| | | | | | |||||
| * | | | | Merge branch 'master' into Texture2DArray | greggameplayer | 2018-11-02 | 14 | -1031/+1296 |
| |\ \ \ \ | |||||
| * | | | | | Implement SurfaceTarget Texture2DArray | greggameplayer | 2018-10-31 | 1 | -0/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | ( needed by Mario+Rabbids Kingdom Battle ) | ||||
* | | | | | | Implement ASTC_2D_10X8 & ASTC_2D_10X8_SRGB (#1666) | greggameplayer | 2018-11-13 | 4 | -71/+101 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * Implement ASTC_2D_10X8 & ASTC_2D_10X8_SRGB ( needed by Mario+Rabbids Kingdom Battle ) * Small placement correction | ||||
* | | | | | | Merge pull request #1660 from Tinob/master | bunnei | 2018-11-12 | 9 | -88/+138 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | Map more missing opengl states | ||||
| * | | | | | | Use core extensions when available to set max anisotropic filtering level | Rodolfo Bogado | 2018-11-11 | 1 | -2/+7 |
| | | | | | | | |||||
| * | | | | | | Improve state management by splitting some of the states id separated function to avoid a full apply overhead | Rodolfo Bogado | 2018-11-11 | 6 | -39/+40 |
| | | | | | | | |||||
| * | | | | | | Try to fix problems with stencil test in some games, relax translation to opengl enums to avoid crashing and only generate logs of the errors. | Rodolfo Bogado | 2018-11-11 | 4 | -37/+61 |
| | | | | | | | |||||
| * | | | | | | set sampler max lod, min lod, lod bias and max anisotropy | Rodolfo Bogado | 2018-11-11 | 3 | -13/+33 |
| | |_|_|_|/ | |/| | | | | |||||
* | | | | | | Merge pull request #1664 from FreddyFunk/cast2 | bunnei | 2018-11-11 | 1 | -2/+2 |
|\ \ \ \ \ \ | |/ / / / / |/| | | | | | gl_rasterizer: Fix compiler warnings | ||||
| * | | | | | gl_rasterizer: Fix compiler warnings | Frederic Laing | 2018-11-08 | 1 | -2/+2 |
| | |_|_|/ | |/| | | | |||||
* | | | | | Merge pull request #1669 from ReinUsesLisp/fixup-gs | bunnei | 2018-11-11 | 4 | -15/+24 |
|\ \ \ \ \ | | | | | | | | | | | | | gl_shader_decompiler: Guard out of bound geometry shader input reads | ||||
| * | | | | | gl_shader_decompiler: Guard out of bound geometry shader input reads | ReinUsesLisp | 2018-11-10 | 4 | -15/+24 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Geometry shaders follow a pattern that results in out of bound reads. This pattern is: - VSETP to predicate - Use that predicate to conditionally set a register a big number - Use the register to access geometry shaders At the time of writing this commit I don't know what's the intent of this number. Some drivers argue about these out of bound reads. To avoid this issue, input reads are guarded limiting reads to the highest posible vertex input of the current topology (e.g. points to 1 and triangles to 3). | ||||
* | | | | | | Merge pull request #1663 from lioncash/raster | bunnei | 2018-11-11 | 9 | -10/+25 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | rasterizer_cache: Remove reliance on the System singleton | ||||
| * | | | | | | rasterizer_cache: Remove reliance on the System singleton | Lioncash | 2018-11-08 | 9 | -10/+25 |
| |/ / / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Rather than have a transparent dependency, we can make it explicit in the interface. This also gets rid of the need to put the core include in a header. | ||||
* | | | | | | Merge pull request #1648 from FernandoS27/texs-3-array | bunnei | 2018-11-11 | 1 | -7/+11 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | Implement 3 coordinate array in TEXS instruction | ||||
| * | | | | | | Correct issue where texturelod could not be applied to 2darrayshadow | FernandoS27 | 2018-11-08 | 1 | -1/+5 |
| | | | | | | | |||||
| * | | | | | | Implement 3 coordinate array in TEXS instruction | FernandoS27 | 2018-11-07 | 1 | -6/+6 |
| | |/ / / / | |/| | | | | |||||
* | | | | | | Merge pull request #1654 from degasus/dirty_flags | bunnei | 2018-11-11 | 4 | -7/+37 |
|\ \ \ \ \ \ | | | | | | | | | | | | | | | gl_rasterizer: Skip VAO binding if the state is clean. | ||||
| * | | | | | | gl_rasterizer: Skip VAO binding if the state is clean. | Markus Wick | 2018-11-06 | 3 | -2/+21 |
| | | | | | | | |||||
| * | | | | | | gl_rasterizer: Split VAO and VB setup functions. | Markus Wick | 2018-11-06 | 2 | -5/+16 |
| | |_|_|_|/ | |/| | | | | |||||
* | | | | | | rasterizer_cache: Add missing virtual destructor to RasterizerCacheObject | Lioncash | 2018-11-08 | 3 | -0/+10 |
| |_|/ / / |/| | | | | | | | | | | | | | | Ensures that destruction will always do the right thing in any context. | ||||
* | | | | | gl_resource_manager: Amend clang-format discrepancies | Lioncash | 2018-11-08 | 1 | -4/+2 |
| |/ / / |/| | | | | | | | | | | | Fixes the buildbot. | ||||
* | | | | Merge pull request #1630 from bunnei/fix-mapbufferex | bunnei | 2018-11-07 | 2 | -31/+52 |
|\ \ \ \ | | | | | | | | | | | memory_manager: Do not MapBufferEx over already in use memory. | ||||
| * | | | | memory_manager: Do not MapBufferEx over already in use memory. | bunnei | 2018-11-01 | 2 | -31/+52 |
| | |_|/ | |/| | | | | | | | | | | - This fixes rendering when changing areas in Super Mario Odyssey. | ||||
* | | | | Merge pull request #1635 from Tinob/master | bunnei | 2018-11-07 | 6 | -153/+338 |
|\ \ \ \ | | | | | | | | | | | Implement multi-target viewports and blending | ||||
| * | | | | Add support to color mask to avoid issues in blending caused by wrong values in the alpha channel in some render targets. | Rodolfo Bogado | 2018-11-05 | 5 | -25/+79 |
| | | | | | |||||
| * | | | | Implement multi-target viewports and blending | Rodolfo Bogado | 2018-11-05 | 6 | -128/+259 |
| | | | | | |||||
* | | | | | gl_rasterizer_cache: Add profiles for Copy and Blit. | Markus Wick | 2018-11-06 | 1 | -2/+6 |
| | | | | | | | | | | | | | | | | | | | | | | | | | They were missed, and Copy is very high in profile here. It doesn't block the GPU, but it stalls the driver thread. So with our bad GL instructions, this might block quite a while. | ||||
* | | | | | gl_resource_manager: Profile creation and deletion. | Markus Wick | 2018-11-06 | 1 | -0/+42 |
| | | | | | |||||
* | | | | | gl_stream_buffer: Profile orphaning of stream buffer. | Markus Wick | 2018-11-06 | 1 | -0/+5 |
| |_|/ / |/| | | | | | | | | | | | | | | | This serialize to the driver thread and so it may block for a while. So if it is in the benchmark, we get noticed if it happens too often. | ||||
* | | | | gl_resource_manager: Split implementations in .cpp file. | Markus Wick | 2018-11-06 | 5 | -114/+167 |
| | | | | | | | | | | | | | | | | | | | | Those implementations are quite costly, so there is no need to inline them to the caller. Ressource deletion is often a performance bug, so in this way, we support to add breakpoints to them. | ||||
* | | | | Merge pull request #1616 from FernandoS27/cube-array | bunnei | 2018-11-05 | 4 | -0/+20 |
|\ \ \ \ | |/ / / |/| | | | Implement Cube Arrays | ||||
| * | | | Implement Cube Arrays | FernandoS27 | 2018-11-01 | 4 | -0/+20 |
| | | | | |||||
* | | | | Merge pull request #1625 from FernandoS27/astc | bunnei | 2018-11-05 | 8 | -72/+151 |
|\ \ \ \ | |_|/ / |/| | | | Implement ASTC Textures 5x5 and fix a bunch of ASTC texture problems | ||||
| * | | | Fix ASTC Decompressor to support depth parameter | FernandoS27 | 2018-11-02 | 6 | -62/+128 |
| | | | | |||||
| * | | | Fix ASTC formats | FernandoS27 | 2018-11-01 | 3 | -11/+20 |
| | | | | |||||
| * | | | Implemented ASTC 5x5 | FernandoS27 | 2018-11-01 | 1 | -1/+5 |
| | | | | |||||
* | | | | Merge pull request #1623 from Tinob/master | bunnei | 2018-11-01 | 3 | -105/+158 |
|\ \ \ \ | |/ / / |/| | | | Improve OpenGL state handling | ||||
| * | | | Improve OpenGL state handling | Rodolfo Bogado | 2018-10-31 | 3 | -105/+158 |
| | |/ | |/| | |||||
* | | | Merge pull request #1527 from FernandoS27/assert-flow | bunnei | 2018-11-01 | 2 | -2/+27 |
|\ \ \ | | | | | | | | | Assert Control Flow Instructions using Control Codes | ||||
| * | | | Assert Control Flow Instructions using Control Codes | FernandoS27 | 2018-10-29 | 2 | -3/+28 |
| | | | | |||||
* | | | | maxwell_3d: Restructure macro upload to use a single macro code memory. | bunnei | 2018-11-01 | 4 | -27/+55 |
| | | | | | | | | | | | | | | | | | | | | - Fixes an issue where macros could be skipped. - Fixes rendering of distant objects in Super Mario Odyssey. | ||||
* | | | | Merge pull request #1528 from FernandoS27/assert-control-codes | bunnei | 2018-11-01 | 2 | -1/+103 |
|\ \ \ \ | |_|_|/ |/| | | | Assert Control Codes Generation on Shader Instructions | ||||
| * | | | Assert Control Codes Generation | FernandoS27 | 2018-10-30 | 2 | -1/+103 |
| | | | | |||||
* | | | | video_core: Move surface declarations out of gl_rasterizer_cache | ReinUsesLisp | 2018-10-30 | 6 | -898/+954 |
| |_|/ |/| | | |||||
* | | | global: Use std::optional instead of boost::optional (#1578) | Frederic L | 2018-10-30 | 17 | -97/+107 |
|/ / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * get rid of boost::optional * Remove optional references * Use std::reference_wrapper for optional references * Fix clang format * Fix clang format part 2 * Adressed feedback * Fix clang format and MacOS build | ||||
* | | Merge pull request #1580 from FernandoS27/mm-impl | bunnei | 2018-10-30 | 6 | -109/+254 |
|\ \ | | | | | | | Implemented Mipmaps | ||||
| * | | Fixed black textures, pixelation and we no longer require to auto-generate mipmaps | FernandoS27 | 2018-10-29 | 1 | -14/+2 |
| | | | |||||
| * | | Fixed mipmap block autosizing algorithm | FernandoS27 | 2018-10-29 | 3 | -13/+25 |
| | | | |||||
| * | | Fixed Invalid Image size and Mipmap calculation | FernandoS27 | 2018-10-29 | 1 | -4/+7 |
| | | | |||||
| * | | Fixed Block Resizing algorithm and Clang Format | FernandoS27 | 2018-10-29 | 3 | -12/+19 |
| | | | |||||
| * | | Implement Mip Filter | FernandoS27 | 2018-10-29 | 4 | -10/+33 |
| | | | |||||
| * | | Zero out memory region of recreated surface before flushing | FernandoS27 | 2018-10-29 | 1 | -0/+2 |
| | | | |||||
| * | | Implement Mipmaps | FernandoS27 | 2018-10-28 | 2 | -101/+211 |
| |/ | |||||
* | | Merge pull request #1613 from ReinUsesLisp/gl-utils | bunnei | 2018-10-29 | 6 | -30/+61 |
|\ \ | | | | | | | video_core: Move OpenGL specific utils to its renderer | ||||
| * | | video_core: Move OpenGL specific utils to its renderer | ReinUsesLisp | 2018-10-29 | 6 | -30/+61 |
| |/ | |||||
* | | Merge pull request #1610 from slashiee/dxt1-alpha | bunnei | 2018-10-29 | 1 | -2/+2 |
|\ \ | | | | | | | renderer_opengl: Enable alpha channel for DXT1 texture format | ||||
| * | | Enable alpha channel for DXT1 texture format | Michael | 2018-10-28 | 1 | -2/+2 |
| |/ | |||||
* / | renderer_opengl: Correct bpp value for ASTC_2D_8X5_SRGB | Rodolfo Bogado | 2018-10-29 | 1 | -1/+1 |
|/ | |||||
* | Correct bpp value for ASTC_2D_8X5 | Tobias | 2018-10-28 | 1 | -1/+1 |
| | |||||
* | Merge pull request #1601 from FernandoS27/shader-precision | bunnei | 2018-10-28 | 1 | -20/+35 |
|\ | | | | | Improved Shader accuracy on Vertex and Geometry Shaders. | ||||
| * | Refactor precise usage and add FMNMX, MUFU, FMUL32 and FADD332 | FernandoS27 | 2018-10-28 | 2 | -74/+37 |
| | | |||||
| * | Improved Shader accuracy on Vertex and Geometry Shaders with FFMA, FMUL and FADD | FernandoS27 | 2018-10-28 | 2 | -6/+58 |
| | | |||||
* | | Implement sRGB Support, including workarounds for nvidia driver issues and QT sRGB support | Rodolfo Bogado | 2018-10-28 | 8 | -40/+197 |
| | | |||||
* | | Merge pull request #1594 from FreddyFunk/static-cast | bunnei | 2018-10-28 | 1 | -2/+2 |
|\ \ | |/ |/| | gl_rasterizer_cache: Fix compiler warning | ||||
| * | gl_rasterizer_cache: Fix compiler warning | Frederic Laing | 2018-10-27 | 1 | -2/+2 |
| | | |||||
* | | Merge pull request #1592 from bunnei/prim-restart | bunnei | 2018-10-27 | 5 | -1/+40 |
|\ \ | | | | | | | gl_rasterizer: Implement primitive restart. | ||||
| * | | gl_rasterizer: Implement primitive restart. | bunnei | 2018-10-26 | 5 | -1/+40 |
| |/ | |||||
* / | Implement Default Block Height for each format | FernandoS27 | 2018-10-27 | 1 | -0/+62 |
|/ | |||||
* | Merge pull request #1533 from FernandoS27/lmem | bunnei | 2018-10-26 | 3 | -1/+138 |
|\ | | | | | Implemented Shader Local Memory | ||||
| * | Implemented LD_L and ST_L | FernandoS27 | 2018-10-24 | 3 | -12/+112 |
| | | |||||
| * | Implement Shader Local Memory | FernandoS27 | 2018-10-24 | 1 | -0/+37 |
| | | |||||
* | | maxwell_3d: Add code for initializing register defaults. | bunnei | 2018-10-26 | 2 | -1/+21 |
| | | |||||
* | | gl_rasterizer: Implement depth range. | bunnei | 2018-10-26 | 4 | -13/+20 |
| | | |||||
* | | Merge pull request #1524 from FernandoS27/layers-fix | bunnei | 2018-10-25 | 3 | -72/+109 |
|\ \ | |/ |/| | rasterizer: Fix Layered Textures Loading and Cubemaps | ||||
| * | Fixed Layered Textures Loading and Cubemaps | FernandoS27 | 2018-10-23 | 3 | -72/+109 |
| | | |||||
* | | Merge pull request #1554 from FernandoS27/pointsize | bunnei | 2018-10-24 | 3 | -5/+28 |
|\ \ | | | | | | | Implement PointSize Output Attribute. | ||||
| * | | Implement PointSize | FernandoS27 | 2018-10-23 | 3 | -5/+28 |
| | | | |||||
* | | | decoders: Remove unused variable within SwizzledData() | Lioncash | 2018-10-24 | 1 | -1/+0 |
| | | | |||||
* | | | maxwell_3d: Remove unused variable within ProcessQueryGet() | Lioncash | 2018-10-24 | 1 | -1/+0 |
|/ / | |||||
* | | Merge pull request #1519 from ReinUsesLisp/vsetp | bunnei | 2018-10-23 | 2 | -75/+108 |
|\ \ | | | | | | | gl_shader_decompiler: Implement VSETP | ||||
| * | | gl_shader_decompiler: Implement VSETP | ReinUsesLisp | 2018-10-23 | 2 | -0/+26 |
| | | | |||||
| * | | gl_shader_decompiler: Abstract VMAD into a video subset | ReinUsesLisp | 2018-10-23 | 2 | -75/+82 |
| | | | |||||
* | | | Merge pull request #1539 from lioncash/dma | bunnei | 2018-10-23 | 3 | -19/+10 |
|\ \ \ | | | | | | | | | maxwell_dma: Silence compilation warnings | ||||
| * | | | engines/maxwell_*: Use nested namespace specifiers where applicable | Lioncash | 2018-10-20 | 3 | -12/+6 |
| | | | | | | | | | | | | | | | | | | | | | | | | These three source files are the only ones within the engines directory that don't use nested namespaces. We may as well change these over to keep things consistent. | ||||
| * | | | maxwell_dma: Make variables const where applicable within HandleCopy() | Lioncash | 2018-10-20 | 1 | -3/+3 |
| | | | | | | | | | | | | | | | | These are never modified, so we can make that assumption explicit. | ||||
| * | | | maxwell_dma: Make FlushAndInvalidate's size parameter a u64 | Lioncash | 2018-10-20 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | This prevents truncation warnings at the lambda's usage sites. | ||||
| * | | | maxwell_dma: Remove unused variables in HandleCopy() | Lioncash | 2018-10-20 | 1 | -3/+0 |
| | |/ | |/| | | | | | | | These pointer variables are never used, so we can get rid of them. | ||||
* | | | Merge pull request #1470 from FernandoS27/alpha_testing | bunnei | 2018-10-23 | 7 | -20/+87 |
|\ \ \ | | | | | | | | | Implemented Alpha Test using Shader Emulation | ||||
| * | | | Assert that multiple render targets are not set while alpha testing | FernandoS27 | 2018-10-22 | 3 | -3/+17 |
| | | | | |||||
| * | | | Use standard UBO and fix/stylize the code | FernandoS27 | 2018-10-22 | 8 | -91/+51 |
| | | | | |||||
| * | | | Cache uniform locations and restructure the implementation | FernandoS27 | 2018-10-22 | 3 | -33/+29 |
| | | | | |||||
| * | | | Remove SyncAlphaTest and clang format | FernandoS27 | 2018-10-22 | 4 | -8/+9 |
| | | | | |||||
| * | | | Added Alpha Func | FernandoS27 | 2018-10-22 | 2 | -3/+43 |
| | | | | |||||
| * | | | Implemented Alpha Testing | FernandoS27 | 2018-10-22 | 6 | -3/+59 |
| | | | | |||||
* | | | | Merge pull request #1512 from ReinUsesLisp/brk | bunnei | 2018-10-23 | 2 | -22/+43 |
|\ \ \ \ | |_|_|/ |/| | | | gl_shader_decompiler: Implement PBK and BRK | ||||
| * | | | gl_shader_decompiler: Implement PBK and BRK | ReinUsesLisp | 2018-10-18 | 2 | -22/+43 |
| | | | | |||||
* | | | | Merge pull request #1550 from FernandoS27/fmul32 | bunnei | 2018-10-23 | 2 | -3/+8 |
|\ \ \ \ | | | | | | | | | | | Added Saturation to FMUL32I | ||||
| * | | | | Added Saturation to FMUL32I | FernandoS27 | 2018-10-23 | 2 | -3/+8 |
| | |/ / | |/| | | |||||
* | | | | Merge pull request #1537 from lioncash/shader | bunnei | 2018-10-23 | 1 | -6/+7 |
|\ \ \ \ | |/ / / |/| | | | gl_shader_decompiler: Minor changes | ||||
| * | | | gl_shader_decompiler: Allow std::move to function in SetPredicate | Lioncash | 2018-10-20 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | If the variable being moved is const, then std::move will always perform a copy (since it can't actually move the data). | ||||
| * | | | gl_shader_decompiler: Get rid of variable shadowing warnings | Lioncash | 2018-10-20 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | A variable with the same name was previously declared in an outer scope. | ||||
| * | | | gl_shader_decompiler: Fix a few comment typos | Lioncash | 2018-10-20 | 1 | -3/+4 |
| | |/ | |/| | |||||
* | | | Merge pull request #1547 from FernandoS27/fix-fset | bunnei | 2018-10-22 | 2 | -30/+12 |
|\ \ \ | | | | | | | | | Fixed FSETP and FSET | ||||
| * | | | Fixed FSETP and FSET | FernandoS27 | 2018-10-22 | 2 | -30/+12 |
| |/ / | |||||
* / / | Fixed VAOs Float types only returning GL_FLOAT in cases that they had to return GL_HALF_FLOAT | FernandoS27 | 2018-10-22 | 1 | -2/+14 |
|/ / | |||||
* | | gl_shader_decompiler: Move position varying declaration back to gl_shader_gen | ReinUsesLisp | 2018-10-20 | 3 | -13/+9 |
| | | | | | | | | | | | | | | The intention of declaring them in gl_shader_decompiler was to be able to use blocks to implement geometry shaders. But that wasn't needed in the end and it caused issues when both vertex stages were being used, resulting in a redeclaration of "position". | ||||
* | | Merge pull request #1501 from ReinUsesLisp/half-float | bunnei | 2018-10-20 | 2 | -0/+458 |
|\ \ | | | | | | | gl_shader_decompiler: Implement H* instructions | ||||
| * | | gl_shader_decompiler: Implement HSET2_R | ReinUsesLisp | 2018-10-15 | 2 | -0/+62 |
| | | | |||||
| * | | gl_shader_decompiler: Implement HSETP2_R | ReinUsesLisp | 2018-10-15 | 2 | -0/+65 |
| | | | |||||
| * | | gl_shader_decompiler: Implement HFMA2 instructions | ReinUsesLisp | 2018-10-15 | 2 | -0/+85 |
| | | | |||||
| * | | gl_shader_decompiler: Implement HADD2_IMM and HMUL2_IMM | ReinUsesLisp | 2018-10-15 | 2 | -0/+73 |
| | | | |||||
| * | | gl_shader_decompiler: Implement non-immediate HADD2 and HMUL2 instructions | ReinUsesLisp | 2018-10-15 | 2 | -0/+75 |
| | | | |||||
| * | | gl_shader_decompiler: Setup base for half float unpacking and setting | ReinUsesLisp | 2018-10-15 | 2 | -0/+98 |
| | | | |||||
* | | | GPU: Improved implementation of maxwell DMA (Subv). | bunnei | 2018-10-19 | 3 | -17/+66 |
| | | | |||||
* | | | decoders: Introduce functions for un/swizzling subrects. | bunnei | 2018-10-19 | 2 | -0/+49 |
| | | | |||||
* | | | GPU: Invalidate destination address of kepler_memory writes. | bunnei | 2018-10-19 | 3 | -3/+17 |
| | | | |||||
* | | | fermi_2d: Add support for more accurate surface copies. | bunnei | 2018-10-19 | 2 | -3/+12 |
| | | | |||||
* | | | Merge pull request #1505 from FernandoS27/tex-3d | bunnei | 2018-10-18 | 4 | -2/+13 |
|\ \ \ | | | | | | | | | Implemented 3D Textures | ||||
| * | | | Clang format and other fixes | FernandoS27 | 2018-10-18 | 1 | -16/+0 |
| | | | | |||||
| * | | | Implement Reinterpret Surface, to accurately blit 3D textures | FernandoS27 | 2018-10-18 | 1 | -2/+4 |
| | | | | |||||
| * | | | Implement GetInRange in the Rasterizer Cache | FernandoS27 | 2018-10-18 | 1 | -0/+16 |
| | | | | |||||
| * | | | Implement 3D Textures | FernandoS27 | 2018-10-18 | 4 | -1/+10 |
| | | | | |||||
* | | | | Merge pull request #1489 from FernandoS27/fix-tlds | bunnei | 2018-10-18 | 1 | -1/+5 |
|\ \ \ \ | |/ / / |/| | | | shader_decompiler: Fix TLDS | ||||
| * | | | Fix TLDS | FernandoS27 | 2018-10-14 | 1 | -1/+5 |
| |/ / | |||||
* | | | Merge pull request #1497 from bunnei/flush-framebuffers | bunnei | 2018-10-18 | 8 | -174/+415 |
|\ \ \ | | | | | | | | | Implement flushing in the rasterizer cache | ||||
| * | | | gl_rasterizer_cache: Remove unnecessary block_depth=1 on Flush. | bunnei | 2018-10-18 | 1 | -1/+0 |
| | | | | |||||
| * | | | gl_rasterizer_cache: Remove unnecessary temporary buffer with unswizzle. | bunnei | 2018-10-18 | 1 | -5/+2 |
| | | | | |||||
| * | | | gl_rasterizer_cache: Use AccurateCopySurface for use_accurate_gpu_emulation. | bunnei | 2018-10-16 | 2 | -2/+18 |
| | | | | |||||
| * | | | config: Rename use_accurate_framebuffers -> use_accurate_gpu_emulation. | bunnei | 2018-10-16 | 3 | -6/+6 |
| | | | | | | | | | | | | | | | | - This will be used as a catch-all for slow-but-accurate GPU emulation paths. | ||||
| * | | | rasterizer_cache: Refactor to support in-order flushing. | bunnei | 2018-10-16 | 6 | -63/+116 |
| | | | | |||||
| * | | | gl_rasterizer_cache: Refactor to only call GetRegionEnd on surface creation. | bunnei | 2018-10-16 | 2 | -16/+23 |
| | | | | |||||
| * | | | gl_rasterizer_cache: Only flush when use_accurate_framebuffers is enabled. | bunnei | 2018-10-16 | 2 | -2/+13 |
| | | | | |||||
| * | | | gl_rasterizer_cache: Separate guest and host surface size managment. | bunnei | 2018-10-16 | 2 | -92/+94 |
| | | | | |||||
| * | | | gl_rasterizer_cache: Rename GetGLBytesPerPixel to GetBytesPerPixel. | bunnei | 2018-10-16 | 2 | -17/+18 |
| | | | | | | | | | | | | | | | | - This does not really have anything to do with OpenGL. | ||||
| * | | | gl_rasterizer_cache: Remove unused FlushSurface method. | bunnei | 2018-10-16 | 2 | -7/+0 |
| | | | | |||||
| * | | | gl_rasterizer: Implement flushing. | bunnei | 2018-10-16 | 1 | -1/+25 |
| | | | | |||||
| * | | | gl_rasterizer_cache: Remove usage of Memory::Read/Write functions. | bunnei | 2018-10-16 | 1 | -13/+8 |
| | | | | | | | | | | | | | | | | - These cannot be used within the cache, as they change cache state. | ||||
| * | | | gl_rasterizer_cache: Clamp cached surface size to mapped GPU region size. | bunnei | 2018-10-16 | 2 | -19/+37 |
| | | | | |||||
| * | | | memory_manager: Add a method for querying the end of a mapped GPU region. | bunnei | 2018-10-16 | 2 | -0/+11 |
| | | | | |||||
| * | | | rasterizer_cache: Reintroduce method for flushing. | bunnei | 2018-10-16 | 3 | -0/+23 |
| | | | | |||||
| * | | | gl_rasterizer_cache: Reintroduce code for handling swizzle and flush to guest RAM. | bunnei | 2018-10-16 | 2 | -28/+119 |
| | |/ | |/| | |||||
* | | | Merge pull request #1496 from FernandoS27/tex-array | bunnei | 2018-10-18 | 1 | -14/+55 |
|\ \ \ | |/ / |/| | | Implement Arrays on Tex Instruction | ||||
| * | | Implement Arrays on Tex Instruction | FernandoS27 | 2018-10-14 | 1 | -14/+55 |
| |/ | |||||
* | | shader_bytecode: Add Control Code enum 0xf | ReinUsesLisp | 2018-10-15 | 1 | -1/+1 |
| | | | | | | | | | | | | Control Code 0xf means to unconditionally execute the instruction. This value is passed to most BRA, EXIT and SYNC instructions (among others) but this may not always be the case. | ||||
* | | gl_shader_decompiler: Fixup style inconsistencies | ReinUsesLisp | 2018-10-15 | 1 | -5/+3 |
| | | |||||
* | | gl_rasterizer: Silence implicit cast warning in glBindBufferRange | ReinUsesLisp | 2018-10-15 | 1 | -1/+2 |
| | | |||||
* | | Merge pull request #1488 from Hexagon12/astc-types | bunnei | 2018-10-14 | 3 | -6/+32 |
|\ \ | | | | | | | video_core: Added ASTC 5x4; 8x5 types | ||||
| * | | Added ASTC 5x4; 8x5 | Hexagon12 | 2018-10-13 | 3 | -6/+32 |
| | | | |||||
* | | | Shorten the implementation of 3D swizzle to only 3 functions | FernandoS27 | 2018-10-14 | 1 | -70/+27 |
| | | | |||||
* | | | Fix a Crash on Zelda BotW and Splatoon 2, and simplified LoadGLBuffer | FernandoS27 | 2018-10-13 | 2 | -19/+2 |
| | | | |||||
* | | | Propagate depth and depth_block on modules using decoders | FernandoS27 | 2018-10-13 | 7 | -52/+64 |
| | | | |||||
* | | | Remove old Swizzle algorithms and use 3d Swizzle | FernandoS27 | 2018-10-13 | 1 | -93/+69 |
| | | | |||||
* | | | Implement Precise 3D Swizzle | FernandoS27 | 2018-10-13 | 1 | -3/+71 |
| | | | |||||
* | | | Implement Fast 3D Swizzle | FernandoS27 | 2018-10-13 | 1 | -2/+74 |
| |/ |/| | |||||
* | | Implemented helper function to correctly calculate a texture's size | FernandoS27 | 2018-10-12 | 2 | -0/+22 |
| | | |||||
* | | gl_shader_decompiler: Implement VMAD | ReinUsesLisp | 2018-10-11 | 2 | -0/+118 |
|/ | |||||
* | Merge pull request #1458 from FernandoS27/fix-render-target-block-settings | bunnei | 2018-10-11 | 5 | -18/+81 |
|\ | | | | | Fixed block height settings for RenderTargets and Depth Buffers | ||||
| * | Add memory Layout to Render Targets and Depth Buffers | FernandoS27 | 2018-10-10 | 3 | -21/+33 |
| | | |||||
| * | Fixed block height settings for RenderTargets and Depth Buffers, and added block width and block depth | FernandoS27 | 2018-10-10 | 5 | -12/+63 |
| | | |||||
* | | Merge pull request #1460 from FernandoS27/scissor_test | bunnei | 2018-10-10 | 3 | -1/+36 |
|\ \ | | | | | | | Implemented Scissor Testing | ||||
| * | | Implement Scissor Test | FernandoS27 | 2018-10-09 | 1 | -4/+9 |
| | | | |||||
| * | | Assert Scissor tests | FernandoS27 | 2018-10-09 | 3 | -1/+31 |
| |/ | |||||
* | | Merge pull request #1425 from ReinUsesLisp/geometry-shaders | bunnei | 2018-10-10 | 11 | -120/+543 |
|\ \ | | | | | | | gl_shader_decompiler: Implement geometry shaders | ||||
| * | | gl_shader_decompiler: Move position varying location from 15 to 0 and apply an offset | ReinUsesLisp | 2018-10-07 | 1 | -6/+10 |
| | | | |||||
| * | | gl_shader_decompiler: Implement geometry shaders | ReinUsesLisp | 2018-10-07 | 10 | -107/+522 |
| | | | |||||
| * | | video_core: Allow LabelGLObject to use extra info on any object | ReinUsesLisp | 2018-10-07 | 1 | -10/+14 |
| |/ | |||||
* / | gl_shader_decompiler: Remove unused variables in TMML's implementation | Lioncash | 2018-10-09 | 1 | -7/+3 |
|/ | | | | | Given "y" isn't always used, but "x" is, we can rearrange this to avoid unused variable warnings by changing the names of op_a and op_b | ||||
* | Merge pull request #1446 from bunnei/fast_fermi_copy | bunnei | 2018-10-07 | 8 | -40/+117 |
|\ | | | | | gl_rasterizer: Implement accelerated Fermi2D copies. | ||||
| * | fermi_2d: Implement simple copies with AccelerateSurfaceCopy. | bunnei | 2018-10-06 | 3 | -24/+36 |
| | | |||||
| * | gl_rasterizer: Add rasterizer cache code to handle accerated fermi copies. | bunnei | 2018-10-06 | 5 | -16/+60 |
| | | |||||
| * | gl_rasterizer_cache: Implement a simpler surface copy using glCopyImageSubData. | bunnei | 2018-10-06 | 1 | -0/+21 |
| | | |||||
* | | Merge pull request #1437 from FernandoS27/tex-mode2 | bunnei | 2018-10-07 | 6 | -69/+265 |
|\ \ | | | | | | | Implemented Depth Compare, Shadow Samplers and Texture Processing Modes for TEXS and TLDS | ||||
| * | | Implemented Depth Compare and Shadow Samplers | FernandoS27 | 2018-10-06 | 6 | -65/+224 |
| | | | |||||
| * | | Implemented Texture Processing Modes in TEXS and TLDS | FernandoS27 | 2018-10-03 | 1 | -5/+42 |
| | | | |||||
* | | | gl_rasterizer: Fixup undefined behaviour in SetupDraw | ReinUsesLisp | 2018-10-07 | 1 | -0/+1 |
| |/ |/| | |||||
* | | gl_rasterizer: Implement quads topology | ReinUsesLisp | 2018-10-04 | 8 | -46/+236 |
|/ | |||||
* | gl_rasterizer: Fixup unassigned point sizes | ReinUsesLisp | 2018-10-01 | 1 | -1/+4 |
| | |||||
* | Merge pull request #1330 from raven02/tlds | bunnei | 2018-10-01 | 1 | -7/+15 |
|\ | | | | | TLDS: Add 1D sampler | ||||
| * | Fix trailing whitespace | raven02 | 2018-09-30 | 1 | -1/+4 |
| | | |||||
| * | Merge branch 'master' into tlds | raven02 | 2018-09-19 | 24 | -195/+651 |
| |\ | |||||
| * | | Add 1D sampler for TLDS - TexelFetch (Mario Rabbids) | raven02 | 2018-09-17 | 1 | -7/+12 |
| | | | |||||
* | | | gl_rasterizer_cache: Fixes to how we do render to cubemap. | bunnei | 2018-09-30 | 2 | -32/+5 |
| | | | | | | | | | | | | - Fixes issues with Splatoon 2. | ||||
* | | | gl_rasterizer_cache: Add check for array rendering to cubemap texture. | bunnei | 2018-09-30 | 1 | -0/+8 |
| | | | |||||
* | | | gl_rasterizer_cache: Implement render to cubemap. | bunnei | 2018-09-30 | 3 | -119/+218 |
| | | | |||||
* | | | gl_shader_decompiler: TEXS: Implement TextureType::TextureCube. | bunnei | 2018-09-30 | 1 | -0/+8 |
| | | | |||||
* | | | gl_rasterizer_cache: Add support for SurfaceTarget::TextureCubemap. | bunnei | 2018-09-30 | 2 | -1/+36 |
| | | | |||||
* | | | gl_rasterizer_cache: Implement LoadGLBuffer for Texture2DArray. | bunnei | 2018-09-30 | 1 | -0/+8 |
| | | | |||||
* | | | gl_rasterizer_cache: Update BlitTextures to support non-Texture2D ColorTexture surfaces. | bunnei | 2018-09-30 | 1 | -23/+88 |
| | | | |||||
* | | | gl_rasterizer_cache: Track texture target and depth in the cache. | bunnei | 2018-09-30 | 1 | -2/+3 |
| | | | |||||
* | | | gl_rasterizer_cache: Workaround for Texture2D -> Texture2DArray scenario. | bunnei | 2018-09-30 | 3 | -6/+21 |
| | | | |||||
* | | | gl_rasterizer_cache: Keep track of surface 2D size separately from total size. | bunnei | 2018-09-30 | 2 | -32/+46 |
| | | | |||||
* | | | Merge pull request #1411 from ReinUsesLisp/point-size | bunnei | 2018-09-29 | 5 | -1/+27 |
|\ \ \ | | | | | | | | | video_core: Implement point_size and add point state sync | ||||
| * | | | video_core: Implement point_size and add point state sync | ReinUsesLisp | 2018-09-28 | 5 | -1/+27 |
| | | | | |||||
* | | | | Merge pull request #1406 from ReinUsesLisp/multibind-samplers | bunnei | 2018-09-29 | 5 | -8/+25 |
|\ \ \ \ | |/ / / |/| | | | gl_state: Pack sampler bindings into a single ARB_multi_bind | ||||
| * | | | gl_state: Pack sampler bindings into a single ARB_multi_bind | ReinUsesLisp | 2018-09-28 | 5 | -8/+25 |
| | | | | |||||
* | | | | Merge pull request #1377 from FernandoS27/faster-swizzle | bunnei | 2018-09-27 | 1 | -51/+66 |
|\ \ \ \ | |/ / / |/| | | | Improved Fast Swizzle and Legacy Swizzle | ||||
| * | | | Reverse stride align restriction on FastSwizzle due to lost performance | FernandoS27 | 2018-09-21 | 1 | -3/+2 |
| | | | | |||||
| * | | | Join both Swizzle methods within one interface function | FernandoS27 | 2018-09-21 | 1 | -11/+19 |
| | | | | |||||
| * | | | Standarized Legacy Swizzle to look alike FastSwizzle and use a Swizzling Table instead | FernandoS27 | 2018-09-21 | 1 | -42/+38 |
| | | | | |||||
| * | | | Remove same output bpp restriction on FastSwizzle | FernandoS27 | 2018-09-21 | 1 | -4/+5 |
| | | | | |||||
| * | | | Improved Legacy Swizzler to be better documented and work better | FernandoS27 | 2018-09-21 | 1 | -15/+21 |
| | | | | |||||
| * | | | Improved fast swizzle and removed restrictions to it | FernandoS27 | 2018-09-21 | 1 | -7/+12 |
| | | | | |||||
* | | | | video_core: Add asserts for CS, TFB and alpha testing | ReinUsesLisp | 2018-09-26 | 5 | -3/+92 |
| | | | | | | | | | | | | | | | | | | | | | | | | Add asserts for compute shader dispatching, transform feedback being enabled and alpha testing. These have in common that they'll probably break rendering without logging. | ||||
* | | | | Added glObjectLabels for renderdoc for textures and shader programs (#1384) | David | 2018-09-23 | 4 | -0/+48 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * Added glObjectLabels for renderdoc for textures and shader programs * Changed hardcoded "Texture" name to reflect the texture type instead * Removed string initialize | ||||
* | | | | correct BC6H | greggameplayer | 2018-09-23 | 1 | -2/+2 |
| | | | | |||||
* | | | | Merge pull request #1380 from lioncash/const | bunnei | 2018-09-22 | 1 | -8/+19 |
|\ \ \ \ | | | | | | | | | | | shader_bytecode: Make operator== and operator!= of IpaMode const qualified | ||||
| * | | | | shader_bytecode: Lay out the Ipa-related enums better | Lioncash | 2018-09-21 | 1 | -2/+12 |
| | | | | | | | | | | | | | | | | | | | | This is more consistent with the surrounding enums. | ||||
| * | | | | shader_bytecode: Make operator== and operator!= of IpaMode const qualified | Lioncash | 2018-09-21 | 1 | -6/+7 |
| |/ / / | | | | | | | | | | | | | | | | | These don't affect the state of the struct and can be const member functions. | ||||
* | | | | Merge pull request #1382 from lioncash/inc | bunnei | 2018-09-22 | 2 | -4/+1 |
|\ \ \ \ | | | | | | | | | | | gl_state: Remove unused type alias | ||||
| * | | | | gl_state: Remove unused type alias | Lioncash | 2018-09-22 | 2 | -4/+1 |
| |/ / / | | | | | | | | | | | | | | | | | | | | | This isn't used anywhere within the header, so we can remove it, along with the include that was previously necessary. This also uncovers an indirect include in the cpp file for the assertion macros. | ||||
* | | | | Merge pull request #1379 from lioncash/bitwise | bunnei | 2018-09-21 | 1 | -1/+1 |
|\ \ \ \ | | | | | | | | | | | gl_stream_buffer: Fix use of bitwise OR instead of logical OR in Map() | ||||
| * | | | | gl_stream_buffer: Fix use of bitwise OR instead of logical OR in Map() | Lioncash | 2018-09-21 | 1 | -1/+1 |
| |/ / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This was very likely intended to be a logical OR based off the conditioning and testing of inversion in one case. Even if this was intentional, this is the kind of non-obvious thing one should be clarifying with a comment. | ||||
* / / / | RasterizerGL: Use the correct framebuffer when clearing via the CLEAR_BUFFERS register. | Subv | 2018-09-21 | 1 | -1/+1 |
|/ / / | | | | | | | | | | | | | | | | Previously we were clearing the default backbuffer framebuffer. Found thanks to a Piglit test :) | ||||
* | / | gl_rasterizer: Fix StartAddress handling with indexed draw calls. | Markus Wick | 2018-09-19 | 1 | -6/+7 |
| |/ |/| | | | | | | | We uploaded the wrong data before. So the offset on the host GPU pointer may work for the first vertices, the last ones run out bounds. Let's just offset the upload instead. | ||||
* | | Merge pull request #1342 from lioncash/trunc | bunnei | 2018-09-19 | 1 | -4/+4 |
|\ \ | | | | | | | gl_shader_decompiler: Avoid truncation warnings within LD_A and ST_A code | ||||
| * | | gl_shader_decompiler: Avoid truncation warnings within LD_A and ST_A code | Lioncash | 2018-09-18 | 1 | -4/+4 |
| | | | | | | | | | | | | | | | | | | These are internally stored as u64 values, so using u32 here causes truncation warnings. Instead, we can just use u64 and preserve the bit width. | ||||
* | | | Merge pull request #1279 from FernandoS27/csetp | bunnei | 2018-09-19 | 2 | -21/+133 |
|\ \ \ | | | | | | | | | shader_decompiler: Implemented (Partialy) Control Codes and CSETP | ||||
| * | | | Implemented Internal Flags | FernandoS27 | 2018-09-18 | 1 | -13/+35 |
| | | | | |||||
| * | | | Implemented I2I.CC on the NEU control code, used by SMO | FernandoS27 | 2018-09-17 | 2 | -14/+18 |
| | | | | |||||
| * | | | Implemented CSETP | FernandoS27 | 2018-09-17 | 2 | -14/+49 |
| | | | | |||||
| * | | | Implemented Control Codes | FernandoS27 | 2018-09-17 | 2 | -0/+51 |
| |/ / | |||||
* | | | Merge pull request #1299 from FernandoS27/texture-sanatize | bunnei | 2018-09-19 | 2 | -3/+192 |
|\ \ \ | | | | | | | | | shader_decompiler: Asserts for Texture Instructions | ||||
| * | | | Added asserts for texture misc modes to texture instructions | FernandoS27 | 2018-09-17 | 1 | -2/+45 |
| | | | | |||||
| * | | | Added texture misc modes to texture instructions | FernandoS27 | 2018-09-17 | 1 | -1/+147 |
| |/ / | |||||
* | | | Merge pull request #1290 from FernandoS27/shader-header | bunnei | 2018-09-18 | 3 | -24/+111 |
|\ \ \ | |/ / |/| | | Implemented (Partialy) Shader Header | ||||
| * | | Replace old FragmentHeader for the new Header | FernandoS27 | 2018-09-11 | 2 | -31/+18 |
| | | | |||||
| * | | Implemented (Partialy) Shader Header | FernandoS27 | 2018-09-11 | 3 | -2/+102 |
| | | | |||||
* | | | Merge pull request #1311 from FernandoS27/fast-swizzle | bunnei | 2018-09-17 | 1 | -2/+49 |
|\ \ \ | | | | | | | | | Optimized Texture Swizzling | ||||
| * | | | Optimized Texture Swizzling | FernandoS27 | 2018-09-14 | 1 | -2/+49 |
| | | | | |||||
* | | | | Merge pull request #1316 from lioncash/shadow | bunnei | 2018-09-17 | 1 | -2/+0 |
|\ \ \ \ | | | | | | | | | | | gl_shader_decompiler: Get rid of variable shadowing within LEA instructions | ||||
| * | | | | gl_shader_decompiler: Get rid of variable shadowing within LEA instructions | Lioncash | 2018-09-14 | 1 | -2/+0 |
| | | | | | | | | | | | | | | | | | | | | These variables are already defined within an outer scope. | ||||
* | | | | | Merge pull request #1326 from FearlessTobi/port-4182 | bunnei | 2018-09-17 | 20 | -133/+138 |
|\ \ \ \ \ | | | | | | | | | | | | | Port #4182 from Citra: "Prefix all size_t with std::" | ||||
| * | | | | | Port #4182 from Citra: "Prefix all size_t with std::" | fearlessTobi | 2018-09-15 | 20 | -133/+138 |
| |/ / / / | |||||
* | | | | | Merge pull request #1329 from raven02/bgr5a1u | bunnei | 2018-09-17 | 2 | -0/+4 |
|\ \ \ \ \ | | | | | | | | | | | | | Implement RenderTargetFormat::BGR5A1_UNORM | ||||
| * | | | | | Implement RenderTargetFormat::BGR5A1_UNORM (Pokken Tournament DX) | raven02 | 2018-09-15 | 2 | -0/+4 |
| |/ / / / | |||||
* | | | / | Implement ASTC_2D_8X8 (Bayonetta 2) | raven02 | 2018-09-16 | 3 | -6/+20 |
| |_|_|/ |/| | | | |||||
* | | | | Merge pull request #1273 from Subv/ld_sizes | bunnei | 2018-09-15 | 2 | -7/+58 |
|\ \ \ \ | | | | | | | | | | | Shaders: Implemented multiple-word loads and stores to and from attribute memory. | ||||
| * | | | | Shaders: Implemented multiple-word loads and stores to and from attribute memory. | Subv | 2018-09-15 | 2 | -7/+58 |
| |/ / / | | | | | | | | | | | | | This seems to be an optimization performed by nouveau. | ||||
* | | | | Merge pull request #1271 from Subv/kepler_engine | bunnei | 2018-09-15 | 6 | -0/+146 |
|\ \ \ \ | |/ / / |/| | | | GPU: Basic implementation of the Kepler Inline Memory engine (p2mf). | ||||
| * | | | GPU: Basic implementation of the Kepler Inline Memory engine (p2mf). | Subv | 2018-09-12 | 6 | -0/+146 |
| | | | | | | | | | | | | | | | | This engine writes data from a FIFO register into the configured address. | ||||
* | | | | Use ARB_multi_bind for uniform buffers (#1287) | ReinUsesLisp | 2018-09-13 | 2 | -3/+23 |
| | | | | | | | | | | | | | | | | | | | | | | | | * gl_rasterizer: use ARB_multi_bind for uniform buffers * address feedback | ||||
* | | | | gl_rasterizer_cache: B5G6R5U should use GL_RGB8 as an internal format. | bunnei | 2018-09-13 | 1 | -1/+1 |
| |/ / |/| | | | | | | | | - Fixes a regression with Sonic Mania with ARB_texture_storage. | ||||
* | | | Merge pull request #1263 from FernandoS27/tex-mode | bunnei | 2018-09-12 | 2 | -1/+43 |
|\ \ \ | |/ / |/| | | shader_decompiler: Implemented (Partially) Texture Processing Modes | ||||
| * | | Implemented Texture Processing Modes | FernandoS27 | 2018-09-12 | 2 | -1/+43 |
| |/ | |||||
* | | Merge pull request #1278 from tech4me/bg-color-fix | bunnei | 2018-09-12 | 3 | -0/+8 |
|\ \ | | | | | | | Port Citra #4047 & #4052: add change background color support | ||||
| * | | Port Citra #4047 & #4052: add change background color support | tech4me | 2018-09-09 | 3 | -0/+8 |
| | | | |||||
* | | | Merge pull request #1295 from bunnei/accurate-copies | bunnei | 2018-09-12 | 2 | -18/+12 |
|\ \ \ | | | | | | | | | gl_rasterizer_cache: Improve accuracy of caching and copies. | ||||
| * | | | gl_rasterizer_cache: Always blit on recreate, regardless of format. | bunnei | 2018-09-12 | 1 | -6/+10 |
| | | | | | | | | | | | | | | | | - Fixes several rendering issues with Super Mario Odyssey. | ||||
| * | | | gl_shader_cache: Remove cache_width/cache_height. | bunnei | 2018-09-12 | 2 | -12/+2 |
| | |/ | |/| | | | | | | | | | | - This was once an optimization, but we no longer need it with the cache reserve. - This is also inaccurate. | ||||
* | | | Merge pull request #1294 from degasus/optimizations | bunnei | 2018-09-12 | 1 | -11/+8 |
|\ \ \ | | | | | | | | | gl_rasterizer: Use ARB_texture_storage. | ||||
| * | | | gl_rasterizer: Use ARB_texture_storage. | Markus Wick | 2018-09-11 | 1 | -11/+8 |
| |/ / | | | | | | | | | | | | | | | | | | | | | | It allows us to use texture views and it reduces the overhead within the GPU driver. But it disallows us to reallocate the texture, but we don't do so anyways. In the end, it is the new way to allocate textures, so there is no need to use the old way. | ||||
* | | | Implemented LEA and PSET | FernandoS27 | 2018-09-11 | 1 | -0/+91 |
| | | | |||||
* | | | Implemented encodings for LEA and PSET | FernandoS27 | 2018-09-11 | 1 | -0/+64 |
|/ / | |||||
* | | Fixed renderdoc input/output textures not working due to render targets | David Marcec | 2018-09-11 | 2 | -2/+9 |
| | | |||||
* | | Merge pull request #1286 from bunnei/multi-clear | bunnei | 2018-09-11 | 2 | -50/+66 |
|\ \ | | | | | | | gl_rasterizer: Implement clear for non-zero render targets. | ||||
| * | | gl_rasterizer: Implement clear for non-zero render targets. | bunnei | 2018-09-10 | 2 | -50/+66 |
| | | | | | | | | | | | | - Several misc. changes to ConfigureFramebuffers in support of this. | ||||
* | | | Merge pull request #1285 from bunnei/depth-fix | bunnei | 2018-09-11 | 1 | -6/+22 |
|\ \ \ | | | | | | | | | gl_rasterizer_cache: Only use depth for applicable texture formats. | ||||
| * | | | gl_rasterizer_cache: Only use depth for applicable texture formats. | bunnei | 2018-09-10 | 1 | -6/+22 |
| |/ / | | | | | | | | | | - Fixes an issue with Octopath Traveler leaving stale data here. | ||||
* | | | Merge pull request #1284 from bunnei/bgra8_srgb | bunnei | 2018-09-11 | 3 | -0/+4 |
|\ \ \ | | | | | | | | | gl_rasterizer_cache: Implement RenderTargetFormat::BGRA8_SRGB. | ||||
| * | | | gl_rasterizer_cache: Implement RenderTargetFormat::BGRA8_SRGB. | bunnei | 2018-09-10 | 3 | -0/+4 |
| |/ / | | | | | | | | | | - Used by Octopath Traveler (with multiple render targets). | ||||
* | | | video_core: Refactor command_processor. | Markus Wick | 2018-09-10 | 2 | -44/+42 |
| | | | | | | | | | | | | Inline the WriteReg helper as it is called ~20k times per frame. | ||||
* | | | video_core: Move command buffer loop. | Markus Wick | 2018-09-10 | 3 | -46/+72 |
| | | | | | | | | | | | | This moves the hot loop into video_core. This refactoring shall reduce the CPU overhead of calling ProcessCommandList. | ||||
* | | | rasterizer: Drop unused handler. | Markus Wick | 2018-09-10 | 4 | -8/+0 |
|/ / | | | | | | | | | | | | | | | This virtual function is called in a very hot spot, and it does nothing. If this kind of feature is required, please be more specific and add callbacks in the switch statement within Maxwell3D::WriteReg. There is no point in having another switch statement within the rasterizer. | ||||
* | | gl_rasterizer: Implement multiple color attachments. | bunnei | 2018-09-10 | 5 | -132/+95 |
| | | |||||
* | | Merge pull request #1268 from FernandoS27/tmml | bunnei | 2018-09-10 | 2 | -5/+67 |
|\ \ | | | | | | | shader_decompiler: Implemented TMML | ||||
| * | | Implemented TMML | FernandoS27 | 2018-09-10 | 2 | -5/+67 |
| | | | |||||
* | | | Merge pull request #1272 from Subv/dma_2d | bunnei | 2018-09-10 | 1 | -2/+10 |
|\ \ \ | | | | | | | | | GPU/DMA: Partially implemented the 'enable_2d' bit in the DMA engine. | ||||
| * | | | GPU/DMA: Partially implemented the 'enable_2d' bit in the DMA engine. | Subv | 2018-09-08 | 1 | -2/+10 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | When not set, this tells the GPU to only use the X size when performing a DMA copy. This is only implemented for linear->linear and tiled->tiled copies. Conversion copies still retain the assert. This bit is unset by some games for various purposes, and by nouveau when copying the vertex buffers. | ||||
* | | | | Merge pull request #1280 from zero334/improvements | bunnei | 2018-09-10 | 5 | -89/+101 |
|\ \ \ \ | |_|/ / |/| | | | video_core: fixed arithmetic overflow warnings & improved code style | ||||
| * | | | video_core: fixed arithmetic overflow warnings & improved code style | Patrick Elsässer | 2018-09-09 | 5 | -89/+101 |
| | |/ | |/| | | | | | | | | | | | | | | | | | | | - Fixed all warnings, for renderer_opengl items, which were indicating a possible incorrect behavior from integral promotion rules and types larger than those in which arithmetic is typically performed. - Added const for variables where possible and meaningful. - Added constexpr where possible. | ||||
* | | | Implemented TXQ dimension query type, used by SMO. | FernandoS27 | 2018-09-09 | 2 | -1/+36 |
| | | | |||||
* | | | Change name of TEXQ to TXQ, in order to match NVIDIA's naming | FernandoS27 | 2018-09-09 | 1 | -2/+2 |
|/ / | |||||
* | | Merge pull request #1256 from bunnei/tex-target-support | bunnei | 2018-09-08 | 11 | -229/+422 |
|\ \ | |/ |/| | Initial support for non-2D textures | ||||
| * | gl_rasterizer_cache: Improve accuracy of RecreateSurface for non-2D textures. | bunnei | 2018-09-08 | 2 | -27/+45 |
| | | |||||
| * | maxwell_3d: Remove assert that no longer applies. | bunnei | 2018-09-08 | 1 | -4/+0 |
| | | |||||
| * | gl_rasterizer_cache: Partially implement several non-2D texture types. | bunnei | 2018-09-08 | 1 | -30/+111 |
| | | |||||
| * | gl_shader_decompiler: Partially implement several non-2D texture types (Subv). | bunnei | 2018-09-08 | 2 | -32/+143 |
| | | |||||
| * | gl_rasterizer: Implement texture wrap mode p. | bunnei | 2018-09-08 | 2 | -2/+8 |
| | | |||||
| * | gl_rasterizer_cache: Track texture depth. | bunnei | 2018-09-08 | 3 | -4/+15 |
| | | |||||
| * | gl_rasterizer_cache: Remove impl. of FlushGLBuffer. | bunnei | 2018-09-08 | 1 | -34/+1 |
| | | | | | | | | - Will not work for non-2d textures, and was not used anyways. | ||||
| * | gl_rasterizer_cache: Keep track of texture type per surface. | bunnei | 2018-09-08 | 3 | -32/+84 |
| | | |||||
| * | gl_rasterizer_cache: Remove unused DownloadGLTexture. | bunnei | 2018-09-08 | 2 | -51/+0 |
| | | |||||
| * | gl_state: Keep track of texture target. | bunnei | 2018-09-08 | 5 | -26/+28 |
| | | |||||
* | | gl_rasterizer: Use baseInstance instead of moving the buffer points. | bunnei | 2018-09-08 | 1 | -21/+25 |
| | | | | | | | | | | | | | | This hopefully helps our cache not to redundant upload the vertex buffer. # Conflicts: # src/video_core/renderer_opengl/gl_rasterizer.cpp | ||||
* | | video_core: Arithmetic overflow warning fix for gl_rasterizer (#1262) | Patrick Elsässer | 2018-09-08 | 1 | -12/+14 |
|/ | | | | | | | | | | | | | | | * video_core: Arithmetic overflow fix for gl_rasterizer - Fixed warnings, which were indicating incorrect behavior from integral promotion rules and types larger than those in which arithmetic is typically performed. - Added const for variables where possible and meaningful. * Changed the casts from C to C++ style Changed the C-style casts to C++ casts as proposed. Took also care about signed / unsigned behaviour. | ||||
* | Merge pull request #1252 from lioncash/header | bunnei | 2018-09-07 | 1 | -0/+1 |
|\ | | | | | video_core/CMakeLists: Add missing gl_buffer_cache.h | ||||
| * | video_core/CMakeLists: Add missing gl_buffer_cache.h | Lioncash | 2018-09-06 | 1 | -0/+1 |
| | | | | | | | | | | Without this, the header file won't show up by default within IDEs such as Visual Studio. | ||||
* | | Merge pull request #1253 from lioncash/explicit | bunnei | 2018-09-07 | 2 | -8/+10 |
|\ \ | | | | | | | video_core/gl_buffer_cache: Minor tidying changes | ||||
| * | | gl_buffer_cache: Default initialize member variables | Lioncash | 2018-09-06 | 1 | -3/+3 |
| | | | | | | | | | | | | Ensures that the cache always has a deterministic initial state. | ||||
| * | | gl_buffer_cache: Make GetHandle() a const member function | Lioncash | 2018-09-06 | 2 | -2/+2 |
| | | | | | | | | | | | | | | | GetHandle() internally calls GetHandle() on the stream_buffer instance, which is a const member function, so this can be made const as well. | ||||
| * | | gl_buffer_cache: Remove unnecessary includes | Lioncash | 2018-09-06 | 2 | -2/+4 |
| | | | |||||
| * | | gl_buffer_cache: Make constructor explicit | Lioncash | 2018-09-06 | 1 | -1/+1 |
| |/ | | | | | | | Implicit conversions during construction isn't desirable here. | ||||
* | | Merge pull request #1255 from bunnei/minor-opt | bunnei | 2018-09-07 | 1 | -4/+2 |
|\ \ | | | | | | | gl_rasterizer: Call state.Apply only once on SetupShaders. | ||||
| * | | gl_rasterizer: Call state.Apply only once on SetupShaders. | bunnei | 2018-09-06 | 1 | -4/+2 |
| |/ | |||||
* / | gl_shader_decompiler: Implement saturate mode for IPA. | bunnei | 2018-09-06 | 1 | -1/+5 |
|/ | |||||
* | gl_shader_gen: Initialize position. | Markus Wick | 2018-09-06 | 1 | -0/+1 |
| | | | | | IMO the old code is fine, but nvidia raises shader compiler warnings. Trivial fix through... | ||||
* | Merge pull request #1243 from degasus/VAO_cache | bunnei | 2018-09-06 | 3 | -53/+60 |
|\ | | | | | gl_rasterizer: Implement a VAO cache. | ||||
| * | gl_rasterizer: Implement a VAO cache. | Markus Wick | 2018-09-05 | 3 | -53/+60 |
| | | | | | | | | | | | | This patch caches VAO objects instead of re-emiting all pointers per draw call. Configuring this pointers is known as a fast task, but it yields too many GL calls. So for better performance, just bind the VAO instead of 16 pointers. | ||||
* | | Merge pull request #1244 from FernandoS27/ipa | bunnei | 2018-09-06 | 2 | -47/+98 |
|\ \ | | | | | | | shader_decompiler: Implemented IPA Properly (Stage 1) | ||||
| * | | Implemented IPA Properly | FernandoS27 | 2018-09-06 | 2 | -47/+98 |
| | | | |||||
* | | | gl_rasterizer: Skip TODO log. | Markus Wick | 2018-09-05 | 1 | -1/+1 |
| |/ |/| | | | | | | | | | | | This is called ~3k times per frame in SMO ingame. My laptop spends ~3ms per frame on allocating and freeing this string. Let's just stop printing this kind of redundant information. | ||||
* | | renderer_opengl: Implement a buffer cache. | Markus Wick | 2018-09-05 | 5 | -86/+182 |
|/ | | | | | | | | | The idea of this cache is to avoid redundant uploads. So we are going to cache the uploaded buffers within the stream_buffer and just reuse the old pointers. The next step is to implement a VBO cache on GPU memory, but for now, I want to check the overhead of the cache management. Fetching the buffer over PCI-E should be quite fast. | ||||
* | gl_shader_cache: Use an u32 for the binding point cache. | Markus Wick | 2018-09-04 | 4 | -15/+23 |
| | | | | | | | The std::string generation with its malloc and free requirement was a noticeable overhead. Also switch to an ordered_map to avoid the std::hash call. As those maps usually have a size of two elements, the lookup time shall not matter. | ||||
* | Merge pull request #1237 from degasus/optimizations | bunnei | 2018-09-04 | 2 | -4/+4 |
|\ | | | | | Optimizations | ||||
| * | command_processor: Use std::array for bound_engines. | Markus Wick | 2018-09-04 | 2 | -4/+4 |
| | | | | | | | | | | subchannel is a 3 bit field. So there must not be more than 8 bound engines. And using a hashmap for up to 8 values is a bit overpowered. | ||||
* | | Merge pull request #1232 from lioncash/copy | bunnei | 2018-09-04 | 1 | -1/+1 |
|\ \ | | | | | | | gl_shader_decompiler: Use used_shaders member variable directly within GenerateDeclarations() | ||||
| * | | gl_shader_decompiler: Use used_shaders member variable directly within GenerateDeclarations() | Lioncash | 2018-09-02 | 1 | -1/+1 |
| |/ | | | | | | | | | Using the getter function intended for external code here makes an unnecessary copy of the already-accessible used_shaders vector. | ||||
* / | Update microprofile scopes. | Markus Wick | 2018-09-04 | 1 | -3/+11 |
|/ | | | | | | Blame the subsystems which deserve the blame :) The updated list is not complete, just the ones I've spotted on random sampling the stack trace. | ||||
* | Merge pull request #1213 from DarkLordZach/octopath-fs | bunnei | 2018-09-02 | 1 | -2/+3 |
|\ | | | | | filesystem/maxwell_3d: Various changes to boot Project Octopath Traveller | ||||
| * | maxwell_3d: Use CoreTiming for query timestamp | Zach Hilman | 2018-09-01 | 1 | -2/+3 |
| | | |||||
* | | Merge pull request #1215 from ogniK5377/texs-nodep-assert | bunnei | 2018-09-02 | 2 | -0/+3 |
|\ \ | | | | | | | Added assert for TEXS nodep | ||||
| * | | Added assert for TEXS nodep | David Marcec | 2018-09-01 | 2 | -0/+3 |
| |/ | |||||
* | | Merge pull request #1214 from ogniK5377/ipa-assert | bunnei | 2018-09-02 | 2 | -6/+13 |
|\ \ | | | | | | | Added better asserts to IPA, Renamed IPA modes to match mesa | ||||
| * | | Added better asserts to IPA, Renamed IPA modes to match mesa | David Marcec | 2018-09-01 | 2 | -6/+13 |
| |/ | | | | | | | | | | | | | | | | | | | IpaMode is changed to IpaInterpMode IpaMode is suppose to be 2 bits not 3 Added IpaSampleMode Added Saturate Renamed modes based on https://github.com/mesa3d/mesa/blob/d27c7918916cdc8092959124955f887592e37d72/src/gallium/drivers/nouveau/codegen/nv50_ir_emit_gm107.cpp#L2530 | ||||
* | | Merge pull request #1216 from ogniK5377/ffma-assert | bunnei | 2018-09-02 | 2 | -0/+9 |
|\ \ | | | | | | | Added FFMA asserts and missing fields | ||||
| * | | Removed saturate assert | David Marcec | 2018-09-01 | 2 | -2/+0 |
| | | | | | | | | | | | | Saturate already implemented | ||||
| * | | Changed tab5980_0 default from 0 -> 1 | David Marcec | 2018-09-01 | 1 | -2/+2 |
| | | | |||||
| * | | Added FFMA asserts | David Marcec | 2018-09-01 | 2 | -0/+11 |
| |/ | |||||
* | | Removed saturate assert | David Marcec | 2018-09-01 | 2 | -2/+0 |
| | | | | | | | | Unneeded as we already implement it | ||||
* | | Added FMUL asserts | David Marcec | 2018-09-01 | 2 | -0/+15 |
|/ | |||||
* | core/core: Replace includes with forward declarations where applicable | Lioncash | 2018-08-31 | 3 | -4/+4 |
| | | | | | | | | | | | The follow-up to e2457418dae19b889b2ad85255bb95d4cd0e4bff, which replaces most of the includes in the core header with forward declarations. This makes it so that if any of the headers the core header was previously including change, then no one will need to rebuild the bulk of the core, due to core.h being quite a prevalent inclusion. This should make turnaround for changes much faster for developers. | ||||
* | gl_rasterizer_cache: Use accurate framebuffer setting for accurate copies. | bunnei | 2018-08-31 | 2 | -73/+54 |
| | |||||
* | gl_rasterizer_cache: Also use reserve cache for RecreateSurface. | bunnei | 2018-08-31 | 2 | -24/+18 |
| | |||||
* | rasterizer_cache: Use boost::interval_map for a more accurate cache. | bunnei | 2018-08-31 | 1 | -33/+45 |
| | |||||
* | gl_renderer: Cache textures, framebuffers, and shaders based on CPU address. | bunnei | 2018-08-31 | 8 | -100/+53 |
| | |||||
* | gl_rasterizer: Fix issues with the rasterizer cache. | bunnei | 2018-08-31 | 4 | -46/+57 |
| | | | | | - Use a single cached page map. - Fix calculation of ending page. | ||||
* | Implement BC6H_UF16 & BC6H_SF16 (#1092) | greggameplayer | 2018-08-31 | 3 | -31/+55 |
| | | | | | | | | | * Implement BC6H_UF16 & BC6H_SF16 Require by ARMS * correct coding style * correct coding style part 2 | ||||
* | Merge pull request #1204 from lioncash/pimpl | bunnei | 2018-08-31 | 1 | -3/+4 |
|\ | | | | | core: Make the main System class use the PImpl idiom | ||||
| * | core: Make the main System class use the PImpl idiom | Lioncash | 2018-08-31 | 1 | -3/+4 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | core.h is kind of a massive header in terms what it includes within itself. It includes VFS utilities, kernel headers, file_sys header, ARM-related headers, etc. This means that changing anything in the headers included by core.h essentially requires you to rebuild almost all of core. Instead, we can modify the System class to use the PImpl idiom, which allows us to move all of those headers to the cpp file and forward declare the bulk of the types that would otherwise be included, reducing compile times. This change specifically only performs the PImpl portion. | ||||
* | | Merge pull request #1207 from degasus/hotfix | bunnei | 2018-08-31 | 1 | -1/+1 |
|\ \ | | | | | | | Report correct shader size. | ||||
| * | | Report correct shader size. | Markus Wick | 2018-08-31 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | Seems like this was an oversee in regards to 1fd979f50a9f4c21fa8cafba7268d959e3076924 It changed GLShader::ProgramCode to a std::vector, so sizeof is wrong. | ||||
* | | | Added predicate comparison GreaterEqualWithNan | Hexagon12 | 2018-08-31 | 2 | -3/+4 |
|/ / | |||||
* | | gl_shader_decompiler: Implement POPC (#1203) | Laku | 2018-08-31 | 2 | -0/+19 |
| | | | | | | | | | | | | * Implement POPC * implement invert | ||||
* | | Merge pull request #1200 from bunnei/improve-ipa | bunnei | 2018-08-30 | 2 | -1/+39 |
|\ \ | |/ |/| | gl_shader_decompiler: Improve IPA for Pass mode with Position attribute. | ||||
| * | gl_shader_decompiler: Improve IPA for Pass mode with Position attribute. | bunnei | 2018-08-29 | 2 | -1/+39 |
| | | |||||
* | | Shaders: Implemented IADD3 | tech4me | 2018-08-29 | 2 | -1/+84 |
|/ | |||||
* | Merge pull request #1193 from lioncash/priv | bunnei | 2018-08-28 | 4 | -16/+30 |
|\ | | | | | gpu: Make memory_manager private | ||||
| * | gpu: Make memory_manager private | Lioncash | 2018-08-28 | 4 | -16/+30 |
| | | | | | | | | | | | | | | | | | | | | Makes the class interface consistent and provides accessors for obtaining a reference to the memory manager instance. Given we also return references, this makes our more flimsy uses of const apparent, given const doesn't propagate through pointers in the way one would typically expect. This makes our mutable state more apparent in some places. | ||||
* | | Merge pull request #1192 from lioncash/unused | bunnei | 2018-08-28 | 1 | -2/+0 |
|\ \ | | | | | | | gl_rasterizer: Remove unused variables | ||||
| * | | gl_rasterizer: Remove unused variables | Lioncash | 2018-08-28 | 1 | -2/+0 |
| |/ | |||||
* / | gl_shader_cache: Remove unused program_code vector in GetShaderAddress() | Lioncash | 2018-08-28 | 1 | -2/+1 |
|/ | | | | | | | | Given std::vector is a type with a non-trivial destructor, this variable cannot be optimized away by the compiler, even if unused. Because of that, something that was intended to be fairly lightweight, was actually allocating 32KB and deallocating it at the end of the function. | ||||
* | renderer_opengl: Implement a new shader cache. | bunnei | 2018-08-28 | 9 | -285/+250 |
| | |||||
* | gl_rasterizer_cache: Update to use RasterizerCache base class. | bunnei | 2018-08-28 | 3 | -132/+20 |
| | |||||
* | video_core: Add RasterizerCache class for common cache management code. | bunnei | 2018-08-28 | 2 | -0/+117 |
| | |||||
* | Merge pull request #1169 from Lakumakkara/sel | bunnei | 2018-08-28 | 1 | -1/+1 |
|\ | | | | | shader_bytecode: fix SEL_IMM bitstring | ||||
| * | fix SEL_IMM bitstring | Laku | 2018-08-24 | 1 | -1/+1 |
| | | |||||
* | | Merge pull request #1174 from lioncash/debug | bunnei | 2018-08-27 | 2 | -26/+3 |
|\ \ | | | | | | | debug_utils: Minor individual interface changes | ||||
| * | | debug_utils: Remove unused includes | Lioncash | 2018-08-25 | 2 | -23/+0 |
| | | | | | | | | | | | | | | | Quite a bit of these aren't necessary directly within the debug_utils header and can be removed or included where actually necessary. | ||||
| * | | debug_utils: Make BreakpointObserver class' constructor explicit | Lioncash | 2018-08-25 | 1 | -1/+1 |
| | | | | | | | | | | | | Avoids implicit conversions. | ||||
| * | | debug_utils: Initialize active_breakpoint member of DebugContext | Lioncash | 2018-08-25 | 1 | -2/+2 |
| | | | | | | | | | | | | Ensures that all class members are initialized. | ||||
* | | | Merge pull request #1173 from lioncash/batch | bunnei | 2018-08-25 | 1 | -4/+4 |
|\ \ \ | | | | | | | | | maxwell3d: Move FinishedPrimitiveBatch event after AcceleratedDrawBatch() | ||||
| * | | | maxwell3d: Move FinishedPrimitiveBatch event after AcceleratedDrawBatch() | Lioncash | 2018-08-25 | 1 | -4/+4 |
| |/ / | | | | | | | | | | | | | The start and finish events should likely not be right after one another like this, otherwise the batch will appear to complete immediately | ||||
* | | | Merge pull request #1167 from lioncash/assert | bunnei | 2018-08-25 | 1 | -1/+2 |
|\ \ \ | |_|/ |/| | | gl_rasterizer: Correct assertion condition in SyncLogicOpState() | ||||
| * | | gl_rasterizer: Correct assertion condition in SyncLogicOpState() | Lioncash | 2018-08-24 | 1 | -1/+2 |
| |/ | | | | | | | | | Previously the assert would always be hit, since it was the equivalent of: array == nullptr, which is never true. | ||||
* / | Shaders: Added decodings for IADD3 instructions | tech4me | 2018-08-23 | 1 | -0/+6 |
|/ | |||||
* | Merge pull request #1160 from bunnei/surface-reserve | bunnei | 2018-08-23 | 2 | -17/+91 |
|\ | | | | | gl_rasterizer_cache: Several improvements | ||||
| * | gl_rasterizer_cache: Blit when possible on RecreateSurface. | bunnei | 2018-08-23 | 1 | -5/+12 |
| | | |||||
| * | gl_rasterizer_cache: Reserve surfaces that have already been created for later use. | bunnei | 2018-08-23 | 2 | -3/+61 |
| | | |||||
| * | gl_rasterizer_cache: Remove assert for RecreateSurface type. | bunnei | 2018-08-23 | 1 | -1/+0 |
| | | |||||
| * | gl_rasterizer_cache: Implement compressed texture copies. | bunnei | 2018-08-23 | 1 | -8/+18 |
| | | |||||
* | | gl_rasterizer: Implement stencil test. | bunnei | 2018-08-23 | 3 | -4/+58 |
| | | | | | | | | - Used by Splatoon 2. | ||||
* | | gl_rasterizer: Implement partial color clear and stencil clear. | bunnei | 2018-08-23 | 1 | -12/+42 |
| | | |||||
* | | maxwell_3d: Update to include additional stencil registers. | bunnei | 2018-08-23 | 1 | -20/+50 |
| | | |||||
* | | gl_state: Update to handle stencil front/back face separately. | bunnei | 2018-08-23 | 2 | -33/+38 |
|/ | |||||
* | Merge pull request #1157 from lioncash/vec | bunnei | 2018-08-23 | 2 | -11/+16 |
|\ | | | | | gl_shader_gen: Use a std::vector to represent program code instead of std::array | ||||
| * | gl_shader_gen: Make ShaderSetup's constructor explicit | Lioncash | 2018-08-22 | 1 | -1/+1 |
| | | | | | | | | Prevents implicit conversions. | ||||
| * | gl_shader_gen: Use a std::vector to represent program code instead of std::array | Lioncash | 2018-08-22 | 2 | -11/+16 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | While convenient as a std::array, it's also quite a large set of data as well (32KB). It being an array also means data cannot be std::moved. Any situation where the code is being set or relocated means that a full copy of that 32KB data must be done. If we use a std::vector we do need to allocate on the heap, however, it does allow us to std::move the data we have within the std::vector into another std::vector instance, eliminating the need to always copy the program data (as std::move in this case would just transfer the pointers and bare necessities over to the new vector instance). | ||||
* | | Merge pull request #1156 from Lakumakkara/lop3 | bunnei | 2018-08-23 | 2 | -0/+60 |
|\ \ | | | | | | | gl_shader_decompiler: Implement LOP3 | ||||
| * | | more fixes | Laku | 2018-08-22 | 1 | -6/+7 |
| | | | |||||
| * | | fixes | Laku | 2018-08-22 | 1 | -6/+12 |
| | | | |||||
| * | | remove debug logging | Laku | 2018-08-22 | 1 | -2/+0 |
| | | | |||||
| * | | implement lop3 | Laku | 2018-08-22 | 2 | -0/+55 |
| |/ | |||||
* / | renderer_opengl: Namespace OpenGL code | Lioncash | 2018-08-22 | 21 | -23/+70 |
|/ | | | | | | | Namespaces all OpenGL code under the OpenGL namespace. Prevents polluting the global namespace and allows clear distinction between other renderers' code in the future. | ||||
* | Merge pull request #840 from FearlessTobi/port-3353 | bunnei | 2018-08-22 | 1 | -1/+1 |
|\ | | | | | Port #3353 from Citra: "citra-qt: Add customizable speed limit target " | ||||
| * | Port #3353 from Citra | fearlessTobi | 2018-08-21 | 1 | -1/+1 |
| | | |||||
* | | Merge pull request #1154 from OatmealDome/topology-lines | bunnei | 2018-08-22 | 1 | -0/+2 |
|\ \ | | | | | | | maxwell_to_gl: Implement PrimitiveTopology::Lines | ||||
| * | | maxwell_to_gl: Implement PrimitiveTopology::Lines | OatmealDome | 2018-08-22 | 1 | -0/+2 |
| | | | | | | | | | Used by Splatoon 2's debug menu. | ||||
* | | | Merge pull request #1124 from Subv/logic_ops | bunnei | 2018-08-22 | 6 | -7/+108 |
|\ \ \ | |/ / |/| | | GPU: Implemented logic ops. | ||||
| * | | GPU: Implemented the logic op functionality of the GPU. | Subv | 2018-08-21 | 3 | -0/+61 |
| | | | | | | | | | | | | This will ASSERT if blending is enabled at the same time as logic ops. | ||||
| * | | GLState: Allow enabling/disabling GL_COLOR_LOGIC_OP independently from blending. | Subv | 2018-08-21 | 2 | -6/+19 |
| | | | |||||
| * | | GPU: Added registers for the logicop functionality. | Subv | 2018-08-21 | 1 | -1/+28 |
| |/ | |||||
* | | Revert "Shader: Use the right sampler type in the TEX, TEXS and TLDS instructions." | bunnei | 2018-08-22 | 2 | -153/+31 |
| | | | | | | | | | | - This reverts commit 3ef4b3d4b445960576f10d1ba6521580d03e3da8. - This commit had broken a lot of games. We really should do a full implementation of this in one change. | ||||
* | | shader_bytecode: Parenthesize conditional expression within GetTextureType() | Lioncash | 2018-08-21 | 1 | -1/+1 |
| | | | | | | | | Resolves a -Wlogical-op-parentheses warning. | ||||
* | | Merge pull request #1123 from lioncash/screen | bunnei | 2018-08-21 | 7 | -30/+25 |
|\ \ | | | | | | | rasterizer_interface: Remove renderer-specific ScreenInfo type from AccelerateDraw() in RasterizerInterface | ||||
| * | | rasterizer_interface: Remove ScreenInfo from AccelerateDraw()'s signature | Lioncash | 2018-08-21 | 5 | -17/+14 |
| | | | | | | | | | | | | | | | | | | This is an OpenGL renderer-specific data type. Given that, this type shouldn't be used within the base interface for the rasterizer. Instead, we can pass this information to the rasterizer via reference. | ||||
| * | | renderer_base: Make creation of the rasterizer, the responsibility of the renderers themselves | Lioncash | 2018-08-21 | 4 | -14/+12 |
| |/ | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Given we use a base-class type within the renderer for the rasterizer (RasterizerInterface), we want to allow renderers to perform more complex initialization if they need to do such a thing. This makes it important to reserve type information. Given the OpenGL renderer is quite simple settings-wise, this is just a simple shuffling of the initialization code. For something like Vulkan however this might involve doing something like: // Initialize and call rasterizer-specific function that requires // the full type of the instance created. auto raster = std::make_unique<VulkanRasterizer>(some, params); raster->CallSomeVulkanRasterizerSpecificFunction(); // Assign to base class variable rasterizer = std::move(raster) | ||||
* | | Merge pull request #1132 from Subv/gl_FragDepth | bunnei | 2018-08-21 | 1 | -1/+6 |
|\ \ | | | | | | | Shaders: Implement depth writing in fragment shaders. | ||||
| * | | Shaders: Implement depth writing in fragment shaders. | Subv | 2018-08-21 | 1 | -1/+6 |
| | | | | | | | | | | | | We'll write <last color output reg + 2> to gl_FragDepth. | ||||
* | | | Merge pull request #1134 from lioncash/log | bunnei | 2018-08-21 | 1 | -1/+1 |
|\ \ \ | | | | | | | | | renderer_opengl: Use LOG_DEBUG for GL_DEBUG_SEVERITY_NOTIFICATION and GL_DEBUG_SEVERITY_LOW logs | ||||
| * | | | renderer_opengl: Use LOG_DEBUG for GL_DEBUG_SEVERITY_NOTIFICATION and GL_DEBUG_SEVERITY_LOW logs | Lioncash | 2018-08-21 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | | | | | | | LOG_TRACE is only enabled on debug builds which can be quite slow when trying to debug graphics issues. Instead we can log the messages to the debug log, which is available on both release and debug builds. | ||||
* | | | | Merge pull request #1121 from Subv/tex_reinterpret | bunnei | 2018-08-21 | 4 | -16/+70 |
|\ \ \ \ | |/ / / |/| | | | Rasterizer: Use PBOs to reinterpret texture formats when games re-use the same memory. | ||||
| * | | | Rasterizer: Reinterpret the raw texture bytes instead of blitting (and thus doing format conversion) to a new texture when a game requests an old texture address with a different format. | Subv | 2018-08-20 | 1 | -3/+49 |
| | | | | |||||
| * | | | Rasterizer: Don't attempt to copy over the old texture's data when doing a format reinterpretation if we're only going to clear the framebuffer. | Subv | 2018-08-20 | 4 | -13/+21 |
| | |/ | |/| | |||||
* | | | Merge pull request #1133 from lioncash/guard | bunnei | 2018-08-21 | 1 | -0/+2 |
|\ \ \ | |_|/ |/| | | gl_stream_buffer: Add missing header guard | ||||
| * | | gl_stream_buffer: Add missing header guard | Lioncash | 2018-08-21 | 1 | -0/+2 |
| |/ | | | | | | | | | Prevents potential compilation errors from occuring due to multiple inclusions | ||||
* | | Merge pull request #1131 from bunnei/impl-tex3d-texcube | bunnei | 2018-08-21 | 2 | -2/+21 |
|\ \ | | | | | | | gl_shader_decompiler: Implement TextureCube/Texture3D for TEX/TEXS. | ||||
| * | | shader_bytecode: Replace some UNIMPLEMENTED logs. | bunnei | 2018-08-21 | 1 | -2/+6 |
| | | | |||||
| * | | gl_shader_decompiler: Implement Texture3D for TEXS. | bunnei | 2018-08-21 | 1 | -0/+7 |
| | | | |||||
| * | | gl_shader_decompiler: Implement TextureCube for TEX. | bunnei | 2018-08-21 | 1 | -0/+8 |
| | | | |||||
* | | | Merge pull request #1106 from Subv/multiple_rendertargets | bunnei | 2018-08-21 | 2 | -6/+45 |
|\ \ \ | |/ / |/| | | Shaders: Write all the enabled color outputs when a fragment shader exits. | ||||
| * | | Shaders: Write all the enabled color outputs when a fragment shader exits. | Subv | 2018-08-21 | 2 | -6/+45 |
| | | | | | | | | | | | | | | | | | | | | | We were only writing to the first render target before. Note that this is only the GLSL side of the implementation, supporting multiple render targets requires more changes in the OpenGL renderer. Dual Source blending is not implemented and stuff that uses it might not work at all. | ||||
* | | | Shaders: Fixed the coords in TEX with Texture2D. | Subv | 2018-08-21 | 1 | -1/+1 |
| | | | | | | | | | | | | | | | | | | The X and Y coordinates should be in gpr8 and gpr8+1, respectively. This fixes the cutscene rendering in Sonic Mania. | ||||
* | | | Shaders: Log and crash when using an unimplemented texture type in a texture sampling instruction. | Subv | 2018-08-21 | 1 | -5/+14 |
| |/ |/| | |||||
* | | Merge pull request #1104 from Subv/instanced_arrays | bunnei | 2018-08-20 | 2 | -4/+30 |
|\ \ | | | | | | | GLRasterizer: Implemented instanced vertex arrays. | ||||
| * | | GLRasterizer: Implemented instanced vertex arrays. | Subv | 2018-08-18 | 2 | -4/+30 |
| |/ | | | | | | | Before each draw call, for every enabled vertex array configured as instanced, we take the current instance id and divide it by its configured divisor, then we multiply that by the corresponding stride and increment the start address by the resulting amount. This way we can simulate the vertex array being incremented once per instance without actually using OpenGL's instancing functions. | ||||
* | | Merge pull request #1115 from Subv/texs_mask | bunnei | 2018-08-20 | 1 | -18/+18 |
|\ \ | | | | | | | Shaders/TEXS: Write to the correct output register when swizzling. | ||||
| * | | Shaders/TEXS: Fixed the component mask in the TEXS instruction. | Subv | 2018-08-20 | 1 | -18/+18 |
| | | | | | | | | | | | | Previously we could end up with a TEXS that didn't write any outputs, this was wrong. | ||||
* | | | Merge pull request #1112 from Subv/sampler_types | bunnei | 2018-08-20 | 3 | -33/+250 |
|\ \ \ | | | | | | | | | Shaders: Use the correct shader type when sampling textures. | ||||
| * | | | Shader: Implemented the TLD4 and TLD4S opcodes using GLSL's textureGather. | Subv | 2018-08-19 | 1 | -0/+51 |
| | | | | | | | | | | | | | | | | It is unknown how TLD4S determines the sampler type, more research is needed. | ||||
| * | | | Shader: Use the right sampler type in the TEX, TEXS and TLDS instructions. | Subv | 2018-08-19 | 2 | -29/+127 |
| | | | | | | | | | | | | | | | | Different sampler types have their parameters in different registers. | ||||
| * | | | Shader: Added bitfields for the texture type of the various sampling instructions. | Subv | 2018-08-19 | 1 | -1/+65 |
| | | | | |||||
| * | | | Shaders: Added decodings for TLD4 and TLD4S | Subv | 2018-08-19 | 1 | -3/+7 |
| | | | | |||||
* | | | | Implemented RGBA8_UINT | David Marcec | 2018-08-20 | 4 | -45/+58 |
| |/ / |/| | | | | | | | | Needed by kirby | ||||
* | | | Merge pull request #1089 from Subv/neg_bits | bunnei | 2018-08-19 | 2 | -16/+38 |
|\ \ \ | | | | | | | | | Shaders: Corrected the 'abs' and 'neg' bit usage in the float arithmetic instructions. | ||||
| * | | | Shaders: Corrected the 'abs' and 'neg' bit usage in the float arithmetic instructions. | Subv | 2018-08-18 | 2 | -16/+38 |
| | | | | | | | | | | | | | | | | We should definitely audit our shader generator for more errors like this. | ||||
* | | | | Merge pull request #1105 from Subv/convert_neg | bunnei | 2018-08-19 | 1 | -2/+0 |
|\ \ \ \ | | | | | | | | | | | Shader: Remove an unneeded assert, the negate bit is implemented for conversion instructions. | ||||
| * | | | | Shader: Remove an unneeded assert, the negate bit is implemented for conversion instructions. | Subv | 2018-08-18 | 1 | -2/+0 |
| | |_|/ | |/| | | |||||
* | | | | Merge pull request #1113 from Subv/texs_mask | bunnei | 2018-08-19 | 1 | -6/+11 |
|\ \ \ \ | | | | | | | | | | | Shaders/TEXS: Fixed the component mask in the TEXS instruction. | ||||
| * | | | | Shaders/TEXS: Fixed the component mask in the TEXS instruction. | Subv | 2018-08-19 | 1 | -6/+11 |
| | |_|/ | |/| | | | | | | | | | | Previously we could end up with a TEXS that didn't write any outputs, this was wrong. | ||||
* | | | | Merge pull request #1102 from ogniK5377/mirror-clamp-edge | bunnei | 2018-08-19 | 1 | -0/+2 |
|\ \ \ \ | | | | | | | | | | | Added WrapMode MirrorOnceClampToEdge | ||||
| * | | | | Added WrapMode MirrorOnceClampToEdge | David Marcec | 2018-08-18 | 1 | -0/+2 |
| | |/ / | |/| | | | | | | | | | | Used by splatoon 2 | ||||
* | | | | Merge pull request #1101 from Subv/ssy_stack | bunnei | 2018-08-19 | 1 | -3/+36 |
|\ \ \ \ | |_|/ / |/| | | | Shaders: Implemented a stack for the SSY/SYNC instructions. | ||||
| * | | | Shaders: Implemented a stack for the SSY/SYNC instructions. | Subv | 2018-08-18 | 1 | -3/+36 |
| |/ / | | | | | | | | | | The SSY instruction pushes an address into the stack, and the SYNC instruction pops it. The current stack depth is 20, we should figure out if this is enough or not. | ||||
* | | | Merge pull request #1109 from Subv/ldg_decode | bunnei | 2018-08-19 | 1 | -0/+4 |
|\ \ \ | | | | | | | | | Shaders: Added decodings for the LDG and STG instructions. | ||||
| * | | | Shaders: Added decodings for the LDG and STG instructions. | Subv | 2018-08-19 | 1 | -0/+4 |
| |/ / | |||||
* | | | Merge pull request #1108 from Subv/front_facing | bunnei | 2018-08-19 | 2 | -0/+7 |
|\ \ \ | | | | | | | | | Shaders: Implemented the gl_FrontFacing input attribute (attr 63). | ||||
| * | | | Shaders: Implemented the gl_FrontFacing input attribute (attr 63). | Subv | 2018-08-19 | 2 | -0/+7 |
| |/ / | |||||
* / / | Shader: Implemented the predicate and mode arguments of LOP. | Subv | 2018-08-18 | 2 | -11/+39 |
|/ / | | | | | | | | | | | The mode can be used to set the predicate to true depending on the result of the logic operation. In some cases, this means discarding the result (writing it to register 0xFF (Zero)). This is used by Super Mario Odyssey. | ||||
* | | Added predcondition GreaterThanWithNan | David Marcec | 2018-08-18 | 2 | -5/+8 |
| | | |||||
* | | Merge pull request #1096 from bunnei/supported-blits | bunnei | 2018-08-18 | 1 | -2/+0 |
|\ \ | | | | | | | gl_rasterizer_cache: Remove asserts for supported blits. | ||||
| * | | gl_rasterizer_cache: Remove asserts for supported blits. | bunnei | 2018-08-17 | 1 | -2/+0 |
| | | | |||||
* | | | renderer_opengl: Treat OpenGL errors as critical. | bunnei | 2018-08-17 | 1 | -1/+1 |
|/ / | |||||
* | | Merge pull request #1019 from Subv/vertex_divisor | bunnei | 2018-08-17 | 7 | -5/+28 |
|\ \ | | | | | | | Rasterizer: Manually implemented instanced rendering. | ||||
| * | | Rasterizer: Implemented instanced rendering. | Subv | 2018-08-15 | 7 | -5/+28 |
| | | | | | | | | | | | | | | | | | | We keep track of the current instance and update an uniform in the shaders to let them know which instance they are. Instanced vertex arrays are not yet implemented. | ||||
* | | | gl_rasterizer_cache: Treat Depth formats differently from DepthStencil. | bunnei | 2018-08-16 | 2 | -16/+26 |
| |/ |/| | |||||
* | | Shader/Conversion: Implemented the negate bit in F2F and I2I instructions. | Subv | 2018-08-15 | 1 | -4/+12 |
| | | |||||
* | | Shader/I2F: Implemented the negate I2F_C instruction variant. | Subv | 2018-08-15 | 1 | -7/+23 |
| | | |||||
* | | Shader/F2I: Implemented the negate bit in the I2F instruction | Subv | 2018-08-15 | 1 | -0/+4 |
| | | |||||
* | | Shader/F2I: Implemented the F2I_C instruction variant. | Subv | 2018-08-15 | 1 | -2/+10 |
| | | |||||
* | | Shader/F2I: Implemented the negate bit in the F2I instruction. | Subv | 2018-08-15 | 1 | -0/+4 |
| | | |||||
* | | Merge pull request #1077 from bunnei/rgba16u | bunnei | 2018-08-15 | 1 | -1/+9 |
|\ \ | | | | | | | gl_rasterizer_cache: Add RGBA16U to PixelFormatFromTextureFormat. | ||||
| * | | gl_rasterizer_cache: Add RGBA16U to PixelFormatFromTextureFormat. | bunnei | 2018-08-15 | 1 | -1/+9 |
| |/ | | | | | | | - Used by Breath of the Wild. | ||||
* / | gl_rasterizer_cache: Cleanup some PixelFormat names and logging. | bunnei | 2018-08-15 | 2 | -41/+71 |
|/ | |||||
* | Merge pull request #1069 from bunnei/vtx-sz | bunnei | 2018-08-15 | 1 | -5/+20 |
|\ | | | | | maxwell_to_gl: Properly handle UnsignedInt/SignedInt sizes. | ||||
| * | maxwell_to_gl: Properly handle UnsignedInt/SignedInt sizes. | bunnei | 2018-08-15 | 1 | -5/+20 |
| | | |||||
* | | Merge pull request #1070 from bunnei/cbuf-sz | bunnei | 2018-08-15 | 1 | -3/+3 |
|\ \ | | | | | | | gl_rasterizer: Fix upload size for constant buffers. | ||||
| * | | gl_rasterizer: Fix upload size for constant buffers. | bunnei | 2018-08-15 | 1 | -3/+3 |
| |/ | |||||
* | | Merge pull request #1071 from bunnei/fix-ldc | bunnei | 2018-08-15 | 1 | -13/+22 |
|\ \ | | | | | | | gl_shader_decompiler: Several fixes for indirect constant buffer loads. | ||||
| * | | gl_shader_decompiler: Several fixes for indirect constant buffer loads. | bunnei | 2018-08-15 | 1 | -13/+22 |
| |/ | |||||
* | | Merge pull request #1068 from bunnei/g8r8s | bunnei | 2018-08-15 | 2 | -34/+49 |
|\ \ | | | | | | | gl_rasterizer_cache: Implement G8R8S format. | ||||
| * | | gl_rasterizer_cache: Implement G8R8S format. | bunnei | 2018-08-15 | 2 | -34/+49 |
| |/ | | | | | | | - Used by Super Mario Odyssey. | ||||
* / | Implement Z16_UNORM in PixelFormatFromTextureFormat function | greggameplayer | 2018-08-15 | 1 | -0/+2 |
|/ | | | Require by Zelda Breath Of The Wild | ||||
* | Merge pull request #1058 from greggameplayer/BC7U_Fix | bunnei | 2018-08-14 | 1 | -1/+1 |
|\ | | | | | Fix BC7U | ||||
| * | Fix BC7U | greggameplayer | 2018-08-14 | 1 | -1/+1 |
| | | |||||
* | | renderer_opengl: Implement RenderTargetFormat::RGBA16_UNORM. | bunnei | 2018-08-14 | 4 | -37/+48 |
|/ | | | | - Used by Breath of the Wild. | ||||
* | Merge pull request #1052 from ogniK5377/xeno | bunnei | 2018-08-13 | 4 | -7/+45 |
|\ | | | | | Implement RG32UI and R32UI | ||||
| * | Implement RG32UI and R32UI | David Marcec | 2018-08-13 | 4 | -7/+45 |
| | | | | | | | | Needed for xenoblade | ||||
* | | maxwell_to_gl: Implement VertexAttribute::Size::Size_8. | bunnei | 2018-08-13 | 1 | -0/+1 |
|/ | | | | - Used by Breath of the Wild. | ||||
* | renderer_opengl: Implement RenderTargetFormat::RGBA16_UINT. | bunnei | 2018-08-13 | 4 | -34/+45 |
| | | | | - Used by Breath of the Wild. | ||||
* | Merge pull request #1045 from bunnei/rg8-unorm | bunnei | 2018-08-13 | 4 | -26/+61 |
|\ | | | | | renderer_opengl: Implement RenderTargetFormat::RG8_UNORM. | ||||
| * | renderer_opengl: Implement RenderTargetFormat::RG8_UNORM. | bunnei | 2018-08-13 | 4 | -26/+61 |
| | | | | | | | | - Used by Breath of the Wild. | ||||
* | | maxwell_to_gl: Implement PrimitiveTopology::LineStrip. | bunnei | 2018-08-13 | 1 | -0/+2 |
|/ | | | | - Used by Breath of the Wild. | ||||
* | gl_shader_decompiler: Implement XMAD instruction. | bunnei | 2018-08-13 | 2 | -4/+120 |
| | |||||
* | gl_rasterizer: Use a shared helper to upload from CPU memory. | Markus Wick | 2018-08-12 | 2 | -28/+33 |
| | |||||
* | gl_state: Don't track constant buffer mappings. | Markus Wick | 2018-08-12 | 3 | -41/+3 |
| | |||||
* | gl_rasterizer: Use the stream buffer for constant buffers. | Markus Wick | 2018-08-12 | 4 | -29/+32 |
| | |||||
* | gl_rasterizer: Use the streaming buffer itself for the constant buffer. | Markus Wick | 2018-08-12 | 2 | -33/+15 |
| | | | | Don't emut copies, especially not for data, which is used once. They just end in a huge GPU overhead. | ||||
* | gl_rasterizer: Use a helper for aligning the buffer. | Markus Wick | 2018-08-12 | 2 | -15/+22 |
| | |||||
* | Update the stream_buffer helper from Citra. | Markus Wick | 2018-08-12 | 4 | -184/+98 |
| | | | | Please see https://github.com/citra-emu/citra/pull/3666 for more details. | ||||
* | gl_shader_decompiler: Fix SetOutputAttributeToRegister empty check. | bunnei | 2018-08-12 | 1 | -2/+2 |
| | |||||
* | gl_shader_decompiler: Fix GLSL compiler error with KIL instruction. | bunnei | 2018-08-12 | 1 | -0/+8 |
| | |||||
* | Merge pull request #1020 from lioncash/namespace | bunnei | 2018-08-12 | 8 | -15/+26 |
|\ | | | | | core: Namespace EmuWindow | ||||
| * | core: Namespace EmuWindow | Lioncash | 2018-08-12 | 8 | -15/+26 |
| | | | | | | | | Gets the class out of the global namespace. | ||||
* | | Merge pull request #1021 from lioncash/warn | bunnei | 2018-08-12 | 1 | -1/+1 |
|\ \ | | | | | | | gl_rasterizer: Silence implicit truncation warning in SetupShaders() | ||||
| * | | gl_rasterizer: Silence implicit truncation warning in SetupShaders() | Lioncash | 2018-08-12 | 1 | -1/+1 |
| |/ | | | | | | | | | | | Previously this would warn of truncating a std::size_t to a u32. This is safe because we'll obviously never have more than UINT32_MAX amount of uniform buffers. | ||||
* | | Merge pull request #1024 from Subv/blend_gl | bunnei | 2018-08-12 | 2 | -0/+40 |
|\ \ | | | | | | | GPU/Maxwell3D: Implemented an alternative set of blend factors. | ||||
| * | | GPU/Maxwell3D: Implemented an alternative set of blend factors. | Subv | 2018-08-12 | 2 | -0/+40 |
| |/ | | | | | | | These are used by nouveau and some games like SMO. | ||||
* | | Merge pull request #1023 from Subv/invalid_attribs | bunnei | 2018-08-12 | 2 | -1/+11 |
|\ \ | | | | | | | RasterizerGL: Ignore invalid/unset vertex attributes. | ||||
| * | | RasterizerGL: Ignore invalid/unset vertex attributes. | Subv | 2018-08-12 | 2 | -1/+11 |
| |/ | | | | | | | This should make the es2gears example not crash anymore. | ||||
* / | Implement R8_UINT RenderTargetFormat & PixelFormat (#1014) | greggameplayer | 2018-08-12 | 4 | -55/+74 |
|/ | | | | - Used by Go Vacation | ||||
* | Merge pull request #1010 from bunnei/unk-vert-attrib-shader | bunnei | 2018-08-12 | 2 | -10/+11 |
|\ | | | | | gl_shader_decompiler: Improve handling of unknown input/output attributes. | ||||
| * | gl_shader_decompiler: Improve handling of unknown input/output attributes. | bunnei | 2018-08-12 | 2 | -10/+11 |
| | | |||||
* | | Merge pull request #1009 from bunnei/rg8-rgba8-snorm | bunnei | 2018-08-12 | 4 | -64/+93 |
|\ \ | | | | | | | Implement render target formats RGBA8_SNORM and RG8_SNORM. | ||||
| * | | gl_rasterizer: Implement render target format RG8_SNORM. | bunnei | 2018-08-12 | 4 | -8/+18 |
| | | | | | | | | | | | | - Used by Super Mario Odyssey. | ||||
| * | | gl_rasterizer: Implement render target format RGBA8_SNORM. | bunnei | 2018-08-12 | 4 | -64/+83 |
| |/ | | | | | | | - Used by Super Mario Odyssey. | ||||
* | | Merge pull request #1018 from Subv/ssy_sync | bunnei | 2018-08-12 | 2 | -8/+38 |
|\ \ | |/ |/| | GPU/Shader: Implemented SSY and SYNC as a set_target/jump pair. | ||||
| * | GPU/Shader: Don't predicate instructions that don't have a predicate field (SSY). | Subv | 2018-08-11 | 2 | -2/+13 |
| | | |||||
| * | GPU/Shaders: Implemented SSY and SYNC as a way to modify control flow during shader execution. | Subv | 2018-08-11 | 1 | -6/+25 |
| | | | | | | | | SSY sets the target label to jump to when the SYNC instruction is executed. | ||||
* | | Merge pull request #1016 from lioncash/video | bunnei | 2018-08-11 | 6 | -30/+42 |
|\ \ | | | | | | | video_core: Get rid of global variable g_toggle_framelimit_enabled | ||||
| * | | video_core; Get rid of global g_toggle_framelimit_enabled variable | Lioncash | 2018-08-11 | 6 | -25/+42 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | Instead, we make a struct for renderer settings and allow the renderer to update all of these settings, getting rid of the need for global-scoped variables. This also uncovered a few indirect inclusions for certain headers, which this commit also fixes. | ||||
| * | | renderer_base: Remove unused kFramebuffer enumeration | Lioncash | 2018-08-11 | 1 | -3/+0 |
| | | | | | | | | | | | | This is entirely unused and can be removed. | ||||
| * | | video_core: Remove unused Renderer enumeration | Lioncash | 2018-08-11 | 1 | -2/+0 |
| |/ | | | | | | | | | Currently we only have an OpenGL renderer, so this is unused in code (and occupies the Renderer identifier in the VideoCore namespace). | ||||
* | | Merge pull request #1003 from lioncash/var | bunnei | 2018-08-11 | 2 | -4/+2 |
|\ \ | | | | | | | video_core: Use variable template variants of type_traits interfaces where applicable | ||||
| * | | video_core: Use variable template variants of type_traits interfaces where applicable | Lioncash | 2018-08-10 | 2 | -4/+2 |
| | | | |||||
* | | | Implement R16S & R16UI & R16I RenderTargetFormats & PixelFormats and more (R16_UNORM needed by Fate Extella) (#848) | greggameplayer | 2018-08-11 | 4 | -19/+92 |
| |/ |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | * Implement R16S & R16UI & R16I RenderTargetFormats & PixelFormats Do a separate function in order to get Bytes Per Pixel of DepthFormat Apply the new function in gpu.h delete unneeded white space * correct merging error | ||||
* | | maxwell_to_gl: Implement VertexAttribute::Size::Size_8_8. | bunnei | 2018-08-10 | 1 | -0/+1 |
| | | | | | | | | - Used by Super Mario Odyssey. | ||||
* | | maxwell_to_gl: Implement VertexAttribute::Size::Size_32_32_32. | bunnei | 2018-08-10 | 1 | -0/+2 |
| | | | | | | | | - Used by Super Mario Odyssey. | ||||
* | | Merge pull request #1004 from lioncash/unused | bunnei | 2018-08-10 | 3 | -8/+6 |
|\ \ | | | | | | | gl_rasterizer_cache: Remove unused viewport parameter of GetFramebufferSurfaces() | ||||
| * | | gl_rasterizer_cache: Remove unused viewport parameter of GetFramebufferSurfaces() | Lioncash | 2018-08-10 | 3 | -8/+6 |
| |/ | |||||
* | | Merge pull request #1008 from yuzu-emu/revert-697-disable-depth-cull | bunnei | 2018-08-10 | 1 | -3/+1 |
|\ \ | | | | | | | Revert "gl_state: Temporarily disable culling and depth test." | ||||
| * | | Revert "gl_state: Temporarily disable culling and depth test." | bunnei | 2018-08-10 | 1 | -3/+1 |
| | | | |||||
* | | | textures: Refactor out for Texture/Depth FormatFromPixelFormat. | bunnei | 2018-08-10 | 4 | -179/+27 |
|/ / | |||||
* | | Merge pull request #995 from bunnei/gl-buff-bounds | bunnei | 2018-08-10 | 1 | -10/+12 |
|\ \ | |/ |/| | gl_rasterizer_cache: Add bounds checking for gl_buffer copies. | ||||
| * | gl_rasterizer_cache: Add bounds checking for gl_buffer copies. | bunnei | 2018-08-10 | 1 | -10/+12 |
| | | |||||
* | | Merge pull request #1001 from lioncash/reserve | bunnei | 2018-08-10 | 1 | -0/+2 |
|\ \ | |/ |/| | gl_shader_decompiler: Reserve element memory beforehand in BuildRegisterList() | ||||
| * | gl_shader_decompiler: Reserve element memory beforehand in BuildRegisterList() | Lioncash | 2018-08-09 | 1 | -0/+2 |
| | | | | | | | | | | Avoids potentially perfoming multiple reallocations when we know the total amount of memory we need beforehand. | ||||
* | | Merge pull request #991 from bunnei/ignore-mac | bunnei | 2018-08-10 | 1 | -4/+9 |
|\ \ | | | | | | | maxwell_3d: Ignore macros that have not been uploaded yet. | ||||
| * | | maxwell_3d: Ignore macros that have not been uploaded yet. | bunnei | 2018-08-09 | 1 | -4/+9 |
| | | | | | | | | | | | | - Used by Super Mario Odyssey (in game). | ||||
* | | | Implement SNORM for BC5/DXN2 (#998) | Khangaroo | 2018-08-10 | 2 | -38/+55 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * Implement BC5/DXN2 (#996) - Used by Kirby Star Allies. * Implement BC5/DXN2 SNORM UNORM for Kirby Star Allies SNORM for Super Mario Odyssey | ||||
* | | | gl_rasterizer_cache: Avoid iterator invalidation issues within InvalidateRegion() | Lioncash | 2018-08-09 | 1 | -2/+4 |
| |/ |/| | | | | | | | A range-based for loop can't be used when the container being iterated is also being erased from. | ||||
* | | Merge pull request #992 from bunnei/declr-pred | bunnei | 2018-08-09 | 1 | -4/+5 |
|\ \ | | | | | | | gl_shader_decompiler: Declare predicates on use. | ||||
| * | | gl_shader_decompiler: Declare predicates on use. | bunnei | 2018-08-09 | 1 | -4/+5 |
| |/ | | | | | | | - Used by Super Mario Odyssey (when going in game). | ||||
* | | Merge pull request #994 from lioncash/const | bunnei | 2018-08-09 | 1 | -7/+9 |
|\ \ | | | | | | | gl_rasterizer_cache: Use std::vector::assign vs resize() then copy for the non-tiled case | ||||
| * | | gl_rasterizer_cache: Invert conditional in LoadGLBuffer() | Lioncash | 2018-08-09 | 1 | -5/+5 |
| | | | | | | | | | | | | | | | | | | It's generally easier to follow code using conditionals that operate in terms of the true case followed by the false case (no chance of overlooking the exclamation mark). | ||||
| * | | gl_rasterizer_cache: Use std::vector::assign in LoadGLBuffer() for the non-tiled case | Lioncash | 2018-08-09 | 1 | -4/+6 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | resize() causes the vector to expand and zero out the added members to the vector, however we can avoid this zeroing by using assign(). Given we have the pointer to the data we want to copy, we can calculate the end pointer and directly copy the range of data without the need to perform the resize() beforehand. | ||||
| * | | gl_rasterizer_cache: Make pointer const in LoadGLBuffer() | Lioncash | 2018-08-09 | 1 | -1/+1 |
| |/ | | | | | | | | | This is only ever read from, so we can make the data it's pointing to const. | ||||
* | | Merge pull request #993 from bunnei/smo-vtx-pts | bunnei | 2018-08-09 | 1 | -0/+3 |
|\ \ | | | | | | | Implement VertexAttribute::Size::Size_16_16_16_16 and PrimitiveTopology::Points. | ||||
| * | | maxwell_to_gl: Implement VertexAttribute::Size::Size_16_16_16_16. | bunnei | 2018-08-09 | 1 | -0/+1 |
| | | | | | | | | | | | | - Used by Super Mario Odyssey (in game). | ||||
| * | | maxwell_to_gl: Implement PrimitiveTopology::Points. | bunnei | 2018-08-09 | 1 | -0/+2 |
| |/ | | | | | | | - Used by Super Mario Odyssey (in game). | ||||
* | | Merge pull request #984 from bunnei/rt-none | bunnei | 2018-08-09 | 1 | -0/+5 |
|\ \ | | | | | | | gl_rasterizer: Do not render when no render target is configured. | ||||
| * | | gl_rasterizer: Do not render when no render target is configured. | bunnei | 2018-08-09 | 1 | -0/+5 |
| | | | | | | | | | | | | - Used by Super Mario Odyssey. | ||||
* | | | Implement BC5/DXN2 (#996) | Khangaroo | 2018-08-09 | 3 | -33/+45 |
| | | | | | | | | | - Used by Kirby Star Allies. | ||||
* | | | Merge pull request #977 from bunnei/bgr565 | bunnei | 2018-08-09 | 2 | -0/+4 |
|\ \ \ | |_|/ |/| | | gl_rasterizer_cached: Implement RenderTargetFormat::B5G6R5_UNORM. | ||||
| * | | gl_rasterizer_cached: Implement RenderTargetFormat::B5G6R5_UNORM. | bunnei | 2018-08-08 | 2 | -0/+4 |
| | | | | | | | | | | | | - Used by Super Mario Odyssey. | ||||
* | | | Merge pull request #982 from bunnei/stub-unk-63 | bunnei | 2018-08-09 | 2 | -0/+9 |
|\ \ \ | | | | | | | | | gl_shader_decompiler: Stub input attribute Unknown_63. | ||||
| * | | | gl_shader_decompiler: Stub input attribute Unknown_63. | bunnei | 2018-08-08 | 2 | -0/+9 |
| |/ / | |||||
* | | | Merge pull request #976 from bunnei/shader-imm | bunnei | 2018-08-09 | 2 | -11/+6 |
|\ \ \ | | | | | | | | | gl_shader_decompiler: Let OpenGL interpret floats. | ||||
| * | | | gl_shader_decompiler: Let OpenGL interpret floats. | bunnei | 2018-08-08 | 2 | -11/+6 |
| |/ / | | | | | | | | | | | | | - Accuracy is lost in translation to string, e.g. with NaN. - Needed for Super Mario Odyssey. | ||||
* | | | Merge pull request #981 from bunnei/cbuf-corrupt | bunnei | 2018-08-09 | 4 | -3/+12 |
|\ \ \ | |_|/ |/| | | maxwell_3d: Use correct const buffer size and check bounds. | ||||
| * | | maxwell_3d: Use correct const buffer size and check bounds. | bunnei | 2018-08-08 | 4 | -3/+12 |
| |/ | | | | | | | - Fixes mem corruption with Super Mario Odyssey and Pokkén Tournament DX. | ||||
* | | Merge pull request #985 from bunnei/rt-r11g11b10 | bunnei | 2018-08-09 | 1 | -0/+1 |
|\ \ | | | | | | | gpu: Add R11G11B10_FLOAT to RenderTargetBytesPerPixel. | ||||
| * | | gpu: Add R11G11B10_FLOAT to RenderTargetBytesPerPixel. | bunnei | 2018-08-08 | 1 | -0/+1 |
| |/ | | | | | | | - Used by Super Mario Odyssey. | ||||
* | | Merge pull request #979 from bunnei/vtx88 | bunnei | 2018-08-09 | 1 | -0/+1 |
|\ \ | | | | | | | maxwell_to_gl: Implement VertexAttribute::Size::Size_8_8. | ||||
| * | | maxwell_to_gl: Implement VertexAttribute::Size::Size_8_8. | bunnei | 2018-08-08 | 1 | -0/+1 |
| |/ | |||||
* / | renderer_opengl: Use trace log in a few places. | bunnei | 2018-08-08 | 2 | -2/+2 |
|/ | |||||
* | Merge pull request #964 from Hexagon12/lower-logs | bunnei | 2018-08-08 | 1 | -4/+4 |
|\ | | | | | Lowered down the logging for command processor methods | ||||
| * | Lowered down the logging for methods | Hexagon12 | 2018-08-07 | 1 | -4/+4 |
| | | |||||
* | | Fixed the sRGB pixel format (#963) | Hexagon12 | 2018-08-08 | 1 | -1/+2 |
| | | | | | | | | | | | | * Changed the sRGB pixel format return * Add a message about SRGBA -> RGBA conversion | ||||
* | | maxwell_3d: Remove outdated assert. | bunnei | 2018-08-06 | 1 | -2/+0 |
| | | |||||
* | | gl_rasterizer_cache: Avoid superfluous surface copies. | bunnei | 2018-08-06 | 2 | -4/+21 |
| | | |||||
* | | Merge pull request #927 from bunnei/fix-texs | bunnei | 2018-08-05 | 1 | -2/+5 |
|\ \ | | | | | | | gl_shader_decompiler: Fix TEXS mask and dest. | ||||
| * | | gl_shader_decompiler: Fix TEXS mask and dest. | bunnei | 2018-08-05 | 1 | -2/+5 |
| | | | |||||
* | | | Merge pull request #912 from lioncash/global-var | bunnei | 2018-08-05 | 10 | -50/+50 |
|\ \ \ | |/ / |/| | | video_core: Eliminate the g_renderer global variable | ||||
| * | | renderer_base: Make Rasterizer() return the rasterizer by reference | Lioncash | 2018-08-04 | 2 | -4/+8 |
| | | | | | | | | | | | | | | | | | | | | | All calling code assumes that the rasterizer will be in a valid state, which is a totally fine assumption. The only way the rasterizer wouldn't be is if initialization is done incorrectly or fails, which is checked against in System::Init(). | ||||
| * | | video_core: Eliminate the g_renderer global variable | Lioncash | 2018-08-04 | 10 | -47/+43 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | We move the initialization of the renderer to the core class, while keeping the creation of it and any other specifics in video_core. This way we can ensure that the renderer is initialized and doesn't give unfettered access to the renderer. This also makes dependencies on types more explicit. For example, the GPU class doesn't need to depend on the existence of a renderer, it only needs to care about whether or not it has a rasterizer, but since it was accessing the global variable, it was also making the renderer a part of its dependency chain. By adjusting the interface, we can get rid of this dependency. | ||||
* | | | added braces for conditions | David Marcec | 2018-08-05 | 1 | -2/+3 |
| | | | |||||
* | | | fix the attrib format for ints | David Marcec | 2018-08-05 | 1 | -2/+7 |
| | | | |||||
* | | | Merge pull request #919 from lioncash/sign | bunnei | 2018-08-04 | 1 | -8/+9 |
|\ \ \ | |/ / |/| | | gl_shader_manager: Amend sign differences in an assertion comparison in SetShaderUniformBlockBinding() | ||||
| * | | gl_shader_manager: Invert conditional in SetShaderUniformBlockBinding() | Lioncash | 2018-08-04 | 1 | -7/+9 |
| | | | | | | | | | | | | | | | This lets us indent the majority of the code and places the error case first. | ||||
| * | | gl_shader_manager: Amend sign differences in an assertion comparison in SetShaderUniformBlockBinding() | Lioncash | 2018-08-04 | 1 | -3/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | Ensures both operands have the same sign in the comparison. While we're at it, we can get rid of the redundant casting of ub_size to an int. This type will always be trivial and alias a built-in type (not doing so would break backwards compatibility at a standard level). | ||||
* | | | Merge pull request #911 from lioncash/prototype | bunnei | 2018-08-04 | 1 | -3/+0 |
|\ \ \ | | | | | | | | | video_core: Remove unimplemented Start() function prototype | ||||
| * | | | video_core: Remove unimplemented Start() function prototype | Lioncash | 2018-08-03 | 1 | -3/+0 |
| | | | | | | | | | | | | | | | | Given this has no definition, we can just remove it entirely. | ||||
* | | | | Merge pull request #910 from lioncash/unused | bunnei | 2018-08-03 | 1 | -2/+0 |
|\ \ \ \ | |_|/ / |/| | | | gl_shader_decompiler: Remove unused variable in GenerateDeclarations() | ||||
| * | | | gl_shader_decompiler: Remove unused variable in GenerateDeclarations() | Lioncash | 2018-08-03 | 1 | -2/+0 |
| |/ / | | | | | | | | | | | | | This variable was being incremented, but we were never actually using it. | ||||
* / / | gl_shader_manager: Make ProgramManager's GetCurrentProgramStage() a const member function | Lioncash | 2018-08-03 | 1 | -1/+1 |
|/ / | | | | | | | This function doesn't modify class state, so it can be made const. | ||||
* | | Merge pull request #892 from lioncash/global | bunnei | 2018-08-03 | 8 | -51/+41 |
|\ \ | | | | | | | video_core: Make global EmuWindow instance part of the base renderer … | ||||
| * | | video_core: Make global EmuWindow instance part of the base renderer class | Lioncash | 2018-08-02 | 8 | -51/+41 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Makes the global a member of the RendererBase class. We also change this to be a reference. Passing any form of null pointer to these functions is incorrect entirely, especially given the code itself assumes that the pointer would always be in a valid state. This also makes it easier to follow the lifecycle of instances being used, as we explicitly interact the renderer with the rasterizer, rather than it just operating on a global pointer. | ||||
* | | | Merge pull request #901 from lioncash/ref | bunnei | 2018-08-03 | 1 | -2/+2 |
|\ \ \ | | | | | | | | | gl_shader_manager: Take ShaderSetup instances by const reference in UseProgrammableVertexShader() and UseProgrammableFragmentShader() | ||||
| * | | | gl_shader_manager: Take ShaderSetup instances by const reference in UseProgrammableVertexShader() and UseProgrammableFragmentShader() | Lioncash | 2018-08-02 | 1 | -2/+2 |
| |/ / | | | | | | | | | | | | | | | | | | | | | | | | | Avoids performing unnecessary copies of 65560 byte sized ShaderSetup instances, considering it's only used as part of lookup and not modified. Given the parameters were already const, it's likely taking these parameters by reference was intended but the ampersand was forgotten. | ||||
* | | | Merge pull request #902 from lioncash/array | bunnei | 2018-08-02 | 1 | -2/+3 |
|\ \ \ | | | | | | | | | gl_state: Make texture_units a std::array | ||||
| * | | | gl_state: Make texture_units a std::array | Lioncash | 2018-08-02 | 1 | -2/+3 |
| |/ / | | | | | | | | | | Gets rid of the use of a raw C array. | ||||
* / / | Implement RGB32F PixelFormat (#886) (used by Go Vacation) | greggameplayer | 2018-08-02 | 3 | -9/+23 |
|/ / | |||||
* | | Implement R32_FLOAT RenderTargetFormat | Unknown | 2018-08-01 | 3 | -0/+5 |
| | | |||||
* | | Merge pull request #869 from Subv/ubsan | bunnei | 2018-07-31 | 2 | -2/+6 |
|\ \ | |/ |/| | Corrected a few error cases detected by asan/ubsan | ||||
| * | MacroInterpreter: Avoid left shifting negative values. | Subv | 2018-07-31 | 2 | -2/+6 |
| | | | | | | | | The branch target is signed, so multiply by 4 instead of left shifting by 2 | ||||
* | | Merge pull request #808 from lioncash/mem-dedup | bunnei | 2018-07-26 | 1 | -14/+22 |
|\ \ | | | | | | | video_core/memory_manager: Avoid repeated unnecessary page slot lookups | ||||
| * | | video_core/memory_manager: Replace a loop with std::array's fill() function in PageSlot() | Lioncash | 2018-07-24 | 1 | -3/+1 |
| | | | | | | | | | | | | | | | We already have a function that does what this code was doing, so let's use that instead. | ||||
| * | | video_core/memory_manager: Avoid repeated unnecessary page slot lookups | Lioncash | 2018-07-24 | 1 | -11/+21 |
| | | | | | | | | | | | | | | | | | | | | | We don't need to keep calling the same function over and over again in a loop, especially when the behavior is slightly non-trivial. We can just keep a reference to the looked up location and do all the checking and assignments based off it instead. | ||||
* | | | GPU: Allow using R16F as a render target format. | Subv | 2018-07-26 | 2 | -1/+4 |
| |/ |/| | |||||
* | | Implement R16_G16 | Unknown | 2018-07-26 | 4 | -19/+100 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | correct trailing white spaces Delete tabs correct placement Add RG16F & RG16UI & RG16I & RG16S PixelFormats Return correct data according to changes done previously correct PixelFormat declaration correct coding style error correct coding style error part 2 correct RG16S Declaration error correct alignment | ||||
* | | Merge pull request #819 from Subv/srgb | bunnei | 2018-07-25 | 2 | -9/+17 |
|\ \ | | | | | | | GPU: Use the right texture format for sRGBA framebuffers. | ||||
| * | | GPU: Use the right texture format for sRGBA framebuffers. | Subv | 2018-07-25 | 2 | -9/+17 |
| | | | |||||
* | | | GPU: Allow the use of Z24S8 as a texture format. | Subv | 2018-07-25 | 1 | -0/+4 |
|/ / | |||||
* | | Merge pull request #816 from Subv/z32_s8 | bunnei | 2018-07-25 | 4 | -1/+16 |
|\ \ | | | | | | | GPU: Implemented the Z32_S8_X24 depth buffer format. | ||||
| * | | GPU: Implemented the Z32_S8_X24 depth buffer format. | Subv | 2018-07-25 | 4 | -1/+16 |
| | | | |||||
* | | | Merge pull request #815 from Subv/z32f_tex | bunnei | 2018-07-25 | 1 | -0/+4 |
|\ \ \ | | | | | | | | | GPU: Allow using Z32 as a texture format. | ||||
| * | | | GPU: Allow using Z32 as a texture format. | Subv | 2018-07-25 | 1 | -0/+4 |
| |/ / | |||||
* | | | Merge pull request #814 from Subv/rt_r8 | bunnei | 2018-07-25 | 2 | -0/+4 |
|\ \ \ | | | | | | | | | GPU: Allow the usage of R8 as a render target format. | ||||
| * | | | GPU: Allow the usage of R8 as a render target format. | Subv | 2018-07-25 | 2 | -0/+4 |
| |/ / | |||||
* | | | Merge pull request #809 from lioncash/rasterizer | bunnei | 2018-07-25 | 2 | -16/+13 |
|\ \ \ | | | | | | | | | gl_rasterizer: Minor cleanup | ||||
| * | | | gl_rasterizer: Replace magic number with GL_INVALID_INDEX in SetupConstBuffers() | Lioncash | 2018-07-24 | 1 | -3/+5 |
| | | | | | | | | | | | | | | | | | | | | This is just the named constant that OpenGL provides, so we can use that instead of using a literal -1 | ||||
| * | | | gl_rasterizer: Use std::string_view instead of std::string when checking for extensions | Lioncash | 2018-07-24 | 1 | -1/+3 |
| | | | | | | | | | | | | | | | | | | | | We can avoid heap allocations here by just using a std::string_view instead of performing unnecessary copying of the string data. | ||||
| * | | | gl_rasterizer: Use in-class member initializers where applicable | Lioncash | 2018-07-24 | 2 | -12/+5 |
| | |/ | |/| | | | | | | | We can just assign to the members directly in these cases. | ||||
* | | | Merge pull request #811 from Subv/code_address_assert | bunnei | 2018-07-25 | 1 | -8/+0 |
|\ \ \ | |_|/ |/| | | GPU: Remove the assert that required the CODE_ADDRESS to be 0. | ||||
| * | | GPU: Remove the assert that required the CODE_ADDRESS to be 0. | Subv | 2018-07-24 | 1 | -8/+0 |
| |/ | | | | | | | Games usually just leave it at 0 but nouveau sets it to something else. This already works fine, the assert is useless. | ||||
* / | GPU: Implemented the R16 and R16F texture formats. | Subv | 2018-07-24 | 3 | -5/+32 |
|/ | |||||
* | Merge pull request #799 from Subv/tex_r32f | bunnei | 2018-07-24 | 3 | -6/+19 |
|\ | | | | | GPU: Implement texture format R32F. | ||||
| * | GPU: Implement texture format R32F. | Subv | 2018-07-24 | 3 | -6/+19 |
| | | |||||
* | | Merge pull request #796 from bunnei/gl-uint | bunnei | 2018-07-24 | 1 | -0/+3 |
|\ \ | | | | | | | maxwell_to_gl: Implement VertexAttribute::Type::UnsignedInt. | ||||
| * | | maxwell_to_gl: Implement VertexAttribute::Type::UnsignedInt. | bunnei | 2018-07-24 | 1 | -0/+3 |
| | | | |||||
* | | | gl_rasterizer: Implement texture border color. | bunnei | 2018-07-24 | 3 | -11/+11 |
| | | | |||||
* | | | maxwell_to_gl: Implement Texture::WrapMode::Border. | bunnei | 2018-07-24 | 1 | -0/+2 |
| |/ |/| | |||||
* | | Merge pull request #791 from bunnei/rg32f-rgba32f-bgra8 | bunnei | 2018-07-24 | 5 | -12/+70 |
|\ \ | |/ |/| | gl_rasterizer_cache: Implement formats BGRA8_UNORM/RGBA32_FLOAT/RG32_FLOAT | ||||
| * | gl_rasterizer_cache: Implement RenderTargetFormat RG32_FLOAT. | bunnei | 2018-07-24 | 5 | -7/+25 |
| | | |||||
| * | gl_rasterizer_cache: Implement RenderTargetFormat RGBA32_FLOAT. | bunnei | 2018-07-24 | 2 | -10/+34 |
| | | |||||
| * | gl_rasterizer_cache: Implement RenderTargetFormat BGRA8_UNORM. | bunnei | 2018-07-24 | 4 | -8/+22 |
| | | |||||
| * | gl_rasterizer_cache: Add missing log statements. | bunnei | 2018-07-24 | 1 | -0/+2 |
| | | |||||
* | | Merge pull request #792 from lioncash/retval | bunnei | 2018-07-24 | 1 | -2/+2 |
|\ \ | | | | | | | gl_shader_decompiler: Correct return value of WriteTexsInstruction() | ||||
| * | | gl_shader_decompiler: Correct return value of WriteTexsInstruction() | Lioncash | 2018-07-24 | 1 | -2/+2 |
| | | | | | | | | | | | | This should be returning void, not a std::string | ||||
* | | | Merge pull request #790 from bunnei/shader-print-instr | bunnei | 2018-07-24 | 1 | -1/+2 |
|\ \ \ | | | | | | | | | gl_shader_decompiler: Print instruction value in shader comments. | ||||
| * | | | gl_shader_decompiler: Print instruction value in shader comments. | bunnei | 2018-07-24 | 1 | -1/+2 |
| | |/ | |/| | |||||
* | | | Merge pull request #788 from bunnei/shader-check-zero | bunnei | 2018-07-24 | 1 | -0/+6 |
|\ \ \ | |_|/ |/| | | gl_shader_decompiler: Check if SetRegister result is ZeroIndex. | ||||
| * | | gl_shader_decompiler: Check if SetRegister result is ZeroIndex. | bunnei | 2018-07-24 | 1 | -0/+6 |
| |/ | |||||
* | | gl_shader_decompiler: Implement shader instruction TLDS. | bunnei | 2018-07-24 | 1 | -29/+43 |
| | | |||||
* | | gl_shader_decompiler: Simplify GetCommonDeclarations() | Lioncash | 2018-07-23 | 1 | -5/+5 |
|/ | |||||
* | Merge pull request #769 from bunnei/shader-mask-fixes | bunnei | 2018-07-23 | 1 | -5/+9 |
|\ | | | | | shader_bytecode: Implement other TEXS masks. | ||||
| * | shader_bytecode: Implement other TEXS masks. | bunnei | 2018-07-22 | 1 | -5/+9 |
| | | |||||
* | | gl_shader_decompiler: Remove redundant Subroutine construction in AddSubroutine() | Lioncash | 2018-07-22 | 1 | -4/+8 |
|/ | | | | | | We don't need to toss away the Subroutine instance after the find() call and reconstruct another instance with the same data right after it. Particularly give Subroutine contains a std::set. | ||||
* | Merge pull request #767 from bunnei/shader-cleanup | bunnei | 2018-07-22 | 1 | -78/+15 |
|\ | | | | | gl_shader_decompiler: Remove unused state tracking and minor cleanup. | ||||
| * | gl_shader_decompiler: Remove unused state tracking and minor cleanup. | bunnei | 2018-07-22 | 1 | -78/+15 |
| | | |||||
* | | gl_shader_decompiler: Implement SEL instruction. | bunnei | 2018-07-22 | 2 | -0/+20 |
|/ | |||||
* | gl_rasterizer_cache: Blit surfaces on recreation instead of flush and load. | bunnei | 2018-07-22 | 2 | -2/+86 |
| | |||||
* | gl_rasterizer_cache: Use GPUVAddr as cache key, not parameter set. | bunnei | 2018-07-22 | 3 | -57/+46 |
| | |||||
* | gl_rasterizer_cache: Use zeta_width and zeta_height registers for depth buffer. | bunnei | 2018-07-22 | 2 | -11/+11 |
| | |||||
* | gl_rasterizer: Use zeta_enable register to enable depth buffer. | bunnei | 2018-07-22 | 1 | -2/+2 |
| | |||||
* | maxwell_3d: Add depth buffer enable, width, and height registers. | bunnei | 2018-07-22 | 1 | -2/+14 |
| | |||||
* | Merge pull request #748 from lioncash/namespace | bunnei | 2018-07-22 | 11 | -48/+24 |
|\ | | | | | video_core: Use nested namespaces where applicable | ||||
| * | video_core: Use nested namespaces where applicable | Lioncash | 2018-07-21 | 11 | -48/+24 |
| | | | | | | | | Compresses a few namespace specifiers to be more compact. | ||||
* | | Merge pull request #747 from lioncash/unimplemented | bunnei | 2018-07-21 | 2 | -3/+3 |
|\ \ | | | | | | | gl_shader_manager: Remove unimplemented function prototype | ||||
| * | | gl_shader_manager: Replace unimplemented function prototype | Lioncash | 2018-07-21 | 2 | -3/+3 |
| |/ | | | | | | | This was just a linker error waiting to happen. | ||||
* / | gpu: Rename Get3DEngine() to Maxwell3D() | Lioncash | 2018-07-21 | 3 | -11/+14 |
|/ | | | | This makes it match its const qualified equivalent. | ||||
* | Merge pull request #738 from lioncash/sign | bunnei | 2018-07-20 | 1 | -16/+20 |
|\ | | | | | gl_state: Get rid of mismatched sign conversions in Apply() | ||||
| * | gl_state: Make references const where applicable in Apply() | Lioncash | 2018-07-20 | 1 | -2/+3 |
| | | |||||
| * | gl_state: Get rid of mismatched sign conversions | Lioncash | 2018-07-20 | 1 | -14/+17 |
| | | | | | | | | | | While we're at it, amend the loop variable type to be the same width as that returned by the .size() call. | ||||
* | | Merge pull request #735 from lioncash/video-unused | bunnei | 2018-07-20 | 1 | -2/+0 |
|\ \ | | | | | | | maxwell_3d: Remove unused variable within GetStageTextures() | ||||
| * | | maxwell_3d: Remove unused variable within GetStageTextures() | Lioncash | 2018-07-20 | 1 | -2/+0 |
| |/ | |||||
* | | Merge pull request #731 from lioncash/shadow | bunnei | 2018-07-20 | 1 | -6/+4 |
|\ \ | | | | | | | gl_shader_decompiler: Eliminate variable and declaration shadowing | ||||
| * | | gl_shader_decompiler: Eliminate variable and declaration shadowing | Lioncash | 2018-07-20 | 1 | -6/+4 |
| |/ | | | | | | | | | Ensures that no identifiers are being hidden, which also reduces compiler warnings. | ||||
* / | gl_shader_decompiler: Remove unnecessary const from return values | Lioncash | 2018-07-20 | 1 | -2/+2 |
|/ | | | | | This adds nothing from a behavioral point of view, and can inhibit the move constructor/RVO | ||||
* | gl_state: Temporarily disable culling and depth test. | bunnei | 2018-07-19 | 1 | -1/+3 |
| | |||||
* | Merge pull request #687 from lioncash/instance | bunnei | 2018-07-19 | 3 | -15/+15 |
|\ | | | | | core: Don't construct instance of Core::System, just to access its live instance | ||||
| * | core: Don't construct instance of Core::System, just to access its live instance | Lioncash | 2018-07-19 | 3 | -15/+15 |
| | | | | | | | | | | | | | | | | | | This would result in a lot of allocations and related object construction, just to toss it all away immediately after the call. These are definitely not intentional, and it was intended that all of these should have been accessing the static function GetInstance() through the name itself, not constructed instances. | ||||
* | | decoders: Fix calc of swizzle image_width_in_gobs. | bunnei | 2018-07-19 | 1 | -1/+4 |
|/ | |||||
* | Merge pull request #678 from lioncash/astc | bunnei | 2018-07-18 | 1 | -78/+60 |
|\ | | | | | astc: Minor changes | ||||
| * | astc: Initialize vector size directly in Decompress | Lioncash | 2018-07-18 | 1 | -2/+1 |
| | | | | | | | | There's no need to perform a separate resize. | ||||
| * | astc: Mark functions as internally linked where applicable | Lioncash | 2018-07-18 | 1 | -17/+20 |
| | | |||||
| * | astc: const-correctness changes where applicable | Lioncash | 2018-07-18 | 1 | -14/+13 |
| | | | | | | | | | | A few member functions didn't actually modify class state, so these can be amended as necessary. | ||||
| * | astc: Delete Bits' copy contstructor and assignment operator | Lioncash | 2018-07-18 | 1 | -8/+6 |
| | | | | | | | | | | This also potentially avoids warnings, considering the copy assignment operator is supposed to have a return value. | ||||
| * | astc: In-class initialize member variables where appropriate | Lioncash | 2018-07-18 | 1 | -39/+22 |
| | | |||||
* | | vi: Partially implement buffer crop parameters. | bunnei | 2018-07-18 | 3 | -4/+20 |
|/ | |||||
* | GPU: Added register definitions for the stencil parameters. | Subv | 2018-07-17 | 1 | -2/+25 |
| | |||||
* | gl_rasterizer_cache: Implement texture format G8R8. | bunnei | 2018-07-15 | 3 | -9/+40 |
| | |||||
* | Merge pull request #665 from bunnei/fix-z24-s8 | bunnei | 2018-07-15 | 1 | -1/+2 |
|\ | | | | | gl_rasterizer_cache: Fix incorrect offset in ConvertS8Z24ToZ24S8. | ||||
| * | gl_rasterizer_cache: Fix incorrect offset in ConvertS8Z24ToZ24S8. | bunnei | 2018-07-15 | 1 | -1/+2 |
| | | |||||
* | | gl_rasterizer_cache: Implement depth format Z16_UNORM. | bunnei | 2018-07-15 | 3 | -1/+15 |
|/ | |||||
* | Merge pull request #598 from bunnei/makedonecurrent | bunnei | 2018-07-15 | 3 | -1/+27 |
|\ | | | | | OpenGL: Use MakeCurrent/DoneCurrent for multithreaded rendering. | ||||
| * | OpenGL: Use MakeCurrent/DoneCurrent for multithreaded rendering. | bunnei | 2018-07-14 | 3 | -1/+27 |
| | | |||||
* | | GPU: Always enable the depth write when clearing the depth buffer. | Subv | 2018-07-14 | 1 | -3/+8 |
|/ | | | | The GPU ignores that register when clearing, but OpenGL obeys the glDepthMask parameter, so we set the depth mask to GL_TRUE when clearing the depth buffer. It will be restored to the correct value automatically on the next draw call. | ||||
* | gl_rasterizer: Fix check for if a shader stage is enabled. | bunnei | 2018-07-13 | 3 | -35/+11 |
| | |||||
* | gl_shader_gen: Implement dual vertex shader mode. | bunnei | 2018-07-13 | 5 | -55/+139 |
| | | | | - When VertexA shader stage is enabled, we combine with VertexB program to make a single Vertex Shader stage. | ||||
* | Merge pull request #655 from bunnei/pred-lt-nan | bunnei | 2018-07-13 | 2 | -5/+7 |
|\ | | | | | gl_shader_decompiler: Implement PredCondition::LessThanWithNan. | ||||
| * | gl_shader_decompiler: Implement PredCondition::LessThanWithNan. | bunnei | 2018-07-13 | 2 | -5/+7 |
| | | |||||
* | | gl_shader_decompiler: Use FlowCondition field in EXIT instruction. | bunnei | 2018-07-13 | 2 | -8/+34 |
|/ | |||||
* | Merge pull request #652 from Subv/fadd32i | Sebastian Valle | 2018-07-13 | 2 | -0/+32 |
|\ | | | | | GPU: Implement the FADD32I shader instruction. | ||||
| * | GPU: Implement the FADD32I shader instruction. | Subv | 2018-07-12 | 2 | -0/+32 |
| | | |||||
* | | Merge pull request #651 from Subv/ffma_decode | bunnei | 2018-07-12 | 1 | -1/+1 |
|\ \ | | | | | | | GPU: Corrected the decoding of FFMA for immediate operands. | ||||
| * | | GPU: Corrected the decoding of FFMA for immediate operands. | Subv | 2018-07-12 | 1 | -1/+1 |
| |/ | |||||
* | | gl_rasterizer: Flip triangles when regs.viewport_transform[0].scale_y is negative. | bunnei | 2018-07-08 | 1 | -1/+4 |
| | | | | | | | | - Fixes a regression with Binding of Isaac. | ||||
* | | Merge pull request #625 from Subv/imnmx | bunnei | 2018-07-08 | 2 | -3/+31 |
|\ \ | | | | | | | GPU: Implemented the IMNMX shader instruction. | ||||
| * | | GPU: Implemented the IMNMX shader instruction. | Subv | 2018-07-04 | 2 | -3/+31 |
| | | | | | | | | | | | | It's similar to the FMNMX instruction but it works on integers. | ||||
* | | | GPU: Implemented the BC7U texture format. | Subv | 2018-07-07 | 3 | -7/+21 |
| |/ |/| | | | | | Note: Our version of glad exports GL_COMPRESSED_RGBA_BPTC_UNORM as GL_COMPRESSED_RGBA_BPTC_UNORM_ARB, maybe it's time we update it. | ||||
* | | Merge pull request #629 from Subv/depth_test | bunnei | 2018-07-05 | 2 | -9/+29 |
|\ \ | | | | | | | GPU: Allow using the old NV04 values for the depth test function. | ||||
| * | | GPU: Allow using the old NV04 values for the depth test function. | Subv | 2018-07-05 | 2 | -9/+29 |
| | | | | | | | | | | | | | | | | | | These seem to be just a valid as the GL token values. Thanks @ReinUsesLisp This restores graphical output to Disgaea 5 | ||||
* | | | Merge pull request #626 from Subv/shader_sync | bunnei | 2018-07-05 | 2 | -0/+12 |
|\ \ \ | |/ / |/| | | GPU: Stub the shader SYNC and DEPBAR instructions. | ||||
| * | | GPU: Stub the shader SYNC and DEPBAR instructions. | Subv | 2018-07-04 | 2 | -0/+12 |
| |/ | | | | | | | It is unknown at this moment if we actually need to do something with these instructions or if the GLSL compiler takes care of that for us. | ||||
* | | Merge pull request #624 from Subv/f2f_round | bunnei | 2018-07-05 | 1 | -0/+3 |
|\ \ | | | | | | | GPU: Implemented the F2F 'round' rounding mode. | ||||
| * | | GPU: Implemented the F2F 'round' rounding mode. | Subv | 2018-07-04 | 1 | -0/+3 |
| |/ | | | | | | | It's implemented via the GLSL 'roundEven()' function. | ||||
* | | Merge pull request #623 from Subv/vertex_types | bunnei | 2018-07-05 | 1 | -0/+8 |
|\ \ | | | | | | | GPU: Implement the Size_16_16 and Size_10_10_10_2 vertex attribute types | ||||
| * | | GPU: Implement the Size_16_16 and Size_10_10_10_2 vertex attribute types. | Subv | 2018-07-04 | 1 | -0/+8 |
| |/ | | | | | | | Both signed and unsigned variants. | ||||
* | | Merge pull request #622 from Subv/unused_tex | bunnei | 2018-07-05 | 2 | -2/+5 |
|\ \ | | | | | | | GPU: Ignore unused textures and corrected the TEX shader instruction decoding. | ||||
| * | | GPU: Ignore textures that the GLSL compiler deemed unused when binding textures to the shaders. | Subv | 2018-07-04 | 1 | -1/+4 |
| | | | |||||
| * | | GPU: Corrected the decoding for the TEX shader instruction. | Subv | 2018-07-04 | 1 | -1/+1 |
| |/ | |||||
* | | Merge pull request #621 from Subv/psetp_ | bunnei | 2018-07-05 | 2 | -0/+43 |
|\ \ | | | | | | | GPU: Implemented the PSETP shader instruction. | ||||
| * | | GPU: Implemented the PSETP shader instruction. | Subv | 2018-07-04 | 2 | -0/+43 |
| |/ | | | | | | | It's similar to the isetp and fsetp instructions but it works on predicates instead. | ||||
* | | Merge pull request #620 from Subv/depth_z32f | bunnei | 2018-07-05 | 3 | -2/+15 |
|\ \ | | | | | | | GPU: Implemented the 32 bit float depth buffer format. | ||||
| * | | GPU: Implemented the 32 bit float depth buffer format. | Subv | 2018-07-04 | 3 | -2/+15 |
| |/ | |||||
* / | GPU: Flip the triangle front face winding if the GPU is configured to not flip the triangles. | Subv | 2018-07-04 | 2 | -3/+29 |
|/ | | | | | | OpenGL's default behavior is already correct when the GPU is configured to flip the triangles. This fixes 1-2 Switch's splash screen. | ||||
* | GPU: Only configure the used framebuffers during clear. | Subv | 2018-07-04 | 4 | -17/+48 |
| | | | | Don't try to configure the color buffer if it is not being cleared, it may not be completely valid at this point. | ||||
* | Merge pull request #609 from Subv/clear_buffers | bunnei | 2018-07-04 | 5 | -16/+105 |
|\ | | | | | GPU: Implemented the CLEAR_BUFFERS register. | ||||
| * | GPU: Factor out the framebuffer configuration code for both Clear and Draw commands. | Subv | 2018-07-03 | 2 | -72/+39 |
| | | |||||
| * | GPU: Support clears that don't clear the color buffer. | Subv | 2018-07-03 | 2 | -6/+17 |
| | | |||||
| * | GPU: Bind and clear the render target when the CLEAR_BUFFERS register is written to. | Subv | 2018-07-03 | 4 | -0/+86 |
| | | |||||
| * | GPU: Added registers for the CLEAR_BUFFERS and CLEAR_COLOR methods. | Subv | 2018-07-03 | 1 | -2/+27 |
| | | |||||
* | | gl_rasterizer_cache: Implement PixelFormat S8Z24. | bunnei | 2018-07-03 | 3 | -11/+83 |
| | | |||||
* | | Merge pull request #607 from jroweboy/logging | bunnei | 2018-07-03 | 13 | -88/+87 |
|\ \ | | | | | | | Logging - Customizable backends | ||||
| * | | Update clang format | James Rowe | 2018-07-03 | 7 | -21/+20 |
| | | | |||||
| * | | Rename logging macro back to LOG_* | James Rowe | 2018-07-03 | 13 | -70/+70 |
| |/ | |||||
* | | Merge pull request #612 from bunnei/fix-cull | bunnei | 2018-07-03 | 1 | -2/+5 |
|\ \ | | | | | | | gl_rasterizer: Only set cull mode and front face if enabled. | ||||
| * | | gl_rasterizer: Only set cull mode and front face if enabled. | bunnei | 2018-07-03 | 1 | -2/+5 |
| |/ | |||||
* | | Merge pull request #611 from Subv/enabled_depth_test | bunnei | 2018-07-03 | 2 | -9/+13 |
|\ \ | | | | | | | GPU: Don't try to parse the depth test function if the depth test is disabled and use only the least significant 3 bits in the depth test func | ||||
| * | | GPU: Use only the least significant 3 bits when reading the depth test func. | Subv | 2018-07-03 | 1 | -9/+9 |
| | | | | | | | | | | | | Some games set the full GL define value here (including nouveau), but others just seem to set those last 3 bits. | ||||
| * | | GPU: Don't try to parse the depth test function if the depth test is disabled. | Subv | 2018-07-03 | 1 | -0/+4 |
| |/ | |||||
* | | Merge pull request #610 from Subv/mufu_8 | bunnei | 2018-07-03 | 2 | -0/+5 |
|\ \ | |/ |/| | GPU: Implemented MUFU suboperation 8, sqrt. | ||||
| * | GPU: Implemented MUFU suboperation 8, sqrt. | Subv | 2018-07-03 | 2 | -0/+5 |
| | | |||||
* | | Merge pull request #608 from Subv/depth | bunnei | 2018-07-03 | 9 | -32/+246 |
|\ \ | | | | | | | GPU: Implemented the depth buffer and depth test + culling | ||||
| * | | GPU: Set up the culling configuration on each draw. | Subv | 2018-07-03 | 1 | -6/+8 |
| | | | |||||
| * | | GPU: Set up the depth test state on every draw. | Subv | 2018-07-02 | 2 | -0/+14 |
| | | | |||||
| * | | MaxwellToGL: Added conversion functions for depth test and cull mode. | Subv | 2018-07-02 | 1 | -0/+50 |
| | | | |||||
| * | | GPU: Added registers for depth test and cull mode. | Subv | 2018-07-02 | 1 | -3/+51 |
| | | | |||||
| * | | GPU: Implemented the Z24S8 depth format and load the depth framebuffer. | Subv | 2018-07-02 | 7 | -24/+124 |
| |/ | |||||
* | | Merge pull request #606 from Subv/base_vertex | Sebastian Valle | 2018-07-02 | 2 | -8/+15 |
|\ \ | | | | | | | GPU: Fixed the index offset and implement BaseVertex when doing indexed rendering. | ||||
| * | | GPU: Implement offsetted rendering when using non-indexed drawing. | Subv | 2018-07-02 | 1 | -1/+1 |
| | | | |||||
| * | | GPU: Fixed the index offset rendering, and implemented the base vertex functionality. | Subv | 2018-07-02 | 1 | -6/+8 |
| | | | | | | | | | | | | This fixes Stardew Valley. | ||||
| * | | GPU: Added register definitions for the vertex buffer base element. | Subv | 2018-07-02 | 1 | -1/+6 |
| |/ | |||||
* | | Merge pull request #605 from Subv/dma_copy | Sebastian Valle | 2018-07-02 | 1 | -1/+5 |
|\ \ | | | | | | | GPU: Directly copy the pixels when performing a same-layout DMA. | ||||
| * | | GPU: Directly copy the pixels when performing a same-layout DMA. | Subv | 2018-07-02 | 1 | -1/+5 |
| | | | |||||
* | | | Merge pull request #604 from Subv/invalid_textures | bunnei | 2018-07-02 | 3 | -3/+12 |
|\ \ \ | |_|/ |/| | | GPU: Ignore invalid and disabled textures when drawing. | ||||
| * | | GPU: Ignore disabled textures and textures with an invalid address. | Subv | 2018-07-02 | 2 | -1/+10 |
| | | | |||||
| * | | GPU: Allow GpuToCpuAddress to return boost::none for unmapped addresses. | Subv | 2018-07-02 | 1 | -2/+2 |
| |/ | |||||
* | | Merge pull request #602 from Subv/mufu_subop | bunnei | 2018-07-01 | 2 | -6/+1 |
|\ \ | | | | | | | GPU: Corrected the size of the MUFU subop field, and removed incorrect "min" operation. | ||||
| * | | GPU: Corrected the size of the MUFU subop field, and removed incorrect "min" operation. | Subv | 2018-06-30 | 2 | -6/+1 |
| |/ | |||||
* | | Merge pull request #601 from Subv/rgba32_ui | bunnei | 2018-07-01 | 4 | -25/+48 |
|\ \ | | | | | | | GPU: Implement the RGBA32_UINT rendertarget format. | ||||
| * | | GPU: Implemented the RGBA32_UINT rendertarget format. | Subv | 2018-06-30 | 4 | -9/+28 |
| | | | |||||
| * | | GLCache: Specify the component type along the texture type in the format tuple. | Subv | 2018-06-30 | 1 | -17/+21 |
| |/ | |||||
* / | gl_shader_decompiler: Implement predicate NotEqualWithNan. | bunnei | 2018-06-30 | 2 | -17/+24 |
|/ | |||||
* | Merge pull request #595 from bunnei/raster-cache | bunnei | 2018-06-29 | 8 | -1454/+404 |
|\ | | | | | Rewrite the OpenGL rasterizer cache | ||||
| * | gl_rasterizer_cache: Only dereference color_surface/depth_surface if valid. | bunnei | 2018-06-29 | 1 | -2/+6 |
| | | |||||
| * | gl_rasterizer_cache: Implement caching for texture and framebuffer surfaces. | bunnei | 2018-06-27 | 3 | -16/+168 |
| | | | | | | | | | | | | gl_rasterizer_cache: Improved cache management based on Citra's implementation. gl_surface_cache: Add some docstrings. | ||||
| * | gl_rasterizer_cache: Various fixes for ASTC handling. | bunnei | 2018-06-27 | 2 | -35/+39 |
| | | |||||
| * | gl_rasterizer_cache: Use SurfaceParams as a key for surface caching. | bunnei | 2018-06-27 | 2 | -43/+72 |
| | | |||||
| * | maxwell_3d: Add a struct for RenderTargetConfig. | bunnei | 2018-06-27 | 1 | -17/+19 |
| | | |||||
| * | gl_rasterizer: Implement AccelerateDisplay to forward textures to framebuffers. | bunnei | 2018-06-27 | 6 | -8/+62 |
| | | |||||
| * | gl_rasterizer_cache: Cache size_in_bytes as a const per surface. | bunnei | 2018-06-27 | 2 | -9/+13 |
| | | |||||
| * | gl_rasterizer_cache: Refactor to make SurfaceParams members const. | bunnei | 2018-06-27 | 2 | -52/+37 |
| | | |||||
| * | gl_rasterizer_cache: Remove Citra's rasterizer cache, always load/flush surfaces. | bunnei | 2018-06-27 | 4 | -1494/+210 |
| | | |||||
* | | gl_shader_decompiler: Add a return path for unknown instructions. | bunnei | 2018-06-27 | 1 | -0/+1 |
|/ | |||||
* | gl_rasterizer: Workaround for when exceeding max UBO size. | bunnei | 2018-06-27 | 2 | -1/+7 |
| | |||||
* | Merge pull request #593 from bunnei/fix-swizzle | bunnei | 2018-06-27 | 5 | -12/+20 |
|\ | | | | | gl_state: Fix state management for texture swizzle. | ||||
| * | gl_state: Fix state management for texture swizzle. | bunnei | 2018-06-26 | 5 | -12/+20 |
| | | |||||
* | | Merge pull request #592 from bunnei/cleanup-gl-state | bunnei | 2018-06-27 | 2 | -94/+0 |
|\ \ | | | | | | | gl_state: Remove unused state management from 3DS. | ||||
| * | | gl_state: Remove unused state management from 3DS. | bunnei | 2018-06-26 | 2 | -94/+0 |
| |/ | |||||
* / | gl_rasterizer_cache: Fix inverted B5G6R5 format. | bunnei | 2018-06-26 | 1 | -1/+1 |
|/ | |||||
* | Merge pull request #554 from Subv/constbuffer_ubo | bunnei | 2018-06-26 | 4 | -18/+39 |
|\ | | | | | Rasterizer: Use UBOs instead of SSBOs for uploading const buffers. | ||||
| * | Rasterizer: Use UBOs instead of SSBOs for uploading const buffers. | Subv | 2018-06-10 | 4 | -18/+39 |
| | | | | | | | | This should help a bit with GPU performance once we're GPU-bound. | ||||
* | | Fix crash at exit | mailwl | 2018-06-25 | 1 | -2/+4 |
| | | |||||
* | | Build: Fixed some MSVC warnings in various parts of the code. | Subv | 2018-06-20 | 7 | -12/+13 |
| | | |||||
* | | Merge pull request #574 from Subv/shader_abs_neg | bunnei | 2018-06-19 | 1 | -7/+14 |
|\ \ | | | | | | | GPU: Perform negation after absolute value in the float shader instructions. | ||||
| * | | GPU: Perform negation after absolute value in the float shader instructions. | Subv | 2018-06-19 | 1 | -7/+14 |
| | | | |||||
* | | | GPU: Don't mark uniform buffers and registers as used for instructions which don't have them. | Subv | 2018-06-19 | 2 | -14/+18 |
|/ / | | | | | | | | | Like the MOV32I and FMUL32I instructions. This fixes a potential crash when using these instructions. | ||||
* | | Merge pull request #570 from bunnei/astc | bunnei | 2018-06-19 | 6 | -1/+1709 |
|\ \ | | | | | | | gl_rasterizer: Implement texture format ASTC_2D_4X4. | ||||
| * | | gl_rasterizer: Implement texture format ASTC_2D_4X4. | bunnei | 2018-06-18 | 6 | -1/+1709 |
| | | | |||||
* | | | Merge pull request #571 from Armada651/loose-blend | bunnei | 2018-06-18 | 1 | -1/+1 |
|\ \ \ | |/ / |/| | | gl_rasterizer: Get loose on independent blending. | ||||
| * | | gl_rasterizer: Get loose on independent blending. | Jules Blok | 2018-06-18 | 1 | -1/+1 |
| | | | |||||
* | | | gl_rasterizer_cache: Loosen things up a bit. | bunnei | 2018-06-18 | 1 | -26/+8 |
| | | | |||||
* | | | gl_shader_decompiler: Implement LOP instructions. | bunnei | 2018-06-17 | 2 | -6/+42 |
| | | | |||||
* | | | gl_shader_decompiler: Refactor LOP32I instruction a bit in support of LOP. | bunnei | 2018-06-17 | 2 | -57/+42 |
|/ / | |||||
* | | gl_shader_decompiler: Implement integer size conversions for I2I/I2F/F2I. | bunnei | 2018-06-16 | 2 | -14/+43 |
| | | |||||
* | | Merge pull request #564 from bunnei/lop32i_passb | bunnei | 2018-06-16 | 1 | -6/+12 |
|\ \ | | | | | | | gl_shader_decompiler: Implement LOP32I LogicOperation PassB. | ||||
| * | | gl_shader_decompiler: Implement LOP32I LogicOperation PassB. | bunnei | 2018-06-16 | 1 | -6/+12 |
| | | | |||||
* | | | gl_shader_gen: Set position.w to 1. | bunnei | 2018-06-16 | 1 | -0/+4 |
|/ / | |||||
* | | Merge pull request #556 from Subv/dma_engine | bunnei | 2018-06-12 | 7 | -1/+237 |
|\ \ | | | | | | | GPU: Partially implemented the Maxwell DMA engine. | ||||
| * | | GPU: Partially implemented the Maxwell DMA engine. | Subv | 2018-06-12 | 7 | -1/+237 |
| | | | | | | | | | | | | Only tiled->linear and linear->tiled copies that aren't offsetted are supported for now. Queries are not supported. Swizzled copies are not supported. | ||||
* | | | Merge pull request #558 from Subv/iadd32i | bunnei | 2018-06-12 | 2 | -2/+31 |
|\ \ \ | | | | | | | | | GPU: Implemented the iadd32i shader instruction. | ||||
| * | | | GPU: Implemented the iadd32i shader instruction. | Subv | 2018-06-12 | 2 | -2/+31 |
| | | | | |||||
* | | | | gl_shader_decompiler: Implement saturate for float instructions. | bunnei | 2018-06-12 | 2 | -39/+32 |
|/ / / | |||||
* / / | GPU: Convert the gl_InstanceId and gl_VertexID variables to floats when reading from them. | Subv | 2018-06-10 | 1 | -1/+1 |
|/ / | | | | | | | This corrects the invalid position values in some games when doing attribute-less rendering. | ||||
* | | GPU: Implement the iset family of shader instructions. | Subv | 2018-06-09 | 2 | -2/+46 |
| | | |||||
* | | GPU: Added decodings for the ISET family of instructions. | Subv | 2018-06-09 | 1 | -0/+7 |
|/ | |||||
* | Merge pull request #550 from Subv/ssy | bunnei | 2018-06-09 | 2 | -0/+7 |
|\ | | | | | GPU: Stub the SSY shader instruction. | ||||
| * | GPU: Stub the SSY shader instruction. | Subv | 2018-06-09 | 2 | -0/+7 |
| | | | | | | | | This instruction tells the GPU where the flow reconverges in a non-uniform control flow scenario, we can ignore this when generating GLSL code. | ||||
* | | Merge pull request #551 from bunnei/shr | bunnei | 2018-06-09 | 2 | -0/+17 |
|\ \ | | | | | | | gl_shader_decompiler: Implement SHR instruction. | ||||
| * | | gl_shader_decompiler: Implement SHR instruction. | bunnei | 2018-06-09 | 2 | -0/+17 |
| |/ | |||||
* | | gl_shader_decompiler: Implement IADD instruction. | bunnei | 2018-06-09 | 2 | -11/+37 |
| | | |||||
* | | gl_shader_decompiler: Add missing asserts for saturate_a instructions. | bunnei | 2018-06-09 | 2 | -8/+18 |
|/ | |||||
* | GPU: Synchronize the blend state on every draw call. | Subv | 2018-06-09 | 2 | -16/+20 |
| | | | | | | Only independent blending on render target 0 is implemented for now. This fixes the elongated squids in Splatoon 2's boot screen. | ||||
* | GPU: Added registers for normal and independent blending. | Subv | 2018-06-09 | 2 | -31/+27 |
| | |||||
* | Merge pull request #547 from Subv/compressed_alignment | bunnei | 2018-06-08 | 1 | -2/+7 |
|\ | | | | | GLCache: Align compressed texture sizes to their compression ratio, and then align that compressed size to the block height for tiled textures. | ||||
| * | GLCache: Align compressed texture sizes to their compression ratio, and then align that compressed size to the block height for tiled textures. | Subv | 2018-06-08 | 1 | -2/+7 |
| | | | | | | | | This fixes issues with retrieving non-block-aligned tiled compressed textures from the cache. | ||||
* | | Rasterizer: Flush the written region when writing shader uniform data before copying it to the uniform buffers. | Subv | 2018-06-08 | 1 | -0/+3 |
|/ | | | | This fixes the flip_viewport uniform having invalid values when drawing. | ||||
* | Merge pull request #543 from Subv/uniforms | bunnei | 2018-06-07 | 1 | -3/+4 |
|\ | | | | | GLRenderer: Write the shader stage configuration UBO data *before* copying it to the GPU. | ||||
| * | GLRenderer: Write the shader stage configuration UBO data *before* copying it to the GPU. | Subv | 2018-06-07 | 1 | -3/+4 |
| | | | | | | | | This should fix the bug with the vs_config UBO being uninitialized during shader execution. | ||||
* | | Merge pull request #542 from bunnei/bfe_imm | bunnei | 2018-06-07 | 2 | -7/+44 |
|\ \ | | | | | | | gl_shader_decompiler: Implement BFE_IMM instruction. | ||||
| * | | gl_shader_decompiler: Implement BFE_IMM instruction. | bunnei | 2018-06-07 | 2 | -7/+44 |
| | | | |||||
* | | | Merge pull request #541 from Subv/blittextures | bunnei | 2018-06-07 | 1 | -56/+9 |
|\ \ \ | |/ / |/| | | GLCache: Fixed copying compressed textures in the rasterizer cache. | ||||
| * | | GLCache: Use the full uncompressed size when blitting from one texture to another. | Subv | 2018-06-07 | 1 | -3/+6 |
| | | | | | | | | | | | | This avoids the problem of only copying a tiny piece of the textures when they are compressed. | ||||
| * | | GLCache: Simplify the logic to copy from one texture to another in BlitTextures. | Subv | 2018-06-07 | 1 | -53/+3 |
| |/ | | | | | | | | | | | We now use glCopyImageSubData, this should avoid errors with trying to attach a compressed texture as a framebuffer's color attachment and then blitting to it. Maybe in the future we can change this to glCopyTextureSubImage which only requires GL_ARB_direct_state_access. | ||||
* | | gl_shader_decompiler: F2F: Implement rounding modes. | bunnei | 2018-06-07 | 2 | -10/+35 |
| | | |||||
* | | Merge pull request #537 from bunnei/misc-shader | bunnei | 2018-06-07 | 2 | -8/+24 |
|\ \ | | | | | | | gl_shader_decompiler: Additional decodings, remove unused stuff from TEX | ||||
| * | | gl_shader_decompiler: Remove some attribute stuff that has nothing to do with TEX/TEXS. | bunnei | 2018-06-07 | 1 | -8/+4 |
| | | | |||||
| * | | shader_bytecode: Add instruction decodings for BFE, IMNMX, and XMAD. | bunnei | 2018-06-07 | 1 | -0/+20 |
| |/ | |||||
* | | Merge pull request #535 from Subv/gpu_swizzle | bunnei | 2018-06-07 | 6 | -0/+65 |
|\ \ | | | | | | | GPU: Support changing the texture swizzles for Maxwell textures. | ||||
| * | | GPU: Support changing the texture swizzles for Maxwell textures. | Subv | 2018-06-07 | 3 | -0/+45 |
| | | | |||||
| * | | GLState: Support changing the GL_TEXTURE_SWIZZLE parameter of each texture unit. | Subv | 2018-06-07 | 3 | -0/+20 |
| |/ | |||||
* / | gl_shader_decompiler: Implement ISETP_IMM instruction. | bunnei | 2018-06-07 | 1 | -8/+9 |
|/ | |||||
* | Merge pull request #534 from Subv/multitexturing | bunnei | 2018-06-07 | 9 | -69/+172 |
|\ | | | | | GPU: Implement sampling multiple textures in the generated glsl shaders. | ||||
| * | GPU: Implement sampling multiple textures in the generated glsl shaders. | Subv | 2018-06-06 | 9 | -69/+172 |
| | | | | | | | | | | | | All tested games that use a single texture show no regression. Only Texture2D textures are supported right now, each shader gets its own "tex_fs/vs/gs" sampler array to maintain independent textures between shader stages, the textures themselves are reused if possible. | ||||
* | | gl_shader_decompiler: Implement LD_C instruction. | bunnei | 2018-06-07 | 2 | -0/+43 |
| | | |||||
* | | gl_shader_gen: Add uniform handling for indirect const buffer access. | bunnei | 2018-06-07 | 3 | -4/+40 |
| | | |||||
* | | gl_shader_decompiler: Refactor uniform handling to allow different decodings. | bunnei | 2018-06-06 | 2 | -26/+29 |
|/ | |||||
* | Merge pull request #531 from bunnei/fix-shl | Sebastian Valle | 2018-06-06 | 1 | -1/+1 |
|\ | | | | | gl_shader_decompiler: Fix un/signed mismatch with SHL. | ||||
| * | gl_shader_decompiler: Fix un/signed mismatch with SHL. | bunnei | 2018-06-06 | 1 | -1/+1 |
| | | |||||
* | | Merge pull request #530 from bunnei/wrap-mirror | Sebastian Valle | 2018-06-06 | 1 | -0/+2 |
|\ \ | | | | | | | maxwell_to_gl: Implement WrapMode Mirror. | ||||
| * | | maxwell_to_gl: Implement WrapMode Mirror. | bunnei | 2018-06-06 | 1 | -0/+2 |
| |/ | |||||
* | | Merge pull request #527 from Subv/rgba32f_texcopy | bunnei | 2018-06-06 | 2 | -0/+5 |
|\ \ | | | | | | | GPU: Allow the usage of RGBA32_FLOAT and RGBA16_FLOAT in the texture copy engine. | ||||
| * | | GPU: Allow the usage of RGBA16_FLOAT in the texture copy engine. | Subv | 2018-06-06 | 1 | -0/+2 |
| | | | |||||
| * | | GPU: Allow the usage of RGBA32_FLOAT in the texture copy engine. | Subv | 2018-06-06 | 2 | -0/+3 |
| | | | |||||
* | | | Merge pull request #528 from Subv/rg11b10f | bunnei | 2018-06-06 | 4 | -12/+31 |
|\ \ \ | |_|/ |/| | | GPU: Implemented the R11FG11FB10F texture and rendertarget formats. | ||||
| * | | GPU: Implemented the R11FG11FB10F texture and rendertarget formats. | Subv | 2018-06-06 | 4 | -11/+30 |
| | | | |||||
| * | | GPU: Fixed the compression factor for RGBA16F textures. | Subv | 2018-06-06 | 1 | -1/+1 |
| |/ | | | | | | | They're not compressed. | ||||
* | | Merge pull request #516 from Subv/f2i_r | bunnei | 2018-06-06 | 2 | -7/+64 |
|\ \ | | | | | | | GPU: Implemented the F2I_R shader instruction. | ||||
| * | | GPU: Implemented the F2I_R shader instruction. | Subv | 2018-06-05 | 2 | -7/+64 |
| | | | |||||
* | | | Merge pull request #521 from Subv/bra | bunnei | 2018-06-05 | 1 | -4/+5 |
|\ \ \ | | | | | | | | | GPU: Corrected the branch targets for the shader bra instruction. | ||||
| * | | | GPU: Corrected the branch targets for the shader bra instruction. | Subv | 2018-06-05 | 1 | -4/+5 |
| | |/ | |/| | |||||
* | | | Merge pull request #520 from bunnei/shader-shl | bunnei | 2018-06-05 | 2 | -15/+48 |
|\ \ \ | |/ / |/| | | gl_shader_decompiler: Implement SHL instruction. | ||||
| * | | gl_shader_decompiler: Fix typo with ISCADD instruction. | bunnei | 2018-06-05 | 1 | -1/+1 |
| | | | |||||
| * | | gl_shader_decompiler: Implement SHL instruction. | bunnei | 2018-06-05 | 2 | -14/+47 |
| | | | |||||
* | | | Merge pull request #518 from Subv/incomplete_shaders | bunnei | 2018-06-05 | 1 | -5/+16 |
|\ \ \ | |/ / |/| | | GPU: Implemented predicated exit instructions in the shader programs. | ||||
| * | | GPU: Implement predicated exit instructions in the shader programs. | Subv | 2018-06-05 | 1 | -4/+6 |
| | | | |||||
| * | | GPU: Take into account predicated exits when performing shader control flow analysis. | Subv | 2018-06-05 | 1 | -1/+10 |
| | | | |||||
* | | | gl_shader_decompiler: Implement PredCondition::NotEqual. | bunnei | 2018-06-05 | 1 | -3/+3 |
| | | | |||||
* | | | GPU: Implement the ISCADD shader instructions. | Subv | 2018-06-05 | 2 | -0/+40 |
| | | | |||||
* | | | GPU: Added decodings for the ISCADD instructions. | Subv | 2018-06-05 | 1 | -0/+7 |
| |/ |/| | |||||
* | | Merge pull request #514 from Subv/lop32i | bunnei | 2018-06-05 | 2 | -1/+58 |
|\ \ | | | | | | | GPU: Implemented the LOP32I instruction. | ||||
| * | | GPU: Implemented the LOP32I instruction. | Subv | 2018-06-04 | 2 | -1/+58 |
| | | | |||||
* | | | Merge pull request #510 from Subv/isetp | bunnei | 2018-06-05 | 2 | -6/+63 |
|\ \ \ | | | | | | | | | GPU: Implemented the ISETP_R and ISETP_C instructions | ||||
| * | | | GPU: Use explicit types when retrieving the uniform values for fsetp/fset and isetp instead of the type of an invalid output register. | Subv | 2018-06-04 | 1 | -9/+18 |
| | | | | |||||
| * | | | GPU: Implemented the ISETP_R and ISETP_C shader instructions. | Subv | 2018-06-04 | 2 | -0/+48 |
| |/ / | |||||
* | | | Merge pull request #512 from Subv/fset | bunnei | 2018-06-05 | 2 | -4/+19 |
|\ \ \ | |_|/ |/| | | GPU: Corrected the FSET and I2F instructions. | ||||
| * | | GPU: Use the bf bit in FSET to determine whether to write 0xFFFFFFFF or 1.0f. | Subv | 2018-06-04 | 2 | -2/+7 |
| | | | |||||
| * | | GPU: Corrected the I2F_R implementation. | Subv | 2018-06-04 | 1 | -2/+12 |
| |/ | |||||
* | | Merge pull request #501 from Subv/shader_bra | bunnei | 2018-06-05 | 2 | -1/+45 |
|\ \ | | | | | | | GPU: Partially implemented the bra shader instruction | ||||
| * | | GPU: Partially implemented the shader BRA instruction. | Subv | 2018-06-04 | 2 | -1/+43 |
| | | | |||||
| * | | GPU: Added decoding for the BRA instruction. | Subv | 2018-06-04 | 1 | -0/+2 |
| |/ | |||||
* | | Merge pull request #515 from Subv/viewport_fix | bunnei | 2018-06-05 | 2 | -14/+30 |
|\ \ | | | | | | | GPU: Calculate the correct viewport dimensions based on the scale and translate registers. | ||||
| * | | GPU: Calculate the correct viewport dimensions based on the scale and translate registers. | Subv | 2018-06-04 | 2 | -14/+30 |
| |/ | | | | | | | This is how nouveau calculates the viewport width and height. For some reason some games set 0xFFFF in the VIEWPORT_HORIZ and VIEWPORT_VERT registers, maybe those are a misnomer and actually refer to something else? | ||||
* / | GLCache: Corrected a mismatch between storing compressed sizes and verifying the uncompressed alignment in GetSurface. | Subv | 2018-06-04 | 1 | -1/+2 |
|/ | |||||
* | Merge pull request #500 from Subv/long_queries | bunnei | 2018-06-04 | 1 | -9/+24 |
|\ | | | | | GPU: Partial implementation of long GPU queries. | ||||
| * | GPU: Partial implementation of long GPU queries. | Subv | 2018-06-04 | 1 | -9/+24 |
| | | | | | | | | | | | | | | | | Long queries write a 128-bit result value to memory, which consists of a 64 bit query value and a 64 bit timestamp. In this implementation, only select=Zero of the Crop unit is implemented, this writes the query sequence as a 64 bit value, and a 0u64 value for the timestamp, since we emulate an infinitely fast GPU. This specific type was hwtested, but more rigorous tests should be performed in the future for the other types. | ||||
* | | gl_shader_decompiler: Implement TEXS component mask. | bunnei | 2018-06-03 | 2 | -9/+26 |
| | | |||||
* | | Merge pull request #494 from bunnei/shader-tex | bunnei | 2018-06-03 | 2 | -2/+58 |
|\ \ | | | | | | | gl_shader_decompiler: Implement TEX, fixes for TEXS. | ||||
| * | | gl_shader_decompiler: Implement TEX instruction. | bunnei | 2018-06-01 | 2 | -1/+36 |
| | | | |||||
| * | | gl_shader_decompiler: Support multi-destination for TEXS. | bunnei | 2018-06-01 | 2 | -2/+23 |
| |/ | |||||
* | | Merge pull request #495 from bunnei/improve-rro | bunnei | 2018-06-03 | 2 | -9/+18 |
|\ \ | | | | | | | gl_shader_decompiler: Implement RRO as a register move. | ||||
| * | | gl_shader_decompiler: Implement RRO as a register move. | bunnei | 2018-06-03 | 2 | -9/+18 |
| |/ | |||||
* / | GPU: Implemented the DXN1 (BC4) texture format. | Subv | 2018-06-02 | 3 | -3/+16 |
|/ | |||||
* | gl_rasterizer_cache: Assert that component type is UNorm or format is RGBA16F. | bunnei | 2018-05-31 | 1 | -1/+2 |
| | |||||
* | gl_rasterizer_cache: Implement PixelFormat RGBA16F. | bunnei | 2018-05-31 | 3 | -6/+22 |
| | |||||
* | Merge pull request #489 from Subv/vertexid | bunnei | 2018-05-30 | 2 | -1/+11 |
|\ | | | | | Shaders: Implemented reading the gl_InstanceID and gl_VertexID variables in the vertex shader. | ||||
| * | Shaders: Implemented reading the gl_InstanceID and gl_VertexID variables in the vertex shader. | Subv | 2018-05-30 | 2 | -1/+11 |
| | | |||||
* | | Merge pull request #483 from bunnei/sonic | Sebastian Valle | 2018-05-30 | 3 | -5/+16 |
|\ \ | | | | | | | Several GPU fixes to boot Sonic Mania | ||||
| * | | gl_shader_decompiler: F2F_R instruction: Implement abs. | bunnei | 2018-05-30 | 1 | -1/+7 |
| | | | |||||
| * | | gl_shader_decompiler: Partially implement F2F_R instruction. | bunnei | 2018-05-30 | 2 | -4/+9 |
| | | | |||||
| * | | gl_rasterize_cache: Invert order of tex format RGB565. | bunnei | 2018-05-30 | 1 | -1/+1 |
| |/ | |||||
* / | GPU: Implemented the R8 texture format (0x1D) | Subv | 2018-05-30 | 3 | -5/+18 |
|/ | |||||
* | add all the known TextureFormat (#474) | greggameplayer | 2018-05-29 | 1 | -2/+71 |
| | |||||
* | Merge pull request #472 from bunnei/greater-equal | bunnei | 2018-05-27 | 1 | -4/+3 |
|\ | | | | | gl_shader_decompiler: Implement GetPredicateComparison GreaterEqual. | ||||
| * | gl_shader_decompiler: Implement GetPredicateComparison GreaterEqual. | bunnei | 2018-05-26 | 1 | -4/+3 |
| | | |||||
* | | Merge pull request #476 from Subv/a1bgr5 | bunnei | 2018-05-27 | 4 | -5/+21 |
|\ \ | | | | | | | GPU: Implemented the A1B5G5R5 texture format (0x14) | ||||
| * | | GPU: Implemented the A1B5G5R5 texture format (0x14) | Subv | 2018-05-27 | 4 | -5/+21 |
| |/ | |||||
* / | shader_bytecode: Implement other variants of FMNMX. | bunnei | 2018-05-26 | 2 | -4/+10 |
|/ | |||||
* | Merge pull request #468 from Subv/compound_preds | bunnei | 2018-05-26 | 1 | -46/+66 |
|\ | | | | | Shader: Implemented compound predicates in the fset and fsetp instructions | ||||
| * | Shader: Implemented compound predicates in fset. | Subv | 2018-05-25 | 1 | -28/+12 |
| | | | | | | | | | | | | You can specify a predicate in the fset instruction: Result = ((Value1 Comp Value2) OP P0) ? 1.0 : 0.0; | ||||
| * | Shader: Implemented compound predicates in fsetp. | Subv | 2018-05-25 | 1 | -19/+55 |
| | | | | | | | | | | | | | | You can specify three predicates in an fsetp instruction: P1 = (Value1 Comp Value2) OP P0; P2 = !(Value1 Comp Value2) OP P0; | ||||
* | | GPU: Allow command lists to rebind a channel to another engine in the middle of the command list. | Subv | 2018-05-25 | 1 | -1/+0 |
|/ | |||||
* | Merge pull request #456 from Subv/unmap_buffer | bunnei | 2018-05-21 | 2 | -0/+20 |
|\ | | | | | Implemented nvhost-as-gpu's UnmapBuffer and nvmap's Free ioctls. | ||||
| * | GPU: Implemented nvhost-as-gpu's UnmapBuffer ioctl. | Subv | 2018-05-20 | 2 | -0/+20 |
| | | | | | | | | It removes a mapping previously created with the MapBufferEx ioctl. | ||||
* | | Merge pull request #458 from Subv/fmnmx | bunnei | 2018-05-21 | 2 | -6/+26 |
|\ \ | | | | | | | Shaders: Implemented the FMNMX shader instruction. | ||||
| * | | Shaders: Implemented the FMNMX shader instruction. | Subv | 2018-05-21 | 2 | -6/+26 |
| |/ | |||||
* | | Merge pull request #452 from Subv/psetp | Sebastian Valle | 2018-05-21 | 1 | -0/+3 |
|\ \ | | | | | | | ShadersDecompiler: Added decoding for the PSETP instruction. | ||||
| * | | ShadersDecompiler: Added decoding for the PSETP instruction. | Subv | 2018-05-19 | 1 | -0/+3 |
| |/ | |||||
* | | Merge pull request #451 from Subv/gl_array_size | Sebastian Valle | 2018-05-21 | 2 | -13/+3 |
|\ \ | | | | | | | GLRenderer: Remove unused vertex buffer and increase the size of the stream buffer to 128 MB. | ||||
| * | | GLRenderer: Remove unused hw_vao_enabled_attributes variable. | Subv | 2018-05-19 | 2 | -4/+0 |
| | | | |||||
| * | | GLRenderer: Remove unused vertex buffer and increase the size of the stream buffer to 128 MB. | Subv | 2018-05-19 | 2 | -9/+3 |
| |/ | | | | | | | The stream buffer is where all the vertex data is copied, some games require this to be much bigger than the 4 MB we used to have. | ||||
* / | GLRenderer: Log the shader source code when program linking fails. | Subv | 2018-05-19 | 1 | -0/+27 |
|/ | |||||
* | general: Make formatting of logged hex values more straightforward | Lioncash | 2018-05-02 | 1 | -1/+1 |
| | | | | | | This makes the formatting expectations more obvious (e.g. any zero padding specified is padding that's entirely dedicated to the value being printed, not any pretty-printing that also gets tacked on). | ||||
* | Merge pull request #422 from bunnei/shader-mov | bunnei | 2018-04-30 | 4 | -0/+30 |
|\ | | | | | Shader instructions MOV_C, MOV_R, and several minor GPU things | ||||
| * | maxwell_3d: Reset vertex counts after drawing. | bunnei | 2018-04-29 | 1 | -0/+10 |
| | | |||||
| * | gl_shader_decompiler: Implement MOV_R. | bunnei | 2018-04-29 | 1 | -1/+2 |
| | | |||||
| * | maxwell_to_gl: Implement type SignedNorm, Size_8_8_8_8. | bunnei | 2018-04-29 | 1 | -0/+12 |
| | | |||||
| * | shader_bytecode: Add decoding for FMNMX instruction. | bunnei | 2018-04-29 | 1 | -0/+2 |
| | | |||||
| * | gl_shader_decompiler: Implement MOV_C. | bunnei | 2018-04-29 | 1 | -0/+5 |
| | | |||||
* | | Shaders: Implemented predicate condition 3 (LessEqual) in the fset and fsetp instructions. | Subv | 2018-04-29 | 1 | -0/+7 |
|/ | |||||
* | Merge pull request #416 from bunnei/shader-ints-p3 | bunnei | 2018-04-29 | 2 | -114/+206 |
|\ | | | | | gl_shader_decompiler: Implement MOV32I, partially implement I2I, I2F | ||||
| * | gl_shader_decompiler: Partially implement I2I_R, and I2F_R. | bunnei | 2018-04-29 | 2 | -8/+34 |
| | | |||||
| * | gl_shader_decompiler: More cleanups, etc. with how we handle register types. | bunnei | 2018-04-29 | 1 | -44/+120 |
| | | |||||
| * | GLSLRegister: Simplify register declarations, etc. | bunnei | 2018-04-29 | 1 | -63/+31 |
| | | |||||
| * | shader_bytecode: Add decodings for i2i instructions. | bunnei | 2018-04-29 | 1 | -3/+20 |
| | | |||||
| * | gl_shader_decompiler: Implement MOV32_IMM instruction. | bunnei | 2018-04-29 | 2 | -2/+7 |
| | | |||||
* | | fermi_2d: Fix surface copy block height. | bunnei | 2018-04-29 | 2 | -2/+7 |
|/ | |||||
* | Merge pull request #408 from bunnei/shader-ints-p2 | bunnei | 2018-04-27 | 1 | -154/+262 |
|\ | | | | | gl_shader_decompiler: Add GLSLRegisterManager class to track register state. | ||||
| * | gl_shader_decompiler: Add GLSLRegisterManager class to track register state. | bunnei | 2018-04-27 | 1 | -154/+262 |
| | | |||||
* | | renderer_opengl: Replace usages of LOG_GENERIC with fmt-capable equivalents | Lioncash | 2018-04-27 | 1 | -6/+7 |
|/ | |||||
* | general: Convert assertion macros over to be fmt-compatible | Lioncash | 2018-04-27 | 4 | -7/+7 |
| | |||||
* | gl_shader_decompiler: Boilerplate for handling integer instructions. | bunnei | 2018-04-26 | 2 | -6/+111 |
| | |||||
* | gl_shader_decompiler: Move color output to EXIT instruction. | bunnei | 2018-04-26 | 1 | -6/+12 |
| | |||||
* | Merge pull request #396 from Subv/shader_ops | bunnei | 2018-04-26 | 2 | -9/+89 |
|\ | | | | | Shaders: Implemented the FSET instruction. | ||||
| * | Shaders: Added bit decodings for the I2I instruction. | Subv | 2018-04-25 | 1 | -0/+6 |
| | | |||||
| * | Shaders: Implemented the FSET instruction. | Subv | 2018-04-25 | 1 | -0/+53 |
| | | | | | | | | This instruction is similar to the FSETP instruction, but it doesn't set a predicate, it sets the destination register to 1.0 if the condition holds, and 0 otherwise. | ||||
| * | Shaders: Added decodings for the FSET instructions. | Subv | 2018-04-25 | 2 | -9/+30 |
| | | |||||
* | | GPU: Partially implemented the Fermi2D surface copy operation. | Subv | 2018-04-25 | 2 | -0/+59 |
| | | | | | | | | | | The hardware allows for some rather complicated operations to be performed on the data during the copy, this is not implemented. Only same-format same-size raw copies are implemented for now. | ||||
* | | GPU: Make the Textures::CopySwizzledData function accessible from the outside of the file. | Subv | 2018-04-25 | 2 | -3/+6 |
| | | |||||
* | | GPU: Added a function to retrieve the bytes per pixel of the render target formats. | Subv | 2018-04-25 | 2 | -0/+15 |
| | | |||||
* | | GPU: Added surface copy registers to Fermi2D | Subv | 2018-04-25 | 1 | -1/+57 |
| | | |||||
* | | GPU: Added boilerplate code for the Fermi2D engine | Subv | 2018-04-25 | 3 | -3/+34 |
| | | |||||
* | | GPU: Reduce the number of registers of Maxwell3D to 0xE00. | Subv | 2018-04-25 | 2 | -5/+5 |
| | | | | | | | | The rest are just macro shim registers. | ||||
* | | GPU: Move the Maxwell3D macro uploading code to the inside of the Maxwell3D processor. | Subv | 2018-04-25 | 4 | -40/+23 |
| | | | | | | | | It doesn't belong in the PFIFO handler. | ||||
* | | GPU: Corrected the upper bound of the PFIFO method ids in the command processor. | Subv | 2018-04-25 | 1 | -1/+1 |
| | | |||||
* | | video-core: Move logging macros over to new fmt-capable ones | Lioncash | 2018-04-25 | 5 | -18/+20 |
|/ | |||||
* | renderer_opengl: Use correct byte order for framebuffer pixel format ABGR8. | bunnei | 2018-04-25 | 1 | -2/+1 |
| | |||||
* | gl_rasterizer_cache: Use CHAR_BIT for bpp conversions instead of 8. | bunnei | 2018-04-25 | 2 | -4/+4 |
| | |||||
* | gl_rasterizer_cache: Use GPU PAGE_BITS/SIZE, not CPU. | bunnei | 2018-04-25 | 1 | -5/+5 |
| | |||||
* | gl_rasterizer_cache: Use new logger. | bunnei | 2018-04-25 | 1 | -4/+4 |
| | |||||
* | gl_rasterizer_cache: Add a function for finding framebuffer GPU address. | bunnei | 2018-04-25 | 3 | -0/+31 |
| | |||||
* | gl_rasterizer_cache: Handle compressed texture sizes. | bunnei | 2018-04-25 | 2 | -24/+65 |
| | |||||
* | gl_rasterizer_cache: Update to be based on GPU addresses, not CPU addresses. | bunnei | 2018-04-25 | 8 | -50/+72 |
| | |||||
* | memory_manager: Add implement CpuToGpuAddress. | bunnei | 2018-04-24 | 2 | -0/+27 |
| | |||||
* | memory_manager: Make GpuToCpuAddress return an optional. | bunnei | 2018-04-24 | 6 | -24/+33 |
| | |||||
* | memory_manager: Use GPUVAdddr, not PAddr, for GPU addresses. | bunnei | 2018-04-24 | 6 | -58/+55 |
| | |||||
* | Merge pull request #386 from Subv/gpu_query | bunnei | 2018-04-24 | 2 | -2/+53 |
|\ | | | | | GPU: Added asserts to our code for handling the QUERY_GET GPU command. | ||||
| * | GPU: Added asserts to our code for handling the QUERY_GET GPU command. | Subv | 2018-04-24 | 2 | -2/+53 |
| | | | | | | | | | | This is based on research from nouveau. Many things are currently unknown and will require hwtests in the future. This commit also stubs QueryMode::Write2 to do the same as Write. Nouveau code treats them interchangeably, it is currently unknown what the difference is. | ||||
* | | renderer_opengl: Silence a -Wdangling-else warning in DrawScreenTriangles() | Lioncash | 2018-04-24 | 1 | -1/+2 |
| | | |||||
* | | Merge pull request #379 from Subv/multi_buffers | bunnei | 2018-04-24 | 3 | -43/+89 |
|\ \ | | | | | | | GPU: Support multiple enabled vertex arrays. | ||||
| * | | GPU: Support multiple enabled vertex arrays. | Subv | 2018-04-23 | 3 | -43/+89 |
| |/ | | | | | | | | | | | The vertex arrays will be copied to the stream buffer one after the other, and the attributes will be set using the ARB_vertex_attrib_binding extension. yuzu now thus requires OpenGL 4.3 or the ARB_vertex_attrib_binding extension. | ||||
* | | Merge pull request #383 from Subv/gpu_mmu | bunnei | 2018-04-23 | 2 | -34/+25 |
|\ \ | | | | | | | GPU: Make the GPU virtual memory manager use 16 page bits and 10 pagetable bits. | ||||
| * | | GPU: Make the GPU virtual memory manager use 16 page bits and 10 page table bits. | Subv | 2018-04-23 | 2 | -34/+25 |
| |/ | | | | | | | Also removed some dead code and added memory map consistency asserts. | ||||
* | | GPU: Implement the RGB10_A2 RenderTarget format, it will use the same format as the A2BGR10 texture format. | Subv | 2018-04-23 | 1 | -0/+2 |
| | | |||||
* | | GPU: Implement the A2BGR10 texture format. | Subv | 2018-04-22 | 4 | -6/+18 |
|/ | |||||
* | Merge pull request #376 from bunnei/shader-decoder | bunnei | 2018-04-21 | 2 | -210/+249 |
|\ | | | | | Shader opcode decoding | ||||
| * | gl_shader_decompiler: Skip RRO instruction. | bunnei | 2018-04-21 | 1 | -0/+4 |
| | | |||||
| * | gl_shader_decompiler: Cleanup error logging. | bunnei | 2018-04-21 | 1 | -14/+6 |
| | | |||||
| * | shader_bytecode: Add several more instruction decodings. | bunnei | 2018-04-21 | 1 | -5/+52 |
| | | |||||
| * | shader_bytecode: Decode instructions based on bit strings. | bunnei | 2018-04-21 | 2 | -205/+201 |
| | | |||||
* | | Merge pull request #375 from lioncash/header | bunnei | 2018-04-21 | 4 | -11/+0 |
|\ \ | |/ |/| | opengl: Remove unnecessary header inclusions | ||||
| * | opengl: Remove unnecessary header inclusions | Lioncash | 2018-04-21 | 4 | -11/+0 |
| | | |||||
* | | Merge pull request #369 from Subv/shader_instr2 | bunnei | 2018-04-21 | 2 | -4/+179 |
|\ \ | | | | | | | ShaderGen: Implemented fsetp/kil and predicated instruction execution. | ||||
| * | | ShaderGen: Implemented the KIL instruction, which is equivalent to 'discard'. | Subv | 2018-04-21 | 1 | -1/+7 |
| | | | |||||
| * | | ShaderGen: Implemented predicated instruction execution. | Subv | 2018-04-21 | 2 | -1/+40 |
| | | | | | | | | | | | | Each predicated instruction will be wrapped in an `if (predicate) { instruction_body; }` in the GLSL, where `predicate` is one of the predicate boolean variables previously set by fsetp. | ||||
| * | | ShaderGen: Implemented the fsetp instruction. | Subv | 2018-04-21 | 2 | -3/+112 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Predicate variables are now added to the generated shader code in the form of 'pX' where X is the predicate id. These predicate variables are initialized to false on shader startup and are set via the fsetp instructions. TODO: * Not all the comparison types are implemented. * Only the single-predicate version is implemented. | ||||
| * | | ShaderGen: Register id 255 is special and is hardcoded to return 0 (SR_ZERO). | Subv | 2018-04-20 | 2 | -0/+5 |
| | | | |||||
| * | | ShaderGen: Ignore the 'sched' instruction when generating shaders. | Subv | 2018-04-20 | 1 | -0/+16 |
| |/ | | | | | | | The 'sched' instruction has a very convoluted encoding, but fortunately it seems to only appear on a fixed interval (once every 4 instructions). | ||||
* | | Merge pull request #374 from lioncash/noexcept | bunnei | 2018-04-21 | 1 | -20/+19 |
|\ \ | | | | | | | gl_resource_manager: Add missing noexcept specifiers to move constructors and assignment operators | ||||
| * | | gl_resource_manager: Add missing noexcept specifiers to move constructors and assignment operators | Lioncash | 2018-04-21 | 1 | -20/+19 |
| |/ | | | | | | | | | | | | | | | Standard library containers may use std::move_if_noexcept to perform move operations. If a move cannot be performed under these circumstances, then a copy is attempted. Given we only intend for these types to be move-only this can be somewhat problematic. By defining these to be noexcept we prevent cases where copies may be attempted. | ||||
* / | gl_rasterizer_cache: Make MatchFlags an enum class | Lioncash | 2018-04-21 | 1 | -4/+9 |
|/ | | | | Prevents implicit conversions and scope pollution. | ||||
* | Merge pull request #367 from lioncash/clamp | bunnei | 2018-04-20 | 2 | -16/+17 |
|\ | | | | | math_util: Remove the Clamp() function | ||||
| * | math_util: Remove the Clamp() function | Lioncash | 2018-04-20 | 2 | -16/+17 |
| | | | | | | | | | | C++17 adds clamp() to the standard library, so we can remove ours in favor of it. | ||||
* | | Merge pull request #363 from lioncash/array-size | bunnei | 2018-04-20 | 1 | -2/+2 |
|\ \ | | | | | | | common_funcs: Remove ARRAY_SIZE macro | ||||
| * | | common_funcs: Remove ARRAY_SIZE macro | Lioncash | 2018-04-20 | 1 | -2/+2 |
| |/ | | | | | | | C++17 has non-member size() which we can just call where necessary. | ||||
* / | renderer_opengl: Add missing header guards | Lioncash | 2018-04-20 | 2 | -0/+4 |
|/ | |||||
* | Merge pull request #356 from lioncash/shader | bunnei | 2018-04-20 | 1 | -12/+30 |
|\ | | | | | glsl_shader_decompiler: Minor API changes to ShaderWriter | ||||
| * | glsl_shader_decompiler: Use std::string_view instead of std::string for AddLine() | Lioncash | 2018-04-20 | 1 | -1/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | This function doesn't need to take ownership of the string data being given to it, considering all we do is append the characters to the internal string instance. Instead, use a string view to simply reference the string data without any potential heap allocation. Now anything that is a raw const char* won't need to be converted to a std::string before appending. | ||||
| * | glsl_shader_decompiler: Add AddNewLine() function to ShaderWriter | Lioncash | 2018-04-20 | 1 | -6/+12 |
| | | | | | | | | Avoids constructing a std::string just to append a newline character | ||||
| * | glsl_shader_decompiler: Add char overload for ShaderWriter's AddLine() | Lioncash | 2018-04-20 | 1 | -4/+11 |
| | | | | | | | | Avoids constructing a std::string just to append a character. | ||||
| * | glsl_shader_decompiler: Append indentation without constructing a separate std::string | Lioncash | 2018-04-20 | 1 | -1/+5 |
| | | | | | | | | | | The interface of std::string already lets us append N copies of a character to an existing string. | ||||
* | | ShaderGen: Implemented the fmul32i shader instruction. | Subv | 2018-04-19 | 2 | -9/+30 |
| | | |||||
* | | ShaderGen: Fixed a case where the TEXS instruction would use the same registers for the input and the output. | Subv | 2018-04-19 | 1 | -2/+9 |
|/ | | | | It will now save the coords before writing the outputs in a subscope. | ||||
* | GPU: Add support for the DXT23 and DXT45 compressed texture formats. | Subv | 2018-04-19 | 3 | -28/+35 |
| | |||||
* | Merge pull request #351 from Subv/tex_formats | bunnei | 2018-04-19 | 4 | -8/+28 |
|\ | | | | | GPU: Implemented the B5G6R5 format. | ||||
| * | GPU: Implemented the B5G6R5 format. | Subv | 2018-04-19 | 4 | -8/+28 |
| | | |||||
* | | gl_shader_gen: Support vertical/horizontal viewport flipping. (#347) | bunnei | 2018-04-18 | 4 | -5/+29 |
|/ | | | | | | * gl_shader_gen: Support vertical/horizontal viewport flipping. * fixup! gl_shader_gen: Support vertical/horizontal viewport flipping. | ||||
* | GLCache: Added boilerplate code to make supporting configurable texture component types. | Subv | 2018-04-18 | 3 | -9/+69 |
| | | | | For now only the UNORM type is supported. | ||||
* | GLCache: Unify texture and framebuffer formats when converting to OpenGL. | Subv | 2018-04-18 | 2 | -26/+13 |
| | |||||
* | GPU: Texture format 8 and framebuffer format 0xD5 are actually ABGR8. | Subv | 2018-04-18 | 2 | -10/+10 |
| | |||||
* | GPU: Pitch textures are now supported, don't assert when encountering them. | Subv | 2018-04-18 | 1 | -2/+3 |
| | |||||
* | GLCache: Take into account the texture's block height when caching and unswizzling. | Subv | 2018-04-18 | 3 | -43/+43 |
| | |||||
* | GLCache: Added a function to convert cached PixelFormats back to texture formats. | Subv | 2018-04-18 | 1 | -0/+12 |
| | | | | TODO: The way we handle cached formats must change, framebuffer and texture formats are too different to keep them in the same place. | ||||
* | GPU: Allow using a configurable block height when unswizzling textures. | Subv | 2018-04-18 | 4 | -7/+23 |
| | |||||
* | GPU/TIC: Added the pitch and block height fields to the TIC structure. | Subv | 2018-04-18 | 1 | -1/+16 |
| | |||||
* | Merge pull request #346 from bunnei/misc-gpu-improvements | bunnei | 2018-04-18 | 4 | -2/+11 |
|\ | | | | | Misc gpu improvements | ||||
| * | gl_rasterizer_cache: Add missing LOG statements. | bunnei | 2018-04-18 | 1 | -0/+3 |
| | | |||||
| * | texture: Add missing formats. | bunnei | 2018-04-18 | 1 | -1/+3 |
| | | |||||
| * | gpu: Add several framebuffer formats to RenderTargetFormat. | bunnei | 2018-04-18 | 1 | -0/+3 |
| | | |||||
| * | maxwell3d: Allow Texture2DNoMipmap as Texture2D. | bunnei | 2018-04-18 | 1 | -1/+2 |
| | | |||||
* | | Merge pull request #344 from bunnei/shader-decompiler-p2 | bunnei | 2018-04-18 | 3 | -67/+179 |
|\ \ | | | | | | | Shader decompiler changes part 2 | ||||
| * | | shader_bytecode: Make ctor's constexpr and explicit. | bunnei | 2018-04-18 | 1 | -7/+7 |
| | | | |||||
| * | | gl_shader_decompiler: Fix warnings with MarkAsUsed. | bunnei | 2018-04-17 | 1 | -1/+2 |
| | | | |||||
| * | | gl_shader_decompiler: Cleanup logging, updating to NGLOG_*. | bunnei | 2018-04-17 | 1 | -24/+22 |
| | | | |||||
| * | | gl_shader_decompiler: Implement several MUFU subops and abs_d. | bunnei | 2018-04-17 | 1 | -7/+21 |
| | | | |||||
| * | | gl_shader_decompiler: Fix swizzle in GetRegister. | bunnei | 2018-04-17 | 1 | -1/+1 |
| | | | |||||
| * | | gl_shader_decompiler: Implement FMUL/FADD/FFMA immediate instructions. | bunnei | 2018-04-17 | 2 | -12/+53 |
| | | | |||||
| * | | gl_shader_decompiler: Allow vertex position to be used in fragment shader. | bunnei | 2018-04-17 | 2 | -16/+18 |
| | | | |||||
| * | | gl_shader_decompiler: Implement IPA instruction. | bunnei | 2018-04-17 | 1 | -0/+11 |
| | | | |||||
| * | | gl_shader_decompiler: Add support for TEXS instruction. | bunnei | 2018-04-17 | 2 | -12/+43 |
| | | | |||||
| * | | gl_shader_decompiler: Use fragment output color for GPR 0-3. | bunnei | 2018-04-17 | 1 | -0/+5 |
| | | | |||||
| * | | gl_shader_decompiler: Partially implement MUFU. | bunnei | 2018-04-17 | 1 | -2/+11 |
| |/ | |||||
* / | renderer_opengl: Implement BlendEquation and BlendFunc. | bunnei | 2018-04-18 | 6 | -7/+140 |
|/ | |||||
* | Merge pull request #343 from Subv/tex_wrap_4 | bunnei | 2018-04-17 | 1 | -0/+7 |
|\ | | | | | GPU: Implement some wrap modes | ||||
| * | MaxwellToGL: Implemented tex wrap mode 1 (Wrap, GL_REPEAT). | Subv | 2018-04-17 | 1 | -0/+2 |
| | | |||||
| * | MaxwellToGL: Added a TODO and partial implementation of maxwell wrap mode 4 (Clamp, GL_CLAMP). | Subv | 2018-04-17 | 1 | -0/+5 |
| | | | | | | | | This clamp mode was removed from OpenGL as of 3.1, we can emulate it by using GL_CLAMP_TO_BORDER to get the border color of the texture, and then manually sampling the edge to mix them in the fragment shader. | ||||
* | | gl_rendering: Use NGLOG* for changed code. | bunnei | 2018-04-17 | 2 | -10/+11 |
| | | |||||
* | | gl_rasterizer: Implement indexed vertex mode. | bunnei | 2018-04-17 | 5 | -23/+92 |
|/ | |||||
* | GPU: Use the same buffer names in the generated GLSL and the buffer uploading code. | Subv | 2018-04-15 | 4 | -17/+24 |
| | |||||
* | GPU: Don't use explicit binding points when uploading the constbuffers to opengl. | Subv | 2018-04-15 | 3 | -7/+47 |
| | | | | The bindpoints will now be dynamically calculated based on the number of buffers used by the previous shader stage. | ||||
* | GPU: Don't use GetPointer when uploading the constbuffer data to the GPU. | Subv | 2018-04-15 | 1 | -3/+4 |
| | |||||
* | GPU: Use the buffer hints from the shader decompiler to upload only the necessary const buffers for each shader stage. | Subv | 2018-04-15 | 3 | -31/+41 |
| | |||||
* | shaders: Expose hints about used const buffers. | bunnei | 2018-04-15 | 5 | -31/+146 |
| | |||||
* | GPU: Upload the entirety of each constbuffer for each shader stage as SSBOs. | Subv | 2018-04-15 | 4 | -14/+48 |
| | | | | We're going to need the shader generator to give us a mapping of the actual used const buffers to properly bind them to the shader. | ||||
* | GPU: Allow configuring ssbos in the opengl state manager. | Subv | 2018-04-15 | 4 | -0/+30 |
| | |||||
* | GPU: Added a function to determine whether a shader stage is enabled or not. | Subv | 2018-04-15 | 3 | -3/+27 |
| | |||||
* | shaders: Add NumTextureSamplers const, remove unused #pragma. | bunnei | 2018-04-15 | 4 | -4/+5 |
| | |||||
* | shaders: Address PR review feedback. | bunnei | 2018-04-14 | 2 | -7/+9 |
| | |||||
* | gl_shader_decompiler: Cleanup log statements. | bunnei | 2018-04-14 | 1 | -15/+15 |
| | |||||
* | shaders: Fix GCC and clang build issues. | bunnei | 2018-04-14 | 3 | -5/+5 |
| | |||||
* | gl_shader_decompiler: Implement negate, abs, etc. and lots of cleanup. | bunnei | 2018-04-14 | 2 | -40/+96 |
| | |||||
* | shader_bytecode: Add FSETP and KIL to GetInfo. | bunnei | 2018-04-14 | 1 | -0/+3 |
| | |||||
* | shader_bytecode: Add SubOp decoding. | bunnei | 2018-04-14 | 1 | -0/+10 |
| | |||||
* | gl_shader_decompiler: Add shader stage hint. | bunnei | 2018-04-14 | 2 | -5/+12 |
| | |||||
* | renderer_opengl: Fix Morton copy byteswap, etc. | bunnei | 2018-04-14 | 2 | -6/+6 |
| | |||||
* | gl_shader_manager: Implement SetShaderSamplerBindings. | bunnei | 2018-04-14 | 1 | -0/+8 |
| | |||||
* | gl_rasterizer: Generate shaders and upload uniforms. | bunnei | 2018-04-14 | 2 | -32/+77 |
| | |||||
* | gl_shader_decompiler: Basic impl. for very simple vertex shaders. | bunnei | 2018-04-14 | 2 | -16/+311 |
| | | | | - Tested with Puyo Puyo Tetris and Cave Story+ | ||||
* | gl_shader_manager: Cleanup and consolidate uniform handling. | bunnei | 2018-04-14 | 2 | -26/+24 |
| | |||||
* | maxwell_3d: Make memory_manager public. | bunnei | 2018-04-14 | 1 | -2/+1 |
| | |||||
* | maxwell_3d: Fix shader_config decodings. | bunnei | 2018-04-14 | 1 | -6/+3 |
| | |||||
* | gl_rasterizer: Use shader program manager, remove test shader. | bunnei | 2018-04-14 | 2 | -196/+31 |
| | |||||
* | renderer_opengl: Add gl_shader_manager class. | bunnei | 2018-04-14 | 3 | -0/+209 |
| | |||||
* | maxwell_to_gl: Add a few types, etc. | bunnei | 2018-04-14 | 1 | -0/+10 |
| | |||||
* | gl_shader_gen: Add hashable setup/config structs. | bunnei | 2018-04-14 | 2 | -29/+50 |
| | |||||
* | gl_shader_util: Add missing includes. | bunnei | 2018-04-14 | 1 | -0/+2 |
| | |||||
* | renderer_opengl: Use OGLProgram instead of OGLShader. | bunnei | 2018-04-14 | 6 | -6/+6 |
| | |||||
* | gl_shader_util: Grab latest upstream. | bunnei | 2018-04-14 | 2 | -149/+74 |
| | |||||
* | gl_resource_manager: Grab latest upstream. | bunnei | 2018-04-14 | 1 | -30/+86 |
| | |||||
* | gl_shader_decompiler: Add skeleton code from Citra for shader analysis. | bunnei | 2018-04-14 | 2 | -44/+142 |
| | |||||
* | shader_bytecode: Add initial module for shader decoding. | bunnei | 2018-04-14 | 2 | -0/+298 |
| | |||||
* | Fix clang format issues | James Rowe | 2018-04-07 | 1 | -1/+1 |
| | |||||
* | GPU: Assert when finding a texture with a format type other than UNORM. | Subv | 2018-04-07 | 2 | -4/+16 |
| | |||||
* | GL: Set up the textures used for each draw call. | Subv | 2018-04-07 | 2 | -2/+39 |
| | | | | | Each Maxwell shader stage can have an arbitrary number of textures, but we're limited to a certain number in OpenGL. We try to only use the minimum amount of host textures by not keeping a 1:1 relation between guest texture ids and host texture ids, ie, guest texture id 8 can be host texture id 0 if it's the only texture used in the guest shader program. This mapping will have to be passed to the shader decompiler so it can rewrite the texture accesses. | ||||
* | GL: Bind the textures to the shaders used for drawing. | Subv | 2018-04-07 | 1 | -2/+11 |
| | |||||
* | GLCache: Specialize the MortonCopy function for the DXT1 texture format. | Subv | 2018-04-07 | 1 | -1/+15 |
| | | | | It will now use the UnswizzleTexture function instead of the MortonCopyPixels128, which doesn't seem to work for textures. | ||||
* | GLCache: Implemented GetTextureSurface. | Subv | 2018-04-07 | 1 | -3/+28 |
| | |||||
* | GLCache: Support uploading compressed textures to the GPU. | Subv | 2018-04-07 | 1 | -5/+17 |
| | | | | Compressed texture formats like DXT1, DXT2, DXT3, etc will use this to ease the load on the CPU. | ||||
* | GL: Remove remaining references to 3DS-specific pixel formats | Subv | 2018-04-07 | 1 | -83/+22 |
| | |||||
* | RasterizerCache: Remove 3DS-specific pixel formats. | Subv | 2018-04-07 | 2 | -71/+32 |
| | | | | We're only left with RGB8 and DXT1 for now. More will be added as they are needed. | ||||
* | GL: Create the sampler objects when starting up the GL rasterizer. | Subv | 2018-04-07 | 1 | -0/+6 |
| | |||||
* | GL: Ported the SamplerInfo struct from citra. | Subv | 2018-04-07 | 2 | -1/+59 |
| | |||||
* | GL: Rename PicaTexture to MaxwellTexture. | Subv | 2018-04-07 | 2 | -2/+2 |
| | |||||
* | GL: Added functions to convert Maxwell tex filters and wrap modes to OpenGL. | Subv | 2018-04-07 | 1 | -0/+23 |
| | |||||
* | Textures: Added a helper function to know if a texture is blocklinear or pitch. | Subv | 2018-04-07 | 1 | -0/+5 |
| | |||||
* | rasterizer_interface.h: Update from citra to yuzu | N00byKing | 2018-04-04 | 1 | -3/+3 |
| | |||||
* | gl_rasterizer_cache.cpp: Update from citra to yuzu | N00byKing | 2018-04-04 | 1 | -1/+1 |
| | |||||
* | gl_rasterizer_cache.h: Update from citra to yuzu | N00byKing | 2018-04-04 | 1 | -3/+3 |
| | |||||
* | renderer_opengl.h: Update from citra to yuzu | N00byKing | 2018-04-04 | 1 | -2/+2 |
| | |||||
* | GPU: Use the MacroInterpreter class to execute the GPU macros instead of HLEing them. | Subv | 2018-04-01 | 2 | -121/+13 |
| | |||||
* | GPU: Implemented a gpu macro interpreter. | Subv | 2018-04-01 | 5 | -0/+431 |
| | | | | | | The Ryujinx macro interpreter and envydis were used as reference. Macros are programs that are uploaded by the games during boot and can later be called by writing to their method id in a GPU command buffer. | ||||
* | renderer_opengl: Use better naming for DrawScreens and DrawSingleScreen. | bunnei | 2018-03-27 | 2 | -8/+8 |
| | |||||
* | gl_rasterizer: Move code to bind framebuffer surfaces before draw to its own function. | bunnei | 2018-03-27 | 2 | -22/+31 |
| | |||||
* | gl_rasterizer: Add a SyncViewport method. | bunnei | 2018-03-27 | 3 | -18/+30 |
| | |||||
* | gl_rasterizer: Move PrimitiveTopology check to MaxwellToGL. | bunnei | 2018-03-27 | 2 | -11/+12 |
| | |||||
* | graphics_surface: Fix merge conflicts. | bunnei | 2018-03-27 | 1 | -0/+1 |
| | |||||
* | gl_rasterizer: Use ReadBlock instead of GetPointer for SetupVertexArray. | bunnei | 2018-03-27 | 1 | -1/+1 |
| | |||||
* | gl_rasterizer: Normalize vertex array data as appropriate. | bunnei | 2018-03-27 | 2 | -1/+5 |
| | |||||
* | maxwel_to_gl: Fix string formatting in log statements. | bunnei | 2018-03-27 | 1 | -2/+2 |
| | |||||
* | rasterizer: Rename DrawTriangles to DrawArrays. | bunnei | 2018-03-27 | 3 | -5/+5 |
| | |||||
* | gl_rasterizer: Use passthrough shader for SetupVertexShader. | bunnei | 2018-03-27 | 1 | -1/+2 |
| | |||||
* | renderer_opengl: Logging, etc. cleanup. | bunnei | 2018-03-27 | 6 | -33/+34 |
| | |||||
* | renderer_opengl: Remove framebuffer RasterizerFlushVirtualRegion hack. | bunnei | 2018-03-27 | 1 | -5/+0 |
| | |||||
* | gl_rasterizer_cache: Implement UpdatePagesCachedCount. | bunnei | 2018-03-27 | 2 | -8/+37 |
| | |||||
* | gl_rasterizer: Implement SetupVertexArray. | bunnei | 2018-03-27 | 1 | -20/+38 |
| | |||||
* | gl_rasterizer_cache: Fix an ASSERT_MSG. | bunnei | 2018-03-27 | 1 | -1/+1 |
| | |||||
* | maxwell_to_gl: Add module and function for decoding VertexType. | bunnei | 2018-03-27 | 2 | -0/+41 |
| | |||||
* | maxwell_3d: Use names that match envytools for VertexType. | bunnei | 2018-03-27 | 1 | -8/+8 |
| | |||||
* | maxwell_3d: Add VertexAttribute struct and cleanup. | bunnei | 2018-03-27 | 1 | -121/+160 |
| | |||||
* | gl_rasterizer: Use 32 texture units instead of 3. | bunnei | 2018-03-27 | 3 | -2/+3 |
| | |||||
* | gl_rasterizer: Implement DrawTriangles. | bunnei | 2018-03-27 | 1 | -1/+194 |
| | |||||
* | Maxwell3D: Call AccelerateDrawBatch on DrawArrays. | bunnei | 2018-03-27 | 1 | -1/+8 |
| | |||||
* | gl_rasterizer: Implement AnalyzeVertexArray. | bunnei | 2018-03-27 | 2 | -1/+56 |
| | |||||
* | gl_rasterizer_cache: MortonCopy Switch-style. | bunnei | 2018-03-27 | 1 | -72/+32 |
| | |||||
* | gl_rasterizer_cache: Implement GetFramebufferSurfaces. | bunnei | 2018-03-27 | 2 | -4/+104 |
| | |||||
* | maxwell: Add RenderTargetFormat enum. | bunnei | 2018-03-27 | 2 | -4/+5 |
| | |||||
* | renderer_opengl: Only draw the screen if a framebuffer is specified. | bunnei | 2018-03-27 | 1 | -6/+7 |
| | |||||
* | GPU: Load the sampler info (TSC) when retrieving active textures. | Subv | 2018-03-26 | 2 | -21/+67 |
| | |||||
* | GPU: Added the TSC structure. It contains information about the sampler. | Subv | 2018-03-26 | 1 | -0/+50 |
| | |||||
* | GPU: Added more fields to the TIC structure. | Subv | 2018-03-26 | 1 | -4/+30 |
| | |||||
* | GPU: Make the debug_context variable a member of the frontend instead of a global. | Subv | 2018-03-25 | 3 | -15/+13 |
| | |||||
* | GPU: Added a function to retrieve the active textures for a shader stage. | Subv | 2018-03-24 | 2 | -50/+59 |
| | | | | TODO: A shader may not use all of these textures at the same time, shader analysis should be performed to determine which textures are actually sampled. | ||||
* | Frontend: Updated the surface view debug widget to work with Maxwell surfaces. | Subv | 2018-03-24 | 2 | -0/+15 |
| | |||||
* | GPU: Implement the Incoming/FinishedPrimitiveBatch debug breakpoints. | Subv | 2018-03-24 | 1 | -0/+7 |
| | |||||
* | GPU: Implement the MaxwellCommandLoaded/Processed debug breakpoints. | Subv | 2018-03-24 | 1 | -0/+10 |
| | |||||
* | Frontend: Ported the GPU breakpoints and surface viewer widgets from citra. | Subv | 2018-03-24 | 5 | -0/+242 |
| | |||||
* | GPU: Added a method to unswizzle a texture without decoding it. | Subv | 2018-03-24 | 4 | -5/+95 |
| | | | | Allow unswizzling of DXT1 textures. | ||||
* | GPU: Preliminary work for texture decoding. | Subv | 2018-03-24 | 5 | -0/+139 |
| | |||||
* | GPU: Added viewport registers to Maxwell3D's reg structure. | Subv | 2018-03-24 | 1 | -1/+18 |
| | |||||
* | gl_rasterizer: Fake render in green, because it's cooler. | bunnei | 2018-03-24 | 1 | -1/+1 |
| | |||||
* | gl_rasterizer: Log warning instead of sync'ing unimplemented funcs. | bunnei | 2018-03-24 | 1 | -7/+1 |
| | |||||
* | gl_rasterizer_cache: Add missing include for vm_manager. | bunnei | 2018-03-23 | 1 | -0/+1 |
| | |||||
* | renderer_opengl: Only invalidate the framebuffer region, not flush. | bunnei | 2018-03-23 | 1 | -4/+3 |
| | |||||
* | renderer_opengl: Fixes for properly flushing & rendering the framebuffer. | bunnei | 2018-03-23 | 1 | -6/+12 |
| | |||||
* | RasterizerCacheOpenGL: FlushAll should flush full memory region. | bunnei | 2018-03-23 | 1 | -1/+1 |
| | |||||
* | rasterizer: Flush and invalidate regions should be 64-bit. | bunnei | 2018-03-23 | 3 | -9/+9 |
| | |||||
* | renderer_opengl: Add framebuffer_transform_flags member variable. | bunnei | 2018-03-23 | 1 | -2/+2 |
| | |||||
* | renderer_opengl: Better handling of framebuffer transform flags. | bunnei | 2018-03-23 | 2 | -3/+20 |
| | |||||
* | renderer_opengl: Use accelerated framebuffer load with LoadFBToScreenInfo. | bunnei | 2018-03-23 | 1 | -31/+25 |
| | |||||
* | gl_rasterizer: Implement AccelerateDisplay method from Citra. | bunnei | 2018-03-23 | 2 | -2/+44 |
| | |||||
* | LoadGLBuffer: Use bytes_per_pixel, not bits. | bunnei | 2018-03-23 | 1 | -1/+2 |
| | |||||
* | gl_rasterizer_cache: LoadGLBuffer should do a morton copy. | bunnei | 2018-03-23 | 1 | -16/+5 |
| | |||||
* | video_core: Move MortonCopyPixels128 to utils header. | bunnei | 2018-03-23 | 2 | -111/+113 |
| | |||||
* | video_core: Remove usage of PAddr and replace with VAddr. | bunnei | 2018-03-23 | 5 | -39/+39 |
| | |||||
* | video_core: Move FramebufferInfo to FramebufferConfig in GPU. | bunnei | 2018-03-23 | 7 | -66/+74 |
| | |||||
* | gl_rasterizer: Replace a bunch of UNIMPLEMENTED with ASSERT. | bunnei | 2018-03-23 | 2 | -20/+20 |
| | |||||
* | gl_rasterizer: Add a simple passthrough shader in lieu of shader generation. | bunnei | 2018-03-23 | 2 | -5/+68 |
| | |||||
* | gpu: Expose Maxwell3D engine. | bunnei | 2018-03-23 | 1 | -0/+4 |
| | |||||
* | maxwell_3d: Add some format decodings and string helper functions. | bunnei | 2018-03-23 | 1 | -3/+107 |
| | |||||
* | renderer: Create rasterizer and cleanup. | bunnei | 2018-03-23 | 4 | -4/+16 |
| | |||||
* | GPU: Added vertex attribute format registers. | Subv | 2018-03-21 | 1 | -1/+14 |
| | |||||
* | GPU: Added registers for the number of vertices to render. | Subv | 2018-03-21 | 1 | -2/+13 |
| | |||||
* | Merge pull request #254 from bunnei/port-citra-renderer | bunnei | 2018-03-21 | 18 | -101/+2905 |
|\ | | | | | Port Citra OpenGL rasterizer code | ||||
| * | renderer_gl: Port boilerplate rasterizer code over from Citra. | bunnei | 2018-03-20 | 5 | -1/+495 |
| | | |||||
| * | gl_shader_util: Sync latest version with Citra. | bunnei | 2018-03-20 | 3 | -46/+116 |
| | | |||||
| * | renderer_gl: Port over gl_shader_gen module from Citra. | bunnei | 2018-03-20 | 3 | -0/+88 |
| | | |||||
| * | renderer_gl: Port over gl_shader_decompiler module from Citra. | bunnei | 2018-03-20 | 3 | -0/+87 |
| | | |||||
| * | renderer_gl: Port over gl_rasterizer_cache module from Citra. | bunnei | 2018-03-20 | 3 | -0/+1714 |
| | | |||||
| * | gl_resource_manager: Sync latest version with Citra. | bunnei | 2018-03-20 | 1 | -8/+77 |
| | | |||||
| * | renderer_gl: Port over gl_stream_buffer module from Citra. | bunnei | 2018-03-20 | 3 | -0/+218 |
| | | |||||
| * | gl_state: Sync latest version with Citra. | bunnei | 2018-03-20 | 2 | -47/+111 |
| | | |||||
* | | Merge pull request #253 from Subv/rt_depth | Mat M | 2018-03-20 | 1 | -1/+48 |
|\ \ | |/ |/| | GPU: Added registers for color and Z buffers. | ||||
| * | GPU: Added Z buffer registers to Maxwell3D's reg structure. | Subv | 2018-03-19 | 1 | -1/+17 |
| | | |||||
| * | GPU: Added the render target (RT) registers to Maxwell3D's reg structure. | Subv | 2018-03-19 | 1 | -1/+32 |
| | | |||||
* | | Clang Fixes | N00byKing | 2018-03-19 | 1 | -1/+2 |
| | | |||||
* | | Clean Warnings (?) | N00byKing | 2018-03-19 | 1 | -1/+1 |
|/ | |||||
* | GPU: Added the TSC registers to the Maxwell3D register structure. | Subv | 2018-03-19 | 1 | -1/+15 |
| | |||||
* | GPU: Added the TIC registers to the Maxwell3D register structure. | Subv | 2018-03-19 | 1 | -1/+16 |
| | |||||
* | GPU: Implement macro 0xE1A BindTextureInfoBuffer in HLE. | Subv | 2018-03-19 | 2 | -1/+29 |
| | | | | This macro simply sets the current CB_ADDRESS to the texture buffer address for the input shader stage. | ||||
* | GPU: Implement the BindStorageBuffer macro method in HLE. | Subv | 2018-03-18 | 2 | -1/+36 |
| | | | | | | This macro binds the SSBO Info Buffer as the current ConstBuffer. This buffer is usually bound to c0 during shader execution. Games seem to use this macro instead of directly writing the address for some reason. | ||||
* | GPU: Handle writes to the CB_DATA method. | Subv | 2018-03-18 | 2 | -0/+39 |
| | | | | | | Writing to this method will cause the written value to be stored in the currently-set ConstBuffer plus CB_POS. This method is usually used to upload uniforms or other shader-visible data. | ||||
* | GPU: Move the GPU's class constructor and destructors to a cpp file. | Subv | 2018-03-18 | 3 | -10/+30 |
| | | | | This should reduce recompile times when editing the Maxwell3D register structure. | ||||
* | GPU: Store uploaded GPU macros and keep track of the number of method parameters. | Subv | 2018-03-18 | 4 | -27/+74 |
| | |||||
* | GPU: Macros are specific to the Maxwell3D engine, so handle them internally. | Subv | 2018-03-18 | 8 | -63/+55 |
| | |||||
* | GPU: Renamed ShaderType to ShaderStage as that is less confusing. | Subv | 2018-03-18 | 2 | -19/+19 |
| | |||||
* | GPU: Store shader constbuffer bindings in the GPU state. | Subv | 2018-03-18 | 2 | -5/+61 |
| | |||||
* | GPU: Corrected some register offsets and removed superfluous macro registers. | Subv | 2018-03-18 | 1 | -9/+3 |
| | |||||
* | GPU: Make the SetShader macro call do the same as the real macro's code. | Subv | 2018-03-18 | 2 | -3/+44 |
| | | | | | | It'll now set the CB_SIZE, CB_ADDRESS and CB_BIND registers when it's called. Presumably this SetShader function is binding the constant shader uniforms to buffer 1 (c1[]). | ||||
* | GPU: Corrected the parameter documentation for the SetShader macro call. | Subv | 2018-03-17 | 2 | -11/+12 |
| | | | | | | Register 0xE24 is actually a macro that sets some shader parameters in the register structure. Macros are uploaded to the GPU at startup and have their own ISA, we'll probably write an interpreter for this in the future. | ||||
* | Merge pull request #242 from Subv/set_shader | bunnei | 2018-03-17 | 2 | -4/+38 |
|\ | | | | | GPU: Handle the SetShader method call (0xE24) and store the shader config. | ||||
| * | GPU: Handle the SetShader method call (0xE24) and store the shader config. | Subv | 2018-03-17 | 2 | -4/+38 |
| | | |||||
* | | GPU: Added the vertex array registers. | Subv | 2018-03-17 | 1 | -2/+33 |
|/ | |||||
* | Merge pull request #241 from Subv/gpu_method_call | bunnei | 2018-03-17 | 9 | -8/+97 |
|\ | | | | | GPU: Process command mode 5 (IncreaseOnce) differently from other commands | ||||
| * | GPU: Process command mode 5 (IncreaseOnce) differently from other commands. | Subv | 2018-03-17 | 9 | -8/+97 |
| | | | | | | | | | | | | Accumulate all arguments before calling the desired method. Note: Maybe we should do the same for the NonIncreasing mode? | ||||
* | | GPU: Assert that we get a 0 CODE_ADDRESS register in the 3D engine. | Subv | 2018-03-17 | 1 | -0/+8 |
| | | | | | | | | Shader address calculation depends on this value to some extent, we do not currently know what it being 0 entails. | ||||
* | | GPU: Added Maxwell registers for Shader Program control. | Subv | 2018-03-17 | 1 | -2/+55 |
|/ | |||||
* | GPU: Intercept writes to the VERTEX_END_GL register. | Subv | 2018-03-05 | 2 | -1/+18 |
| | | | | | | This is the register that gets written after a game calls DrawArrays(). We should collect all GPU state and draw using our graphics API here. | ||||
* | maxwell_3d: Make constructor explicit | Lioncash | 2018-02-14 | 1 | -1/+1 |
| | |||||
* | Merge pull request #187 from Subv/maxwell3d_query | bunnei | 2018-02-14 | 3 | -3/+95 |
|\ | | | | | GPU: Partially implemented the QUERY_* registers in the Maxwell3D engine. | ||||
| * | GPU: Partially implemented the QUERY_* registers in the Maxwell3D engine. | Subv | 2018-02-12 | 3 | -3/+95 |
| | | | | | | | | Only QueryMode::Write is supported at the moment. | ||||
* | | Merge pull request #178 from Subv/command_buffers | bunnei | 2018-02-12 | 12 | -0/+488 |
|\| | | | | | GPU: Added a command processor to decode the GPU pushbuffers and forward the commands to their respective engines | ||||
| * | Make a GPU class in VideoCore to contain the GPU state. | Subv | 2018-02-12 | 12 | -44/+252 |
| | | | | | | | | Also moved the GPU MemoryManager class to video_core since it makes more sense for it to be there. |