| Commit message (Collapse) | Author | Age | Files | Lines |
| |
|
| |
|
| |
|
|
|
|
| |
Using this smaller type saves 1536 bytes in the compiled executable.
|
|
|
|
| |
Follow-up to 99ceb03a1cfcf35968cab589ea188a8c406cda52
|
|
|
|
|
| |
This formats all copyright comments according to SPDX formatting guidelines.
Additionally, this resolves the remaining GPLv2 only licensed files by relicensing them to GPLv2.0-or-later.
|
|\
| |
| | |
codec: Fix GPU decoder detection on Windows and a memory leak
|
| | |
|
| |
| |
| |
| |
| |
| | |
It was reported that this method causes crashes on certain Linux decoding backends, hence the check to avoid it.
This subsequently caused Windows GPU decoders to never be selected and always fall back to CPU decoding, disable the check on Windows for now.
|
|/ |
|
|
|
|
|
|
| |
src/video_core/command_classes/codecs/codec.cpp:177:16: error: assigning to 'AVCodec *' from 'const AVCodec *' discards qualifiers
av_codec = avcodec_find_decoder(codec);
^~~~~~~~~~~~~~~~~~~~~~~~~~~
|
| |
|
| |
|
| |
|
| |
|
|
|
|
|
| |
... this would resolve some edge-cases where multiple devices are
present and ffmpeg is unable to auto-supply the hw surfaces
|
|
|
|
|
| |
* skip impersonated VAAPI implementaions ("imposter detection")
* place VAAPI priority below CUDA/NVDEC/CUVID
|
| |
|
| |
|
| |
|
|
|
|
|
|
|
|
|
|
| |
* when someone installed Intel video drivers on an AMD system, the
decoder will select the Intel VA-API decoding driver and yuzu will
crash due to incorrect driver selection; the fix will check if the
currently about-to-use driver is loaded in the kernel
* when using NVIDIA driver on Linux with a ffmpeg that does not have
CUDA capability enabled, the decoder will crash; the fix simply
making the decoder prefers the VDPAU driver over CUDA on Linux
|
|
|
|
| |
These functions were composing the entire frame, not just the headers. Rename to more accurately describe them.
|
|
|
|
| |
Enables frame decoding with FFmpeg
|
| |
|
|\
| |
| | |
vic: Use the minimum of surface/frame dimensions when writing the final frame to the GPU
|
| |
| |
| |
| | |
Addresses possible buffer overflow behavior.
|
|\ \
| |/
|/| |
h264: Use max allowed max_num_ref_frames when using CPU decoding
|
| | |
|
| |
| |
| |
| |
| | |
Touhou Genso Wanderer Lotus Labyrinth R decodes 1920x1080 videos into 1920x1088 surface.
Only allow mismatch for height, since larger width would result in increasingly offset rows and somewhat defeat entire purpose of this check.
|
| |
| |
| |
| | |
This is a work around to avoid buffer overflow errors until multi channel/multi stream decoding is supported.
|
| | |
|
|/ |
|
| |
|
|\
| |
| | |
nvdec: Add GPU video decoding for all capable drivers and platforms
|
| |
| |
| |
| | |
GPU decoding seems to be more picky when it comes to the maximum number of reference frames.
|
| |
| |
| |
| |
| |
| | |
Some system configurations may see visual regressions or lower performance using GPU decoding compared to CPU decoding. This setting provides the option for users to specify their decoding preference.
Co-Authored-By: yzct12345 <87620833+yzct12345@users.noreply.github.com>
|
| | |
|
| | |
|
| | |
|
| |
| |
| |
| | |
Supplements the VAAPI intel gpu decoder by implementing the D3D11VA decoder for Windows, and CUVID/VDPAU for Nvidia and AMD on drivers linux respectively.
|
| | |
|
| | |
|
|\ \
| |/
|/| |
vic: Specify sws_scale height stride.
|
| |
| |
| |
| | |
Silences a sws_scale runtime warning about unaligned strides.
|
|/
|
|
| |
Silences a runtime error due to the first frame missing the frame data, and being set to hidden despite being a key-frame.
|
| |
|
|\
| |
| | |
nvdec: Fix VP9 reference frame refreshes
|
| |
| |
| |
| |
| | |
With reference frames refreshes fix, we no longer need to buffer two frames in advance.
We can also remove other unused or otherwise unneeded variables.
|
| |
| |
| |
| | |
This resolves the artifacting when decoding VP9 streams.
|
|/
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
* nvdec: VA-API
* Verify formatting
* Forgot a semicolon for Windows
* Clarify comment about AV_PIX_FMT_NV12
* Fix assert log spam from missing negation
* vic: Remove forgotten debug code
* Address lioncash's review
* Mention VA-API is Intel/AMD
* Address v1993's review
* Hopefully fix CMakeLists style this time
* vic: Improve cache locality
* vic: Fix off-by-one error
* codec: Async
* codec: Forgot the GetValue()
* nvdec: Address ameerj's review
* codec: Fallback to CPU without VA-API support
* cmake: Address lat9nq's review
* cmake: Make VA-API optional
* vaapi: Multiple GPU
* Apply suggestions from code review
Co-authored-by: Ameer J <52414509+ameerj@users.noreply.github.com>
* nvdec: Address ameerj's review
* codec: Use anonymous instead of static
* nvdec: Remove enum and fix memory leak
* nvdec: Address ameerj's review
* codec: Remove preparation for threading
Co-authored-by: Ameer J <52414509+ameerj@users.noreply.github.com>
|
|\
| |
| | |
nvdec: Fix Submit Ioctl data source, vic frame dimension computations
|
| |
| |
| |
| | |
Fixes out of bound memory crashes in Mario Golf
|
|\ \
| | |
| | | |
general: Enforce multiple warnings in MSVC
|
| | | |
|
|/ / |
|
|/ |
|
|
|
|
|
|
|
|
| |
Avoid sending null pointer to memcpy as reported by Undefined Behaviour
Sanitizer. Replaces the std::memcpy calls in SpliceVectors with
std::copy calls. Opting to replace all the memcpy's with copy's.
Co-authored-by: LC <mathew1800@gmail.com>
|
| |
|
| |
|
| |
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Reimplement the buffer cache using cached bindings and page level
granularity for modification tracking. This also drops the usage of
shared pointers and virtual functions from the cache.
- Bindings are cached, allowing to skip work when the game changes few
bits between draws.
- OpenGL Assembly shaders no longer copy when a region has been modified
from the GPU to emulate constant buffers, instead GL_EXT_memory_object
is used to alias sub-buffers within the same allocation.
- OpenGL Assembly shaders stream constant buffer data using
glProgramBufferParametersIuivNV, from NV_parameter_buffer_object. In
theory this should save one hash table resolve inside the driver
compared to glBufferSubData.
- A new OpenGL stream buffer is implemented based on fences for drivers
that are not Nvidia's proprietary, due to their low performance on
partial glBufferSubData calls synchronized with 3D rendering (that
some games use a lot).
- Most optimizations are shared between APIs now, allowing Vulkan to
cache more bindings than before, skipping unnecesarry work.
This commit adds the necessary infrastructure to use Vulkan object from
OpenGL. Overall, it improves performance and fixes some bugs present on
the old cache. There are still some edge cases hit by some games that
harm performance on some vendors, this are planned to be fixed in later
commits.
|
|
|
|
| |
Makes for less code that we need to maintain.
|
|
|
| |
Co-authored-by: LC <mathew1800@gmail.com>
|
| |
|
|
|
|
| |
laying the groundwork for async gpu, although this does not fully implement async nvdec operations
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
The current texture cache has several points that hurt maintainability
and performance. It's easy to break unrelated parts of the cache
when doing minor changes. The cache can easily forget valuable
information about the cached textures by CPU writes or simply by its
normal usage.The current texture cache has several points that hurt
maintainability and performance. It's easy to break unrelated parts
of the cache when doing minor changes. The cache can easily forget
valuable information about the cached textures by CPU writes or simply
by its normal usage.
This commit aims to address those issues.
|
|
|
|
|
|
|
| |
fmt now automatically prints the numeric value of an enum class member
by default, so we don't need to use casts any more.
Reduces the line noise a bit.
|
|
|
|
|
|
|
| |
Migrates the video core code closer to enabling variable shadowing
warnings as errors.
This primarily sorts out shadowing occurrences within the Vulkan code.
|
|\
| |
| | |
video_core: Resolve more variable shadowing scenarios
|
| |
| |
| |
| |
| |
| | |
Resolves variable shadowing scenarios up to the end of the OpenGL code
to make it nicer to review. The rest will be resolved in a following
commit.
|
|/
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
This was only necessary for use with the
avcodec_decode_video2/avcoded_decode_audio4 APIs which are also
deprecated.
Given we use avcodec_send_packet/avcodec_receive_frame, this isn't
necessary, this is even indicated directly within the FFmpeg API changes
document here on 2017-09-26:
https://github.com/FFmpeg/FFmpeg/blob/master/doc/APIchanges#L410
This prevents our code from breaking whenever we update to a newer
version of FFmpeg in the future if they ever decide to fully remove this
API member.
|
|
|
|
|
| |
Removes the usage of moves that don't result in behavior different from
a copy, or otherwise would prevent copy elision from occurring.
|
|\
| |
| | |
nvdec: Queue and display all decoded frames, cleanup decoders
|
| |
| |
| |
| | |
Workaround for ZLA, which seems to decode and queue twice as many frames as it displays.
|
| |
| |
| |
| |
| |
| | |
remove some redundant moves, make deleter match naming guidelines.
Co-Authored-By: LC <712067+lioncash@users.noreply.github.com>
|
| | |
|
|/ |
|
|
|
|
|
| |
Prevents bugs from occurring where the results of a function are
accidentally discarded
|
|
|
|
|
|
| |
It's deprecated in the language to autogenerate these if the destructor
for a type is specified, so we can explicitly specify how we want these
to be generated.
|
|
|
|
|
| |
Prevents values from mistakenly being discarded in cases where it's a
bug to do so.
|
|
|
|
|
|
|
|
|
| |
The API of VP9 exposes a WasFrameHidden() function which accesses this
member. Given the constructor previously didn't initialize this member,
it's a potential vector for an uninitialized read.
Instead, we can initialize this to a deterministic value to prevent that
from occurring.
|
|
|
|
|
| |
These helper functions don't directly modify any member state and can be
hidden from view.
|
|\
| |
| | |
nvdec: Minor tidying up
|
| |
| |
| |
| | |
Enforces the type of the desired value in calling code.
|
| |
| |
| |
| | |
We can move the buffer here to avoid a heap reallocation
|
| | |
|
| | |
|
| |
| |
| |
| | |
The move will already occur without std::move.
|
| | |
|
| |
| |
| |
| | |
Prevents a few unnecessary inclusions.
|
|\ \
| | |
| | | |
sync_manager: Amend parameter order of calls to SyncptIncr constructor
|
| |/
| |
| |
| | |
Corrects some cases where the arguments would be incorrectly swapped.
|
|\ \
| | |
| | | |
video_core: Enforce -Werror=type-limits
|
| |/
| |
| |
| | |
Silences one warning and avoids introducing more in the future.
|
|/
|
|
| |
Silence three warnings and make them errors to avoid introducing more in the future.
|
|
This commit aims to implement the NVDEC (Nvidia Decoder) functionality, with video frame decoding being handled by the FFmpeg library.
The process begins with Ioctl commands being sent to the NVDEC and VIC (Video Image Composer) emulated devices. These allocate the necessary GPU buffers for the frame data, along with providing information on the incoming video data. A Submit command then signals the GPU to process and decode the frame data.
To decode the frame, the respective codec's header must be manually composed from the information provided by NVDEC, then sent with the raw frame data to the ffmpeg library.
Currently, H264 and VP9 are supported, with VP9 having some minor artifacting issues related mainly to the reference frame composition in its uncompressed header.
Async GPU is not properly implemented at the moment.
Co-Authored-By: David <25727384+ogniK5377@users.noreply.github.com>
|