ffmpeg从某站提取视频音频详解

Posted 2023-04-12 RobertLikeDurian

tags:

篇首语：本文由小常识网(cha138.com)小编为大家整理，主要介绍了ffmpeg从某站提取视频音频详解相关的知识，希望对你有一定的参考价值。

ffmpeg从某站提取视频、音频、详解

事件背景

准备链接

第一步安装下载

ffmpeg是开源软件，安装网址http://ffmpeg.org/download.html#build-windows
本人用的windows10系统

打开网址后随便你怎么下载都行，Git、或者直接下等等

按图片输入上述网址也一样

第二步解压安装

下载完成后，解压即可无需安装，但是需要配置path（像Java的JDK一样）

右键我的电脑，高级设置中找到环境变量，中修改path，添加程序文件的bin目录的环境变量
我将程序放在

D:\\Program Files\\ffmpeg\\ffmpeg-2021-01-27

文件夹下那么，配置的path就是

D:\\Program Files\\ffmpeg\\ffmpeg-2021-01-27\\bin

配置好后打开cmd
输入ffmpeg -version
显示如下即安装完成

第三步从小破站下载音频视频

我用Chrome浏览器打开，F12进入某个模式如下图

我们“简单分析”一下NetWork很容易发现本视频的媒体文件是
134142927-1-30080与134142927-1-30280两个
一个是视频的图像文件，一个是视频的音源文件，全部都是m4s为后缀的文件

那我们点击我们要下载的图像和音源文件，复制它链接的前部分内容，复制到.m4s就行，然后输入到地址栏进行文件下载
by the way：
可以看到30280与30080结尾的文件很多，我们随便下哪个都行，都一样

可以看到，下载完成的文件应该是相对来讲一大一小两个文件，本视频时长3分钟多，一个在60+MB大小，一个在4.5MB大小
根据常理我们判断，较大的文件是图像文件，较小的文件是音源文件

第四步ffmpeg命令

下载好的视频尽量放在一个特定的文件夹，方便操作，不然还得输入绝对路径
我放在了D：/Temp下

在地址栏输入cmd打开命令

ffmpeg -i 30080.m4s -i 30280.m4s -codec copy video.mp4

将文件合并为视频文件，（将视频文件放在前，音源文件放在后）

完成后视频已经可以打开了

可以看到能够正常播放

继续输入命令

ffmpeg -i video.mp4 -vn  -acodec libmp3lame -ac 2 -qscale:a 4 -ar 48000  audio.mp3

将video.mp4的音源提取成audio.mp3

可以看到能够正常播放

20221231日补充

补充第三步，发现直接访问m4s的链接会被返回404，不能直接下载了；经过一系列的搜索终于找到了解决方案就是通过写代码java的io流来实现

package com.testdownbili.demo1.bil;

import java.io.*;
import java.net.URL;
import java.net.URLConnection;

public class DownloadM4sApplet 


    public static void main(String[] args)
        // 第三步的m4s地址
        String videoUrl="https://xy49x86x255x16xy.mcdn.bilivideo.cn:4483/upgcxcode/64/99/517569964/517569964_x2-1-30077.m4s?e=ig8euxZM2rNcNbdlhoNvNC8BqJIzNbfqXBvEqxTEto8BTrNvN0GvT90Wxxxxxx";
        // 文件名称 建议和链接保持一致
        String name="517569964_x2-1-30077.m4s";

        downloadMovie(videoUrl,name);
    

    public static void downloadMovie(String BLUrl, String fileName) 
        InputStream inputStream = null;
        try 
            URL url = new URL(BLUrl);
            URLConnection urlConnection = url.openConnection();
            urlConnection.setRequestProperty("Referer", "https://www.bilibili.com/video/BV"); // 填需要爬取的bv号
            urlConnection.setRequestProperty("Sec-Fetch-Mode", "no-cors");
            urlConnection.setRequestProperty("User-Agent", "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/100.0.4896.127 Safari/537.36");
            urlConnection.setRequestProperty("User-Agent", "Mozilla/4.0 (compatible; MSIE 5.0; Windows NT; DigExt)");
            urlConnection.setConnectTimeout(10 * 1000);
            inputStream = urlConnection.getInputStream();
         catch (IOException e) 
            e.printStackTrace();
        
        //定义路径
        String path = "D:\\\\Temp\\\\biliDown\\\\" + fileName;
        File file = new File(path);
        int i = 1;
        try 
            BufferedInputStream bis = new BufferedInputStream(inputStream);
            BufferedOutputStream bos = new BufferedOutputStream(new FileOutputStream(file));
            byte[] bys = new byte[1024];
            int len = 0;
            while ((len = bis.read(bys)) != -1) 
                bos.write(bys, 0, len);
            
            bis.close();
            bos.close();
         catch (Exception e) 
            e.printStackTrace();

就下载成功了

下载成功后再执行上述步骤即可

ffmpeg-python 任意提取视频帧

▶ 环境准备

1、安装 FFmpeg

音/视频工具 FFmpeg 简易安装文档

2、安装 ffmpeg-python

pip3 install ffmpeg-python

3、【可选】安装 opencv-python

pip3 install opencv-python

4、【可选】安装 numpy

pip3 install numpy

▶ 视频帧提取

准备视频素材

抖音视频素材下载：https://anoyi.com/dy/top

基于视频帧数提取任意一帧

import ffmpeg
import numpy
import cv2
import sys
import random


def read_frame_as_jpeg(in_file, frame_num):
    """
    指定帧数读取任意帧
    """
    out, err = (
        ffmpeg.input(in_file)
              .filter(\'select\', \'gte(n,{})\'.format(frame_num))
              .output(\'pipe:\', vframes=1, format=\'image2\', vcodec=\'mjpeg\')
              .run(capture_stdout=True)
    )
    return out


def get_video_info(in_file):
    """
    获取视频基本信息
    """
    try:
        probe = ffmpeg.probe(in_file)
        video_stream = next((stream for stream in probe[\'streams\'] if stream[\'codec_type\'] == \'video\'), None)
        if video_stream is None:
            print(\'No video stream found\', file=sys.stderr)
            sys.exit(1)
        return video_stream
    except ffmpeg.Error as err:
        print(str(err.stderr, encoding=\'utf8\'))
        sys.exit(1)


if __name__ == \'__main__\':
    file_path = \'/Users/admin/Downloads/拜无忧.mp4\'
    video_info = get_video_info(file_path)
    total_frames = int(video_info[\'nb_frames\'])
    print(\'总帧数：\' + str(total_frames))
    random_frame = random.randint(1, total_frames)
    print(\'随机帧：\' + str(random_frame))
    out = read_frame_as_jpeg(file_path, random_frame)
    image_array = numpy.asarray(bytearray(out), dtype="uint8")
    image = cv2.imdecode(image_array, cv2.IMREAD_COLOR)
    cv2.imshow(\'frame\', image)
    cv2.waitKey()

基于时间提取任意一帧

import ffmpeg
import numpy
import cv2
import sys
import random


def read_frame_by_time(in_file, time):
    """
    指定时间节点读取任意帧
    """
    out, err = (
        ffmpeg.input(in_file, ss=time)
              .output(\'pipe:\', vframes=1, format=\'image2\', vcodec=\'mjpeg\')
              .run(capture_stdout=True)
    )
    return out


def get_video_info(in_file):
    """
    获取视频基本信息
    """
    try:
        probe = ffmpeg.probe(in_file)
        video_stream = next((stream for stream in probe[\'streams\'] if stream[\'codec_type\'] == \'video\'), None)
        if video_stream is None:
            print(\'No video stream found\', file=sys.stderr)
            sys.exit(1)
        return video_stream
    except ffmpeg.Error as err:
        print(str(err.stderr, encoding=\'utf8\'))
        sys.exit(1)

if __name__ == \'__main__\':
    file_path = \'/Users/admin/Downloads/拜无忧.mp4\'
    video_info = get_video_info(file_path)
    total_duration = video_info[\'duration\']
    print(\'总时间：\' + total_duration + \'s\')
    random_time = random.randint(1, int(float(total_duration)) - 1) + random.random()
    print(\'随机时间：\' + str(random_time) + \'s\')
    out = read_frame_by_time(file_path, random_time)
    image_array = numpy.asarray(bytearray(out), dtype="uint8")
    image = cv2.imdecode(image_array, cv2.IMREAD_COLOR)
    cv2.imshow(\'frame\', image)
    cv2.waitKey()

▶ 相关资料

https://github.com/kkroening/ffmpeg-python/tree/master/examples

本文由博客一文多发平台 OpenWrite 发布！

以上是关于ffmpeg从某站提取视频音频详解的主要内容，如果未能解决你的问题，请参考以下文章