iOS-调戏CoreML-这是花瓶？

作者: YI_LIN | 来源:发表于2017-06-07 17:14 被阅读4115次

iOS-调戏CoreML-这是花瓶？
iOS-私有API与runtime
iOS-代码混淆加固策略
iOS-性能优化深入探究
iOS-自动打包及分发（三）
iOS-自动打包及分发（二）
iOS-底层原理 06：malloc 源码分析思路
iOS-导航头像缩放，支持点击回调
iOS-取消延迟执行的方法cancelPreviousPerfo
iOS- UI事件传递& 响应

CoreML 是 Apple 在 WWDC 2017 推出的机器学习框架。但是其到底有什么功能呢，能不能识别花瓶，看看就知道了。

原文发表在个人博客iOS-CoreML-初探，转载请注明出处。

模型

在 CoreML 中， Apple 定义了一套自己的模型格式，后缀名为： mimodel，通过 CoreML 框架，以及模型库，可以在 App 层面进行机器学习的功能研发。

CoreML

官网已经提供四个模型库供下载。

iOS-CoreML-Model

Demo

官网提供了一个 Demo，要求 XCode 9 + iOS 11 的环境。

下载下来 Run 了一下，不得不说，Apple 对开发者还是非常友好的，直接将模型文件拖到项目中，Xcode 会自动生成接口文件:

import CoreML

class MarsHabitatPricerInput : MLFeatureProvider {
    var solarPanels: Double
    var greenhouses: Double
    var size: Double
    
    var featureNames: Set<String> {
        get {
            return ["solarPanels", "greenhouses", "size"]
        }
    }
    
    func featureValue(for featureName: String) -> MLFeatureValue? {
        if (featureName == "solarPanels") {
            return MLFeatureValue(double: solarPanels)
        }
        if (featureName == "greenhouses") {
            return MLFeatureValue(double: greenhouses)
        }
        if (featureName == "size") {
            return MLFeatureValue(double: size)
        }
        return nil
    }
    
    init(solarPanels: Double, greenhouses: Double, size: Double) {
        self.solarPanels = solarPanels
        self.greenhouses = greenhouses
        self.size = size
    }
}

class MarsHabitatPricerOutput : MLFeatureProvider {
    let price: Double
    
    var featureNames: Set<String> {
        get {
            return ["price"]
        }
    }
    
    func featureValue(for featureName: String) -> MLFeatureValue? {
        if (featureName == "price") {
            return MLFeatureValue(double: price)
        }
        return nil
    }
    
    init(price: Double) {
        self.price = price
    }
}

@objc class MarsHabitatPricer:NSObject {
    var model: MLModel
    init(contentsOf url: URL) throws {
        self.model = try MLModel(contentsOf: url)
    }
    convenience override init() {
        let bundle = Bundle(for: MarsHabitatPricer.self)
        let assetPath = bundle.url(forResource: "MarsHabitatPricer", withExtension:"mlmodelc")
        try! self.init(contentsOf: assetPath!)
    }
    func prediction(input: MarsHabitatPricerInput) throws -> MarsHabitatPricerOutput {
        let outFeatures = try model.prediction(from: input)
        let result = MarsHabitatPricerOutput(price: outFeatures.featureValue(for: "price")!.doubleValue)
        return result
    }
    func prediction(solarPanels: Double, greenhouses: Double, size: Double) throws -> MarsHabitatPricerOutput {
        let input_ = MarsHabitatPricerInput(solarPanels: solarPanels, greenhouses: greenhouses, size: size)
        return try self.prediction(input: input_)
    }
}

可以看到，主要是定义了输入，输出以及预测的格式，调用的时候，也非常简单，传参即可。

但是这些接口文件并没有在 XCode 左边的文件树中出现。

查了一下，是生成在 DerivedData 目录下，估计是想开发者使用起来更简洁。

运行一下，可以看到，主要功能是对价格进行预测。

iOS-CoreML-Demo

貌似稍微有点不够高大上...

Resnet50

官网提供的四个模型库，我们还没用呢，当然要看下能用来干啥，看了一下，貌似主要是物体识别，OK，代码走起。

先下载模型库 Resnet50, 然后创建一个新的 Swift 项目，将其拖进去：

iOS-CoreML-Model-Resnet50

从描述里面可以看出来，其实一个神经网络的分类器，输入是一张像素为 (224 * 224) 的图片，输出为分类结果。

自动生成的接口文件：

import CoreML

class Resnet50Input : MLFeatureProvider {
    var image: CVPixelBuffer
    
    var featureNames: Set<String> {
        get {
            return ["image"]
        }
    }
    
    func featureValue(for featureName: String) -> MLFeatureValue? {
        if (featureName == "image") {
            return MLFeatureValue(pixelBuffer: image)
        }
        return nil
    }
    
    init(image: CVPixelBuffer) {
        self.image = image
    }
}

class Resnet50Output : MLFeatureProvider {
    let classLabelProbs: [String : Double]
    let classLabel: String
    
    var featureNames: Set<String> {
        get {
            return ["classLabelProbs", "classLabel"]
        }
    }
    
    func featureValue(for featureName: String) -> MLFeatureValue? {
        if (featureName == "classLabelProbs") {
            return try! MLFeatureValue(dictionary: classLabelProbs as [NSObject : NSNumber])
        }
        if (featureName == "classLabel") {
            return MLFeatureValue(string: classLabel)
        }
        return nil
    }
    
    init(classLabelProbs: [String : Double], classLabel: String) {
        self.classLabelProbs = classLabelProbs
        self.classLabel = classLabel
    }
}

@objc class Resnet50:NSObject {
    var model: MLModel
    init(contentsOf url: URL) throws {
        self.model = try MLModel(contentsOf: url)
    }
    convenience override init() {
        let bundle = Bundle(for: Resnet50.self)
        let assetPath = bundle.url(forResource: "Resnet50", withExtension:"mlmodelc")
        try! self.init(contentsOf: assetPath!)
    }
    func prediction(input: Resnet50Input) throws -> Resnet50Output {
        let outFeatures = try model.prediction(from: input)
        let result = Resnet50Output(classLabelProbs: outFeatures.featureValue(for: "classLabelProbs")!.dictionaryValue as! [String : Double], classLabel: outFeatures.featureValue(for: "classLabel")!.stringValue)
        return result
    }
    func prediction(image: CVPixelBuffer) throws -> Resnet50Output {
        let input_ = Resnet50Input(image: image)
        return try self.prediction(input: input_)
    }
}

OK，要照片，而且是 CVPixelBuffer 类型的。

但是每次从相册选太烦了，所以我们直接摄像头走起。将 AVCam 的主要功能类复制到项目中。

iOS-CoreML-AVCam

然后，禁用 CameraViewController 中一些不必要的按钮：

self.recordButton.isHidden = true
self.captureModeControl.isHidden = true
self.livePhotoModeButton.isHidden = true
self.depthDataDeliveryButton.isHidden = true

由于，AVCapturePhotoCaptureDelegate 拍照完成的回调为：

 func photoOutput(_ output: AVCapturePhotoOutput, didFinishProcessingPhoto photo: AVCapturePhoto, error: Error?)

看了下 AVCaputrePhoto 的定义，里面刚好有 CVPixelBuffer 格式的属性：

iOS-CoreML-AVCam-PreviewPixelBuffer

直接传进去试试：

// Predicte
if let pixelBuffer = photo.previewPixelBuffer {
    guard let Resnet50CategoryOutput = try? model.prediction(image:pixelBuffer) else {
        fatalError("Unexpected runtime error.")
    }
}

一切看起来很完美，编译通过，运行起来，点一下拍照按钮，额，Crash了，异常：

[core] Error Domain=com.apple.CoreML Code=1 "Input image feature image does not match model description" UserInfo={NSLocalizedDescription=Input image feature image does not match model description, NSUnderlyingError=0x1c0643420 {Error Domain=com.apple.CoreML Code=1 "Image is not valid width 224, instead is 852" UserInfo={NSLocalizedDescription=Image is not valid width 224, instead is 852}}}

哦，忘记改大小了，找到 photoSetting，加上宽高:

if !photoSettings.availablePreviewPhotoPixelFormatTypes.isEmpty {
    photoSettings.previewPhotoFormat = [kCVPixelBufferPixelFormatTypeKey as String: photoSettings.availablePreviewPhotoPixelFormatTypes.first!,
        kCVPixelBufferWidthKey as String : NSNumber(value:224),
        kCVPixelBufferHeightKey as String : NSNumber(value:224)]
}

重新 Run，WTF，Man，居然又报同样的错，好吧，Google 一下，貌似宽高的属性，在 Swift 里面不生效，额。。

没办法，那我们只能将 CVPixelBuffer 先转换成 UIImage，然后改下大小，再转回 CVPixelBuffer，试试：

photoData = photo.fileDataRepresentation()
 
// Change Data to Image
guard let photoData = photoData else {
    return
}
let image = UIImage(data: photoData)
                
// Resize
let newWidth:CGFloat = 224.0
let newHeight:CGFloat = 224.0
UIGraphicsBeginImageContext(CGSize(width:newWidth, height:newHeight))
image?.draw(in:CGRect(x:0, y:0, width:newWidth, height:newHeight))
let newImage = UIGraphicsGetImageFromCurrentImageContext()
 UIGraphicsEndImageContext()
                
guard let finalImage = newImage else {
    return
}
                
// Predicte
guard let Resnet50CategoryOutput = try? model.prediction(image:pixelBufferFromImage(image: finalImage)) else {
    fatalError("Unexpected runtime error.")
}

重新 Run，OK，一切很完美。

最后，为了用户体验，加上摄像头流的暂停和重启，免得在识别的时候，摄像头还一直在动，另外，识别结果通过提醒框弹出来，具体参考文末的源码。

开始玩啦，找支油笔试一下：

iOS-CoreML-Pen

识别成，橡皮擦，好吧，其实是有点像。

再拿小绿植试试：

iOS-CoreML-FlowerPot

花瓶，Are you kidding me ??

其实，效果还是蛮不错的。

刚好下午要去上海 CES Asia，一路拍过去玩，想想都有点小激动。

最后，源码奉上，想玩的同学直接下载编译就行了，别忘了 Star~

看了又看：
深度学习是怎么识别人脸的?
300行代码实现手写汉字识别
 如何在一周内做一款拼音输入法
 iOS-签名机制和证书原理
 iOS-线程同步详解

iOS-调戏CoreML-这是花瓶？
CoreML 是 Apple 在 WWDC 2017 推出的机器学习框架。但是其到底有什么功能呢，能不能识别花瓶，...
iOS-私有API与runtime
iOS-私有API与runtime iOS-私有API与runtime
iOS-代码混淆加固策略
iOS-代码混淆加固策略 iOS-代码混淆加固策略
iOS-性能优化深入探究
iOS-性能优化深入探究 iOS-性能优化深入探究
iOS-自动打包及分发（三）
iOS-自动打包及分发（一）iOS-自动打包及分发（二）iOS-自动打包及分发（三）废话不多说了，上正文：一、...
iOS-自动打包及分发（二）
iOS-自动打包及分发（一）iOS-自动打包及分发（二）iOS-自动打包及分发（三）本篇介绍：自动打包及分发xc...
iOS-底层原理 06：malloc 源码分析思路
iOS-底层原理 06：malloc 源码分析在iOS-底层原理 02：alloc & init & new 源...
iOS-导航头像缩放，支持点击回调
iOS-导航头像缩放，支持点击回调 iOS-导航头像缩放，支持点击回调
iOS-取消延迟执行的方法cancelPreviousPerfo
前面写过一篇文章iOS-利用performSelector调用方法iOS-利用performSelector调用方...
iOS- UI事件传递& 响应
UI视图相关重要知识点传送带： iOS- UITableView重用机制 iOS- UITableView数据源...

网友评论

凉风起君子意如何:Undefined symbols for architecture arm64:
"__T0So22AVCapturePhotoSettingsC12AVFoundation01_abC16SwiftNativeTypesACWP", referenced from:
__T010TestCoreML20CameraViewControllerC12capturePhoto33_2A095F8A7348BA63FCD1F6911CB608E7LLySo8UIButtonCFyycfU_ in CameraViewController.o
"__T012AVFoundation39_AVCapturePhotoSettingsSwiftNativeTypesPAAE016availablePreviewc11PixelFormatG0Says6UInt32VGfg", referenced from:
__T010TestCoreML20CameraViewControllerC12capturePhoto33_2A095F8A7348BA63FCD1F6911CB608E7LLySo8UIButtonCFyycfU_ in CameraViewController.o
ld: symbol(s) not found for architecture arm64
clang: error: linker command failed with exit code 1 (use -v to see invocation)
为何啊，Stackoverflow查了也无法解决
凉风起君子意如何:xcode9+7p ios11.0.3
3c16f7ac4c0c:和楼主一样最后将CVPixelBuffer转成UIImage resize后再转回CVPixelBuffer，但是总感觉这样很耗时耗性能，找了好久没找到直接对captureData resize的方法，感觉好坑。
YI_LIN:我是赶着去参加CES，所以短平快地完成了，办法应该是有的，回去再弄一下~
再换一个好了吧:不知道是不是苹果只是拿出来个简易版本的DEMO，总感觉low的一批。
YI_LIN:那个demo确实low.....
26be94ce0b6a:手机怎么升级到iOS11的
再换一个好了吧:直接搜索beat版的描述文件，下载完就可以更新了
AncientMing:66666
YI_LIN:
大号鱼骨头:看来这是个大势，连苹果都加了相关的库
YI_LIN:是的，后面会有越来越多的开发包..
58cb860debdd:写的很好。
YI_LIN:谢谢~
Dragon_wen:请问楼主，这个model导入，出来的ipa有多大？
YI_LIN:@Dragon_wen debug版本的ipa139M，内存还没时间看，等回去跟一下~
Dragon_wen:另外app实际占用了手机多少多少内存
CrazySteven:666
YI_LIN:
四爷在此:挺好的，更通用的有caffe2，tensorflow，就是移动端的开发环境不好搭建，需要NDK啥的。
YI_LIN:CoreML还是非常 developer friendly 的
6ea55c14e552:好腻害呀！收徒弟不？？0基础
YI_LIN:哈哈，可以关注我的博客，一起学习交流~
非典型技术宅:Xcode 9 iOS 11.简直要人命
玺睿siri:@西城_d61e ok .非常感谢！
玺睿siri:在网上没有找到xcode9啊，现在还用不了
YI_LIN:@非典型技术宅现在还是beta版，好多bug~
小冰山口:楼主666啊
YI_LIN:@学豆谢谢