Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Convert initializers to NumPower #356

Open
wants to merge 4 commits into
base: 3.0
Choose a base branch
from

Conversation

SkibidiProduction
Copy link

@SkibidiProduction SkibidiProduction commented Mar 25, 2025

Initializers have been converted to NumPower.

For He, Xavier, LeCun initializers, a division into two types was made. The first type is based on a truncated normal distribution, the second on a unified distribution.

The final distributions of the initializers are made in accordance with their analogs in the keras library.

A fork of the original NumPower is used (https://github.com/RubixML/numpower) due to the fact that there are difficulties with accepting changes and fixing bugs in the original extension.

@SkibidiProduction SkibidiProduction self-assigned this Mar 25, 2025
@SkibidiProduction SkibidiProduction changed the title #355 Convert initializers to NumPower Convert initializers to NumPower Mar 25, 2025
Copy link
Member

@andrewdalpino andrewdalpino left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hey @SkibidiProduction overall a pretty good start!

I really like the thoroughness of the unit tests but would encourage you to come up with some more descriptive names for the test methods instead of initializeTest1() initializeTest2() etc. I also like that you broke the initializers into Normal and Uniform versions. Finally, I like that you converted to more modern PHP syntax.

I'm concerned with strict types being unnecessary and slowing down our runtime. I'm also not keen on the idea of introducing additional exception classes without a compelling reason to do so. Also, I think we should stick to calling classes that provide API as "interfaces" and not introduce the "contract" term to mean the same thing as it has the potential to confuse people but provides no additional benefit. Lastly, I think we should avoid creating additional namespace hierarchies unless they are necessary.

@@ -0,0 +1,38 @@
<?php

declare(strict_types=1);
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this necessary? If not let's not use strict types as it can impact runtime performance.

Enabling strict types can improve code robustness and type safety, but it may also introduce performance overhead due to the additional type-checking at runtime.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We agreed to leave it at that


declare(strict_types=1);

namespace Rubix\ML\NeuralNet\Initializers\Base\Contracts;
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It would be fine to put this Abstract class directly in the Initializers namespace, without adding a Base or introducing the concept of a "Contract." Rubix ML uses the term "interface" instead of contract and I wouldn't want to confuse people unnecessarily by introducing another name for something that already exists. In addition, this class does not actually provide an interface or contract, it's just a base class.

I think it makes sense to put Exception classes in their own folder, however.

How about we have

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We agreed to leave it temporary

protected function validateInitParams(int $fanIn, int $fanOut) : void
{
if ($fanIn < 1) {
throw new InvalidFanInException(message: "Fan in cannot be less than 1, $fanIn given");
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

To me, this exception does not meet the threshold for having it's own class. I would roughly define that threshold as being ...

  1. Will the user want to catch this exception specifically? - IMO probably not
  2. Does the exception have complex logic? - It does not

@@ -0,0 +1,32 @@
<?php
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's colocate the interface with their implementations and avoid creating unnecessary/duplicate semantics around base classes and interfaces. The PHP language already provides ubiquitous semantics for these concepts.

* @throws InvalidFanInException Initializer parameter fanIn is less than 1
* @throws InvalidFanOutException Initializer parameter fanOut is less than 1
*/
protected function validateInitParams(int $fanIn, int $fanOut) : void
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This method name is necessarily generic for what it's actually doing. Since this method specifically validates the fan in and fan out perhaps a more accurate name would be validateFanInFanOut().

{
$this->validateInitParams(fanIn: $fanIn, fanOut: $fanOut);

$std = sqrt(2 / ($fanOut + $fanIn));
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For consistency, let's use the variable name $stdDev, I believe this is how we call it throughout the codebase.

@@ -0,0 +1,147 @@
<?php
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice thorough tests!

}

#[Test]
#[TestDox('The initializer object os created correctly')]
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Typo

#[TestDox('The initializer object is created correctly')]

#[Test]
#[TestDox('The result matrix has correct shape')]
#[DataProvider('initializeTest1DataProvider')]
public function initializeTest1(int $fanIn, int $fanOut) : void
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we make these test method names more descriptive?

Example initializeWithCorrectShape()

#[Test]
#[TestDox('An exception is thrown during initialization')]
#[DataProvider('initializeTest3DataProvider')]
public function initializeTest3(int $fanIn, int $fanOut) : void
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we make these test method names more descriptive?

Example initializeWithBadFanInFanOut()

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants