
examples/charRNN Crashes When Run #542

Open
ghost opened this issue Jan 30, 2023 · 5 comments

ghost commented Jan 30, 2023

Hi there, I tried out examples/charRNN and it crashes when run (using "go run ."). Here is the output and call stack:

2023/01/30 08:28:47 Going to predict now
panic: runtime error: index out of range [52] with length 52

goroutine 1 [running]:
gorgonia.org/tensor.SampleIndex({0x10387c040?, 0x14000682f00?})
        /Users/tcassidy/go/pkg/mod/gorgonia.org/tensor@v0.9.23/api_utils.go:107 +0x220
main.sample({0x10388b4c0?, 0x14000682f00?})
        /Users/tcassidy/github/gorgonia/examples/charRNN/util.go:15 +0x44
main.(*charRNN).predict(0x14000384000)
        /Users/tcassidy/github/gorgonia/examples/charRNN/model.go:379 +0x224
main.main()
        /Users/tcassidy/github/gorgonia/examples/charRNN/main.go:104 +0x3a4
exit status 2

The workaround appears to be to update the following (adding range limits to the for loops):

api_utils.go:

	case *Dense:
[SNIP]
		case reflect.Float64:
[SNIP]
			for i < len(data) {
[SNIP]
		case reflect.Float32:
[SNIP]
			for i < len(data) {

But then after doing that, there is a different crash:

2023/01/30 08:34:30 Going to predict now
panic: runtime error: index out of range [-1]

goroutine 1 [running]:
main.(*charRNN).predict(0x1400021a100)
        /Users/tcassidy/github/gorgonia/examples/charRNN/model.go:381 +0x678
main.main()
        /Users/tcassidy/github/gorgonia/examples/charRNN/main.go:104 +0x3a4
exit status 2

I notice this example is quite old. Is there a more up-to-date example of using RNNs in Gorgonia?

Thanks!


artheus commented Mar 23, 2023

I am getting the same error, running charRNN with gorgonia.org/tensor@v0.9.24


wzzhu commented Mar 31, 2023

The problem was caused by sampling from a log-probability distribution instead of a normal probability distribution. Since log probs are negative numbers, the cumulative sum during sampling stays negative and never exceeds a positive threshold in (0, 1), so the sampled index always runs out of bounds.

Updating the sample func in examples/charRNN/util.go to convert the log probs back to normal probs suppresses this error.


func sample(val Value) int {
	var t tensor.Tensor
	var ok bool
	if t, ok = val.(tensor.Tensor); !ok {
		panic("Expects a tensor")
	}

	// It is logprob, convert it back to prob
	t2, err := tensor.Exp(t, tensor.AsSameType())
	if err != nil {
		panic("Error converting to prob")
	}

	return tensor.SampleIndex(t2)
}

However, there are other errors that will crash.

In addition, Gorgonia has no real capability of doing RNNs, as it uses a static graph for computation.
It doesn't have dynamic unrolling to support BPTT (backpropagation through time). Therefore the charRNN example behaves just like a feed-forward network: it cannot share the LSTM internal weights the way the Python counterpart in Karpathy's example at https://gist.github.com/karpathy/d4dee566867f8291f086 does, so it cannot capture the characteristics of the input sequence.

To really support LSTMs or other RNNs, a new RNN Op capable of doing BPTT may need to be created.

@chirmicci

Hi @wzzhu, can you suggest a library for RNNs (LSTM, GRU), please?


wzzhu commented May 11, 2023

Hi @wzzhu, can you suggest a library for RNNs (LSTM, GRU), please?

Let me correct one wrong statement above: "In addition, gorgonia has no real capability of doing RNN as it is using a static graph for computation." If we don't follow the charRNN implementation (which doesn't share weights), we can implement an RNN directly, using shared weight nodes to construct an unrolled graph for a fixed sequence length. So Gorgonia can still do RNNs, on the condition that we construct the computation graph with shared weight nodes and loop over each input in the sequence.

Therefore we can still use Gorgonia for RNNs, but it needs a fair amount of manual implementation.

e.g.,

g := gorgonia.NewGraph()

// the input sequence and its targets, one row per time step
x := gorgonia.NewMatrix(g, tensor.Float64, gorgonia.WithShape(seqLen, inputSize), gorgonia.WithName("x"))
yTrue := gorgonia.NewMatrix(g, tensor.Float64, gorgonia.WithShape(seqLen, outputSize), gorgonia.WithName("yTrue"))

// the input-to-hidden weights (shared across every time step)
Wxh := gorgonia.NewMatrix(g, tensor.Float64, gorgonia.WithShape(inputSize, hiddenSize), gorgonia.WithName("Wxh"), gorgonia.WithInit(gorgonia.GlorotN(1.0)))
// the hidden-to-hidden weights
Whh := gorgonia.NewMatrix(g, tensor.Float64, gorgonia.WithShape(hiddenSize, hiddenSize), gorgonia.WithName("Whh"), gorgonia.WithInit(gorgonia.GlorotN(1.0)))
// the hidden-to-output weights
Why := gorgonia.NewMatrix(g, tensor.Float64, gorgonia.WithShape(hiddenSize, outputSize), gorgonia.WithName("Why"), gorgonia.WithInit(gorgonia.GlorotN(1.0)))
// the hidden biases
bh := gorgonia.NewVector(g, tensor.Float64, gorgonia.WithShape(hiddenSize), gorgonia.WithName("bh"), gorgonia.WithInit(gorgonia.Zeroes()))
// the output biases
by := gorgonia.NewVector(g, tensor.Float64, gorgonia.WithShape(outputSize), gorgonia.WithName("by"), gorgonia.WithInit(gorgonia.Zeroes()))

// the initial hidden state
hPrev := gorgonia.NewVector(g, tensor.Float64, gorgonia.WithShape(hiddenSize), gorgonia.WithName("h0"), gorgonia.WithInit(gorgonia.Zeroes()))

// Define the forward pass: unroll the network over the sequence, reusing the
// same weight nodes at every step. This reuse is what gives us weight sharing.
var loss *gorgonia.Node
for i := 0; i < seqLen; i++ {
    xi := gorgonia.Must(gorgonia.Slice(x, gorgonia.S(i)))     // i-th input row
    yi := gorgonia.Must(gorgonia.Slice(yTrue, gorgonia.S(i))) // i-th target row

    // Compute the hidden activations: h = tanh(xi*Wxh + hPrev*Whh + bh)
    hI := gorgonia.Must(gorgonia.Mul(xi, Wxh))
    hH := gorgonia.Must(gorgonia.Mul(hPrev, Whh))
    h := gorgonia.Must(gorgonia.Tanh(gorgonia.Must(gorgonia.Add(gorgonia.Must(gorgonia.Add(hI, hH)), bh))))

    // Compute the output activations: out = sigmoid(h*Why + by)
    out := gorgonia.Must(gorgonia.Sigmoid(gorgonia.Must(gorgonia.Add(gorgonia.Must(gorgonia.Mul(h, Why)), by))))

    // Accumulate the mean squared error over the sequence
    stepLoss := gorgonia.Must(gorgonia.Mean(gorgonia.Must(gorgonia.Square(gorgonia.Must(gorgonia.Sub(out, yi))))))
    if loss == nil {
        loss = stepLoss
    } else {
        loss = gorgonia.Must(gorgonia.Add(loss, stepLoss))
    }

    hPrev = h
}

params := gorgonia.Nodes{Wxh, Whh, Why, bh, by}
if _, err := gorgonia.Grad(loss, params...); err != nil {
    log.Fatal(err)
}

solver := gorgonia.NewRMSPropSolver(gorgonia.WithLearnRate(0.001))
vm := gorgonia.NewTapeMachine(g, gorgonia.BindDualValues(params...))
defer vm.Close()

for i := 0; i < numEpochs; i++ {
    // Feed this epoch's input and target tensors (values elided)
    gorgonia.Let(x, ...)
    gorgonia.Let(yTrue, ...)
    // Perform the forward pass and compute the loss and gradients
    if err := vm.RunAll(); err != nil {
        log.Fatal(err)
    }
    // Back prop: apply the gradients to the shared weights
    if err := solver.Step(gorgonia.NodesToValueGrads(params)); err != nil {
        log.Fatal(err)
    }
    vm.Reset()
}

@chirmicci

@wzzhu, sorry that I ask so many questions, but where can I find an example of how to build an NER model in Gorgonia?
Or maybe you have already done something like that?
